|
gptkbp:instanceOf
|
gptkb:software
|
|
gptkbp:describes
|
A Python library for loading, processing, and sharing datasets for machine learning, including NLP, audio, and computer vision.
|
|
gptkbp:developedBy
|
gptkb:Hugging_Face
|
|
gptkbp:documentation
|
https://huggingface.co/docs/datasets
|
|
gptkbp:feature
|
data caching
dataset versioning
streaming large datasets
data processing pipelines
community dataset sharing
data splits
dataset loading
|
|
gptkbp:firstReleased
|
2020
|
|
gptkbp:integratesWith
|
gptkb:TensorFlow
gptkb:Hugging_Face_Hub
gptkb:JAX
gptkb:PyTorch
|
|
gptkbp:license
|
gptkb:Apache_License_2.0
|
|
gptkbp:pypiPackage
|
datasets
|
|
gptkbp:programmingLanguage
|
gptkb:Python
|
|
gptkbp:repository
|
https://github.com/huggingface/datasets
|
|
gptkbp:supports
|
audio datasets
NLP datasets
computer vision datasets
|
|
gptkbp:usedFor
|
gptkb:machine_learning
data analysis
natural language processing
|
|
gptkbp:bfsParent
|
gptkb:Emotion_Dataset
gptkb:C4_(Colossal_Clean_Crawled_Corpus)
gptkb:Unitxt
gptkb:GSM8K
gptkb:Multi50
|
|
gptkbp:bfsLayer
|
8
|
|
https://www.w3.org/2000/01/rdf-schema#label
|
Hugging Face Datasets
|