Hugging Face Datasets library

GPTKB entity

Statements (50)
Predicate Object
gptkbp:instanceOf gptkb:software
gptkbp:category gptkb:artificial_intelligence
gptkb:machine_learning
gptkb:software
data science
data management
gptkbp:compatibleWith gptkb:TensorFlow
gptkb:Hugging_Face_Hub
gptkb:JAX
gptkb:PyTorch
gptkb:Hugging_Face_Transformers
Hugging Face Tokenizers
gptkbp:developedBy gptkb:Hugging_Face
gptkbp:feature gptkb:transformation
data processing
data loading
data filtering
data splitting
integration with PyTorch
dataset versioning
data shuffling
integration with Hugging Face Hub
integration with JAX
integration with TensorFlow
streaming large datasets
community dataset sharing
dataset caching
dataset card documentation
dataset map and batch operations
dataset metrics
easy dataset download
support for Arrow format
support for Parquet format
gptkbp:firstReleased 2020
https://www.w3.org/2000/01/rdf-schema#label Hugging Face Datasets library
gptkbp:license gptkb:Apache_License_2.0
gptkbp:officialWebsite https://huggingface.co/docs/datasets
gptkbp:programmingLanguage gptkb:Python
gptkbp:repository https://github.com/huggingface/datasets
gptkbp:supports audio datasets
NLP datasets
computer vision datasets
gptkbp:usedFor gptkb:machine_learning
data analysis
natural language processing
data preprocessing
model evaluation
model training
gptkbp:bfsParent gptkb:Hugging_Face_Model_Hub
gptkbp:bfsLayer 7