Statements (24)
| Predicate | Object |
|---|---|
| gptkbp:instanceOf | gptkb:software |
| gptkbp:availableOn | gptkb:PyPI |
| gptkbp:developedBy | gptkb:Hugging_Face |
| gptkbp:documentation | https://huggingface.co/docs/tokenizers |
| gptkbp:feature | normalization, customizable pipelines, post-processing, fast tokenization, pre-tokenization |
| gptkbp:firstReleased | 2019 |
| gptkbp:integratesWith | gptkb:Transformers_library |
| gptkbp:license | gptkb:Apache_License_2.0 |
| gptkbp:programmingLanguage | gptkb:Python, gptkb:Rust |
| gptkbp:repository | https://github.com/huggingface/tokenizers |
| gptkbp:supports | gptkb:WordPiece, gptkb:BPE, gptkb:SentencePiece, Unigram |
| gptkbp:usedFor | natural language processing, text tokenization |
| gptkbp:bfsParent | gptkb:BPE |
| gptkbp:bfsLayer | 7 |
| https://www.w3.org/2000/01/rdf-schema#label | Hugging Face tokenizers |
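
The features listed above (normalization, pre-tokenization, a subword model such as WordPiece, and post-processing) are typically chained into one pipeline. The following is a minimal standard-library sketch of that idea, not the Hugging Face tokenizers API: the function names (`normalize`, `pre_tokenize`, `wordpiece`, `post_process`, `tokenize`), the toy vocabulary `VOCAB`, and the `[CLS]`/`[SEP]`/`[UNK]` markers are all assumptions chosen for illustration.

```python
import re
import unicodedata


def normalize(text):
    """Normalization: NFD-decompose, drop combining marks, lowercase."""
    decomposed = unicodedata.normalize("NFD", text)
    stripped = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    return stripped.lower()


def pre_tokenize(text):
    """Pre-tokenization: split into word runs and single punctuation marks."""
    return re.findall(r"\w+|[^\w\s]", text)


def wordpiece(word, vocab, unk="[UNK]"):
    """Greedy longest-match-first subword split in the WordPiece style.

    Non-initial pieces carry a '##' prefix. If any part of the word cannot
    be matched, the whole word maps to the unknown token.
    """
    pieces = []
    start = 0
    while start < len(word):
        end = len(word)
        match = None
        while end > start:
            candidate = word[start:end]
            if start > 0:
                candidate = "##" + candidate
            if candidate in vocab:
                match = candidate
                break
            end -= 1
        if match is None:
            return [unk]
        pieces.append(match)
        start = end
    return pieces


def post_process(tokens):
    """Post-processing: wrap the sequence in (assumed) special tokens."""
    return ["[CLS]"] + tokens + ["[SEP]"]


def tokenize(text, vocab):
    """Run the four stages in order."""
    tokens = []
    for word in pre_tokenize(normalize(text)):
        tokens.extend(wordpiece(word, vocab))
    return post_process(tokens)


# Hypothetical toy vocabulary, for illustration only.
VOCAB = {"token", "##izer", "##s", "are", "fast", "cafe", "!"}

if __name__ == "__main__":
    print(tokenize("Tokenizers are fast!", VOCAB))
```

In the actual library, each stage is a configurable component and the subword vocabulary is learned from a corpus rather than written by hand; see the documentation linked in the table for the real interfaces.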