Statements (31)
Predicate | Object |
---|---|
gptkbp:instanceOf | Text representation model |
gptkbp:alternativeName | BoW |
gptkbp:application | Topic modeling, Document classification, Sentiment analysis, Spam detection |
gptkbp:featureType | Word frequency, Word occurrence |
gptkbp:firstDescribed | 1950s |
https://www.w3.org/2000/01/rdf-schema#label | Bag of Words Model |
gptkbp:ignores | gptkb:syntax, Word order |
gptkbp:implementedIn | gptkb:NLTK, gptkb:spaCy, gptkb:scikit-learn |
gptkbp:input | Text document |
gptkbp:limitation | High dimensionality, Ignores semantics, Loss of context |
gptkbp:originatedIn | Information retrieval research |
gptkbp:output | gptkb:Vector, Sparse matrix |
gptkbp:relatedTo | gptkb:TF-IDF, N-gram model, Word Embedding |
gptkbp:represents | Text as unordered collection of words |
gptkbp:usedIn | gptkb:museum, gptkb:Natural_Language_Processing, gptkb:Text_Mining |
gptkbp:bfsParent | gptkb:Unigram_Language_Model |
gptkbp:bfsLayer | 8 |
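
The statements above describe a model that takes a text document as input, ignores word order, and outputs a vector of word frequencies or word occurrences. A minimal sketch of that idea follows, using only the Python standard library. The function names (`tokenize`, `build_vocabulary`, `bag_of_words`), the regex tokenizer, and the sample documents are illustrative assumptions, not part of any listed library.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and extract runs of letters/apostrophes.

    This tokenizer is a simplifying assumption; real pipelines vary.
    """
    return re.findall(r"[a-z']+", text.lower())


def build_vocabulary(documents):
    """Map each distinct token to a column index, in sorted order."""
    tokens = {tok for doc in documents for tok in tokenize(doc)}
    return {tok: i for i, tok in enumerate(sorted(tokens))}


def bag_of_words(text, vocabulary, binary=False):
    """Return a word-frequency vector, or a 0/1 word-occurrence
    vector when binary=True. Tokens outside the vocabulary are skipped."""
    counts = Counter(tokenize(text))
    vector = [0] * len(vocabulary)
    for tok, n in counts.items():
        if tok in vocabulary:
            vector[vocabulary[tok]] = 1 if binary else n
    return vector


# Hypothetical sample documents: the first two differ only in word order.
docs = ["the dog bit the man", "the man bit the dog", "spam spam offer"]
vocab = build_vocabulary(docs)
vectors = [bag_of_words(d, vocab) for d in docs]
```

Here `vectors[0]` and `vectors[1]` are identical even though the two sentences mean different things, which illustrates the "Word order" and "Loss of context" entries. Each vector has one slot per vocabulary word and is mostly zeros for short documents, which is why larger corpora are usually stored as sparse matrices. For real workloads, the libraries listed under `gptkbp:implementedIn` are the more likely choice; scikit-learn's `CountVectorizer`, for example, produces a sparse count matrix.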