Statements (23)
Predicate | Object |
---|---|
gptkbp:instanceOf | gptkb:algorithm |
gptkbp:application | plagiarism detection, document deduplication, large-scale search, web page clustering |
gptkbp:describedBy | gptkb:Broder,_A._Z._(1997)._On_the_resemblance_and_containment_of_documents._Compression_and_Complexity_of_Sequences. |
https://www.w3.org/2000/01/rdf-schema#label | MinHash |
gptkbp:input | sets |
gptkbp:introducedIn | 1997 |
gptkbp:inventedBy | gptkb:Andrei_Broder |
gptkbp:output | signature |
gptkbp:property | probabilistic, scalable, efficient |
gptkbp:relatedTo | gptkb:Locality-sensitive_hashing, gptkb:Jaccard_index, set similarity |
gptkbp:usedFor | estimating Jaccard similarity |
gptkbp:usedIn | information retrieval, data mining, near-duplicate detection |
gptkbp:bfsParent | gptkb:Locality-sensitive_hashing |
gptkbp:bfsLayer | 5 |
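
The statements above describe MinHash as taking sets as input, producing a signature, and being used to estimate Jaccard similarity. A minimal Python sketch of that idea follows. It is an illustration under stated assumptions, not Broder's original formulation: the helper names (`make_hash_params`, `minhash_signature`, `estimate_jaccard`), the BLAKE2b base hash, the linear hash family modulo a Mersenne prime, and the choice of 256 hash functions are all hypothetical choices made for this example. The fraction of signature positions on which two sets agree serves as the estimate of their Jaccard similarity.

```python
import hashlib
import random

# Assumption: a Mersenne prime used as the modulus of the hash family.
PRIME = (1 << 61) - 1


def _stable_hash(item: str) -> int:
    """Map a string to a 64-bit integer, stable across Python runs."""
    digest = hashlib.blake2b(item.encode("utf-8"), digest_size=8).digest()
    return int.from_bytes(digest, "big")


def make_hash_params(num_hashes: int, seed: int = 0) -> list[tuple[int, int]]:
    """Draw (a, b) pairs for hash functions h(x) = (a*x + b) mod PRIME."""
    rng = random.Random(seed)
    return [(rng.randrange(1, PRIME), rng.randrange(0, PRIME))
            for _ in range(num_hashes)]


def minhash_signature(items: set[str], params: list[tuple[int, int]]) -> list[int]:
    """Signature = minimum hash value of the set under each hash function."""
    if not items:
        raise ValueError("MinHash signature is undefined for an empty set")
    base = [_stable_hash(x) % PRIME for x in items]
    return [min((a * h + b) % PRIME for h in base) for a, b in params]


def estimate_jaccard(sig_a: list[int], sig_b: list[int]) -> float:
    """Fraction of positions where the two signatures agree."""
    if len(sig_a) != len(sig_b):
        raise ValueError("signatures must have the same length")
    matches = sum(1 for x, y in zip(sig_a, sig_b) if x == y)
    return matches / len(sig_a)


def exact_jaccard(a: set[str], b: set[str]) -> float:
    """Exact Jaccard index |A ∩ B| / |A ∪ B|, for comparison."""
    return len(a & b) / len(a | b)


if __name__ == "__main__":
    params = make_hash_params(num_hashes=256, seed=42)
    doc_a = {str(i) for i in range(0, 1000)}
    doc_b = {str(i) for i in range(500, 1500)}
    sig_a = minhash_signature(doc_a, params)
    sig_b = minhash_signature(doc_b, params)
    print("estimated:", estimate_jaccard(sig_a, sig_b))
    print("exact:    ", exact_jaccard(doc_a, doc_b))
```

Both sets must be hashed with the same `params`, otherwise the signatures are not comparable. More hash functions reduce the variance of the estimate at the cost of a longer signature.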