Statements (23)
| Predicate | Object |
|---|---|
| gptkbp:instanceOf | gptkb:algorithm |
| gptkbp:application | plagiarism detection; document deduplication; large-scale search; web page clustering |
| gptkbp:describedBy | gptkb:Broder,_A._Z._(1997)._On_the_resemblance_and_containment_of_documents._Compression_and_Complexity_of_Sequences. |
| gptkbp:input | sets |
| gptkbp:introducedIn | 1997 |
| gptkbp:inventedBy | gptkb:Andrei_Broder |
| gptkbp:output | signature |
| gptkbp:property | probabilistic; scalable; efficient |
| gptkbp:relatedTo | gptkb:Locality-sensitive_hashing; gptkb:Jaccard_index; set similarity |
| gptkbp:usedFor | estimating Jaccard similarity |
| gptkbp:usedIn | information retrieval; data mining; near-duplicate detection |
| gptkbp:bfsParent | gptkb:Locality-sensitive_hashing |
| gptkbp:bfsLayer | 5 |
| https://www.w3.org/2000/01/rdf-schema#label | MinHash |
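
The statements above describe MinHash as taking sets as input, producing a signature as output, and being used to estimate Jaccard similarity. The following is a minimal Python sketch of that idea, not a reference implementation. The function names (`minhash_signature`, `estimate_jaccard`), the seeded BLAKE2b hash family, and the default of 128 hash functions are all assumptions chosen for illustration.

```python
import hashlib


def _seeded_hash(seed: int, item) -> int:
    """Hypothetical hash family: one 64-bit hash per seed, built on BLAKE2b."""
    data = f"{seed}:{item}".encode("utf-8")
    return int.from_bytes(hashlib.blake2b(data, digest_size=8).digest(), "big")


def minhash_signature(items, num_hashes: int = 128) -> list[int]:
    """Return the signature: the minimum hash of the set under each hash function."""
    unique = set(items)
    if not unique:
        raise ValueError("a signature is undefined for an empty set")
    return [min(_seeded_hash(seed, x) for x in unique) for seed in range(num_hashes)]


def estimate_jaccard(sig_a: list[int], sig_b: list[int]) -> float:
    """Estimate Jaccard similarity as the fraction of matching signature positions."""
    if len(sig_a) != len(sig_b):
        raise ValueError("signatures must have the same length")
    matches = sum(a == b for a, b in zip(sig_a, sig_b))
    return matches / len(sig_a)


def exact_jaccard(a, b) -> float:
    """Exact Jaccard index |A ∩ B| / |A ∪ B|, for comparison with the estimate."""
    a, b = set(a), set(b)
    return len(a & b) / len(a | b)
```

For two sets, `estimate_jaccard(minhash_signature(a), minhash_signature(b))` should land close to `exact_jaccard(a, b)`, with the error shrinking as `num_hashes` grows. The estimate is probabilistic, so it will generally not equal the exact value.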