Statements (23)
| Predicate | Object | 
|---|---|
| gptkbp:instanceOf | gptkb:algorithm | 
| gptkbp:application | plagiarism detection document deduplication large-scale search web page clustering | 
| gptkbp:describedBy | gptkb:Broder,_A._Z._(1997)._On_the_resemblance_and_containment_of_documents._Compression_and_Complexity_of_Sequences. | 
| gptkbp:input | sets | 
| gptkbp:introducedIn | 1997 | 
| gptkbp:inventedBy | gptkb:Andrei_Broder | 
| gptkbp:output | signature | 
| gptkbp:property | probabilistic scalable efficient | 
| gptkbp:relatedTo | gptkb:Locality-sensitive_hashing gptkb:Jaccard_index set similarity | 
| gptkbp:usedFor | estimating Jaccard similarity | 
| gptkbp:usedIn | information retrieval data mining near-duplicate detection | 
| gptkbp:bfsParent | gptkb:Locality-sensitive_hashing | 
| gptkbp:bfsLayer | 5 | 
| https://www.w3.org/2000/01/rdf-schema#label | MinHash |