gptkbp:instanceOf
|
string metric
similarity measure
|
gptkbp:advantage
|
gives higher scores to strings that share a common prefix
|
gptkbp:application
|
natural language processing
data deduplication
fuzzy string matching
|
gptkbp:basedOn
|
gptkb:Jaro_distance
|
gptkbp:developedBy
|
gptkb:Matthew_A._Jaro
gptkb:William_E._Winkler
|
gptkbp:field
|
gptkb:information_theory
computer science
data matching
|
gptkbp:form
|
Jaro similarity
common prefix length
prefix scale
|
gptkbp:higherScoreMeans
|
more similar strings (when read as a similarity; the corresponding distance is 1 minus the similarity)
|
https://www.w3.org/2000/01/rdf-schema#label
|
Jaro–Winkler distance
|
gptkbp:introducedIn
|
1990
|
gptkbp:measures
|
similarity between two strings
|
gptkbp:parameter
|
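common prefix length l (capped at a maximum, conventionally 4 characters)
scaling factor p (conventionally 0.1; at most 0.25 when the prefix cap is 4, so the score stays within 0 to 1)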
|
gptkbp:publishedIn
|
gptkb:Journal_of_the_American_Statistical_Association
|
gptkbp:range
|
0 to 1
|
gptkbp:relatedTo
|
gptkb:Hamming_distance
gptkb:Smith–Waterman_algorithm
gptkb:Levenshtein_distance
gptkb:Damerau–Levenshtein_distance
|
gptkbp:usedFor
|
spelling correction
duplicate detection
record linkage
|
gptkbp:bfsParent
|
gptkb:Ratcliff/Obershelp_pattern-matching_algorithm
gptkb:Jaro_distance
|
gptkbp:bfsLayer
|
8
|
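The form and parameter entries above (Jaro similarity, common prefix length, prefix scale) combine as sim_jw = sim_j + l * p * (1 - sim_j). The following is a minimal Python sketch of that combination, not a reference implementation. The function names, the default p = 0.1, and the prefix cap of 4 are illustrative assumptions that follow common convention rather than anything stated in this entry.

```python
def jaro_similarity(s1: str, s2: str) -> float:
    """Jaro similarity in [0, 1]; 1.0 means the strings are identical."""
    if s1 == s2:
        return 1.0
    len1, len2 = len(s1), len(s2)
    if len1 == 0 or len2 == 0:
        return 0.0

    # Characters count as matching only if they are equal and no farther
    # apart than this window.
    window = max(max(len1, len2) // 2 - 1, 0)
    matched1 = [False] * len1
    matched2 = [False] * len2
    matches = 0
    for i, ch in enumerate(s1):
        lo = max(0, i - window)
        hi = min(len2, i + window + 1)
        for j in range(lo, hi):
            if not matched2[j] and s2[j] == ch:
                matched1[i] = matched2[j] = True
                matches += 1
                break
    if matches == 0:
        return 0.0

    # Transpositions: half the number of matched characters that appear
    # in a different order in the two strings.
    seq1 = [c for c, m in zip(s1, matched1) if m]
    seq2 = [c for c, m in zip(s2, matched2) if m]
    transpositions = sum(a != b for a, b in zip(seq1, seq2)) // 2

    return (matches / len1
            + matches / len2
            + (matches - transpositions) / matches) / 3


def jaro_winkler_similarity(s1: str, s2: str,
                            p: float = 0.1, max_prefix: int = 4) -> float:
    """Jaro similarity boosted by the length of the shared prefix.

    p (prefix scale) and max_prefix (prefix cap) are assumed defaults.
    """
    jaro = jaro_similarity(s1, s2)
    prefix = 0
    for a, b in zip(s1[:max_prefix], s2[:max_prefix]):
        if a != b:
            break
        prefix += 1
    return jaro + prefix * p * (1 - jaro)


print(round(jaro_winkler_similarity("MARTHA", "MARHTA"), 4))   # 0.9611
print(round(jaro_winkler_similarity("DIXON", "DICKSONX"), 4))  # 0.8133
```

With p = 0.1 and a cap of 4, the prefix bonus can close at most 40% of the gap between the Jaro similarity and 1, which is why strings that agree at the start score higher than strings with the same Jaro similarity that differ at the start.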