Jellyfish is a Python library for doing approximate and phonetic matching of strings.
Written by James Turk <email@example.com> and Michael Stephens.
See https://github.com/jamesturk/jellyfish/graphs/contributors for contributors.
Source is available at http://github.com/jamesturk/jellyfish.
String comparison:
- Levenshtein Distance
- Damerau-Levenshtein Distance
- Jaro Distance
- Jaro-Winkler Distance
- Match Rating Approach Comparison
- Hamming Distance
Phonetic encoding:
- American Soundex
- Metaphone
- NYSIIS (New York State Identification and Intelligence System)
- Match Rating Codex
>>> import jellyfish
>>> jellyfish.levenshtein_distance('jellyfish', 'smellyfish')
2
>>> jellyfish.jaro_distance('jellyfish', 'smellyfish')
0.89629629629629637
>>> jellyfish.damerau_levenshtein_distance('jellyfish', 'jellyfihs')
1
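
To show what the Levenshtein figure in the session above measures, here is a minimal pure-Python sketch of the classic dynamic-programming edit distance. It is not Jellyfish's own implementation, and the function name `levenshtein` is a hypothetical one chosen for illustration.

```python
def levenshtein(a: str, b: str) -> int:
    """Illustrative edit distance (hypothetical helper, not part of Jellyfish).

    Counts the minimum number of single-character insertions, deletions,
    and substitutions needed to turn `a` into `b`.
    """
    # prev[j] is the distance between the prefix of `a` processed so far
    # and b[:j]; before any character of `a` is read, that is j insertions.
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]  # turning a[:i] into the empty string takes i deletions
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(
                prev[j] + 1,         # delete ca
                curr[j - 1] + 1,     # insert cb
                prev[j - 1] + cost,  # substitute ca with cb, or match
            ))
        prev = curr
    return prev[-1]


print(levenshtein("jellyfish", "smellyfish"))
print(levenshtein("jellyfish", "jellyfihs"))
```

Under this sketch the pair 'jellyfish' and 'jellyfihs' is two edits apart, because plain Levenshtein treats a swap of adjacent characters as two substitutions. Damerau-Levenshtein counts the same swap as a single edit, which is why the session above reports 1 for that pair.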
>>> jellyfish.metaphone('Jellyfish')
'JLFX'
>>> jellyfish.soundex('Jellyfish')
'J412'
>>> jellyfish.nysiis('Jellyfish')
'JALYF'
>>> jellyfish.match_rating_codex('Jellyfish')
'JLLFSH'
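
As a rough illustration of how a phonetic code such as the Soundex value above is derived, the following sketch implements the commonly described American Soundex rules in pure Python. It is an assumption-laden sketch rather than Jellyfish's code: the names `soundex_sketch` and `_CODES` are hypothetical, only ASCII letters are considered, and edge-case behavior may differ from `jellyfish.soundex`.

```python
# Consonant groups used by American Soundex (hypothetical lookup table).
_CODES = {}
for group, digit in (("BFPV", "1"), ("CGJKQSXZ", "2"), ("DT", "3"),
                     ("L", "4"), ("MN", "5"), ("R", "6")):
    for letter in group:
        _CODES[letter] = digit


def soundex_sketch(name: str) -> str:
    """Illustrative American Soundex (hypothetical helper, not Jellyfish's)."""
    # Assumption: keep ASCII letters only and ignore everything else.
    letters = [c for c in name.upper() if "A" <= c <= "Z"]
    if not letters:
        return ""
    result = letters[0]                 # the first letter is kept as-is
    last = _CODES.get(letters[0], "")   # its code still suppresses repeats
    for ch in letters[1:]:
        if ch in "HW":
            continue                    # H and W do not separate equal codes
        code = _CODES.get(ch, "")       # vowels and Y map to no code
        if code and code != last:
            result += code
            if len(result) == 4:
                break
        last = code                     # a vowel resets the repeat check
    return result.ljust(4, "0")         # pad short codes with zeros


print(soundex_sketch("Jellyfish"))
```

For 'Jellyfish' the sketch keeps the J, collapses the double L into a single 4, then encodes F as 1 and S as 2, which agrees with the 'J412' shown in the session above.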