Ratio
Fuzzy Wuzzy Ratio Similarity Measure
class py_stringmatching.similarity_measure.ratio.Ratio
Computes the Fuzzy Wuzzy ratio similarity measure.
The Fuzzy Wuzzy ratio raw score measures the similarity of two strings as an int in the range [0, 100]. For two strings X and Y, the score is defined as int(round((2.0 * M / T) * 100)), where T is the total number of characters in both strings and M is the number of matches between the two strings. The Fuzzy Wuzzy ratio sim score is a float in the range [0, 1], obtained by dividing the raw score by 100.
- Note:
If either string X or string Y is empty, the Fuzzy Wuzzy ratio similarity score is defined to be 0.
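The definition above can be sketched with Python's standard difflib.SequenceMatcher, whose ratio() method is documented as 2.0 * M / T. This is a minimal sketch, assuming matches are counted the way SequenceMatcher counts them; the function names ratio_raw_score and ratio_sim_score are hypothetical and are not part of py_stringmatching.

```python
from difflib import SequenceMatcher


def ratio_raw_score(x, y):
    """Hypothetical helper: raw ratio score as an int in [0, 100]."""
    if not isinstance(x, str) or not isinstance(y, str):
        raise TypeError("inputs must be strings")
    # An empty input is defined to score 0 (see the note on empty strings).
    if not x or not y:
        return 0
    # SequenceMatcher.ratio() returns 2.0 * M / T, where M is the number of
    # matched characters and T is the combined length of both strings.
    return int(round(SequenceMatcher(None, x, y).ratio() * 100))


def ratio_sim_score(x, y):
    """Hypothetical helper: raw score scaled to a float in [0, 1]."""
    return ratio_raw_score(x, y) / 100.0
```

For 'Robert' and 'Rupert', SequenceMatcher matches 'R' and 'ert', so M = 4 and T = 12, and int(round((2.0 * 4 / 12) * 100)) gives 67.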
get_raw_score(string1, string2)
Computes the Fuzzy Wuzzy ratio raw score between two strings. This score is in the range [0, 100].
- Parameters
string1 (str) – First input string
string2 (str) – Second input string
- Returns
Ratio measure raw score (int)
- Raises
TypeError – If the inputs are not strings
Examples
>>> from py_stringmatching.similarity_measure.ratio import Ratio
>>> s = Ratio()
>>> s.get_raw_score('Robert', 'Rupert')
67
>>> s.get_raw_score('Sue', 'sue')
67
>>> s.get_raw_score('example', 'samples')
71
get_sim_score(string1, string2)
Computes the Fuzzy Wuzzy ratio similarity score between two strings. This score is in the range [0, 1].
- Parameters
string1 (str) – First input string
string2 (str) – Second input string
- Returns
Ratio measure similarity score (float)
- Raises
TypeError – If the inputs are not strings
Examples
>>> from py_stringmatching.similarity_measure.ratio import Ratio
>>> s = Ratio()
>>> s.get_sim_score('Robert', 'Rupert')
0.67
>>> s.get_sim_score('Sue', 'sue')
0.67
>>> s.get_sim_score('example', 'samples')
0.71