Ratio¶
Fuzzy Wuzzy Ratio Similarity Measure
-
class
py_stringmatching.similarity_measure.ratio.
Ratio
[source]¶ Computes Fuzzy Wuzzy ratio similarity measure.
Fuzzy Wuzzy ratio raw score is a measure of the strings similarity as an int in the range [0, 100]. For two strings X and Y, the score is defined by int(round((2.0 * M / T) * 100)) where T is the total number of characters in both strings, and M is the number of matches in the two strings. Fuzzy Wuzzy ratio sim score is a float in the range [0, 1] and is obtained by dividing the raw score by 100.
- Note:
- In the case where either of strings X or Y are empty, we define the Fuzzy Wuzzy ratio similarity score to be 0.
-
get_raw_score
(string1, string2)[source]¶ Computes the Fuzzy Wuzzy ratio measure raw score between two strings. This score is in the range [0,100].
Parameters: string1,string2 (str) – Input strings Returns: Ratio measure raw score (int) is returned Raises: TypeError
– If the inputs are not stringsExamples
>>> s = Ratio() >>> s.get_raw_score('Robert', 'Rupert') 67 >>> s.get_raw_score('Sue', 'sue') 67 >>> s.get_raw_score('example', 'samples') 71
References
-
get_sim_score
(string1, string2)[source]¶ Computes the Fuzzy Wuzzy ratio similarity score between two strings. This score is in the range [0,1].
Parameters: string1,string2 (str) – Input strings Returns: Ratio measure similarity score (float) is returned Raises: TypeError
– If the inputs are not stringsExamples
>>> s = Ratio() >>> s.get_sim_score('Robert', 'Rupert') 0.67 >>> s.get_sim_score('Sue', 'sue') 0.67 >>> s.get_sim_score('example', 'samples') 0.71
References