py_stringmatching
Installation
Requirements
Platforms
Dependencies
Installing Using pip
Installing from Source Distribution
Tutorial
1. Selecting a Similarity Measure
2. Selecting a Tokenizer Type
3. Creating a Tokenizer Object and Using It to Tokenize the Input Strings
4. Creating a Similarity Measure Object and Using It to Compute a Similarity Score
Handling a Large Number of String Pairs
Handling Missing Values
References
Tokenizers
Alphabetic Tokenizer
Alphanumeric Tokenizer
Delimiter Tokenizer
Qgram Tokenizer
Whitespace Tokenizer
Similarity Measures
Affine Gap
Cosine
Dice
Hamming Distance
Jaccard
Jaro
Jaro Winkler
Levenshtein
Monge Elkan
Needleman Wunsch
Overlap Coefficient
Smith Waterman
Soft TF/IDF
TF/IDF
py_stringmatching
Docs
»
Tokenizers
View page source
Tokenizers
ΒΆ
Alphabetic Tokenizer
Alphanumeric Tokenizer
Delimiter Tokenizer
Qgram Tokenizer
Whitespace Tokenizer