py_stringmatching
0.3
  • What is New?
  • Installation
  • Tutorial
  • Tokenizers
  • Similarity Measures
py_stringmatching
  • Docs »
  • User Manual for py_stringmatching
  • View page source

User Manual for py_stringmatching¶

This document shows the users how to install and use the package. To contribute to or further develop the package, see the project website, section “For Contributors and Developers”.

Contents¶

  • What is New?
  • Installation
    • Requirements
    • Platforms
    • Dependencies
    • Installing Using pip
    • Installing from Source Distribution
  • Tutorial
    • 1. Selecting a Similarity Measure
    • 2. Selecting a Tokenizer Type
    • 3. Creating a Tokenizer Object and Using It to Tokenize the Input Strings
    • 4. Creating a Similarity Measure Object and Using It to Compute a Similarity Score
    • Handling a Large Number of String Pairs
    • Handling Missing Values
    • Adding Prefix and Suffix to the Input String for Qgram Tokenizers
    • Class Hierarchy for Tokenizers and Similarity Measures
    • References
  • Tokenizers
    • Alphabetic Tokenizer
    • Alphanumeric Tokenizer
    • Delimiter Tokenizer
    • Qgram Tokenizer
    • Whitespace Tokenizer
  • Similarity Measures
    • Affine Gap
    • Bag Distance
    • Cosine
    • Dice
    • Editex
    • Generalized Jaccard
    • Hamming Distance
    • Jaccard
    • Jaro
    • Jaro Winkler
    • Levenshtein
    • Monge Elkan
    • Needleman Wunsch
    • Overlap Coefficient
    • Partial Ratio
    • Partial Token Sort
    • Ratio
    • Smith Waterman
    • Soft TF/IDF
    • Soundex
    • TF/IDF
    • Token Sort
    • Tversky Index

Indices and tables¶

  • Index
  • Module Index
  • Search Page
Next

© Copyright 2017, py_stringmatching Team.

Built with Sphinx using a theme provided by Read the Docs.