Installation

Requirements

  • Python 2.7 or Python 3.4+

Platforms

py_entitymatching has been tested on Linux (Redhat enterprise Linux with 2.6 .32 kernel), OS X (Sierra), and Windows 10.

Dependencies

  • pandas (provides data structures to store and manage tables)
  • scikit-learn (provides implementations for common machine learning algorithms)
  • joblib (provides multiprocessing capabilities)
  • pyqt4 (provides tools to build GUIs)
  • py_stringsimjoin (provides implementations for string similarity joins)
  • py_stringmatching (provides a set of string tokenizers and string similarity functions)
  • cloudpickle (provides functions to serialize Python constructs)
  • pyprind (library to display progress indicators)
  • pyparsing (library to parse strings)
  • six (provides functions to write compatible code across Python 2 and 3)

py_entitymatching can be installed using conda, pip, or source distribution, as described below.

Installing Using conda

The easiest and recommended way to install the package is to use the command conda, which will retrieve py_entitymatching from Anaconda repository then install it:

conda install -c uwmagellan py_entitymatching

The above command will install py_entitymatching and all of its dependencies.

Note

To use the command conda, first you must install Miniconda or Anaconda. For more details refer to this conda page.

Installing Using pip

To install the package using pip, execute the following command:

pip install -U numpy scipy py_entitymatching

The above command will install py_entitymatching and all of its dependencies, except PyQt4. This is because pip can only install the dependency packages that are available in PyPI and PyQt4 is not in PyPI. So next, you need to install PyQt4, using the instructions on this page.

Installing from Source Distribution

Download the py_entitymatching package from here, unzip the package, then execute the following commands from the package root:

pip install -U numpy scipy
python setup.py install

which installs py_stringmatching into the default Python directory on your machine. If you do not have installation permission for that directory then you can install the package in your home directory as follows:

python setup.py install --user

For more information see this StackOverflow link.

The above commands will install py_entitymatching and all of its dependencies, except PyQt4. This is because similar to pip, setup.py can only install the dependency packages that are available in PyPI and PyQt4 is not in PyPI. So next, you need to install PyQt4, using the instructions on this page.

Scroll To Top