============ Installation ============ Requirements ------------ * Python 2.7 or Python 3.4+ * C++ compiler (parts of the package are in Cython for efficiency reasons, and you need C++ compiler to compile these parts) Platforms ------------ py_stringsimjoin has been tested on Linux (Ubuntu with Kernel Version 3.13.0-40-generic), OS X (Darwin with Kernel Version 13.4.0), and Windows 8.1. Dependencies ------------ * pandas (to manage tables of tuples to be joined) * joblib (to write code that runs over multiple cores) * py_stringmatching (to tokenize and compute similarity scores between strings) * pyprind (to display progress bars) * six (to ensure our code run on both Python 2.x and Python 3.x) .. note:: The py_stringsimjoin installer will automatically install the above required packages. C Compiler Required ------------------- Before installing this package, you need to make sure that you have a C compiler installed. This is necessary because this package contains Cython files. Go `here `_ for more information about how to check whether you already have a C compiler and how to install a C compiler. After you have confirmed that you have a C compiler installed, you are ready to install the package. There are two ways to install py_stringsimjoin package: using pip or source distribution. Installing Using pip -------------------- The easiest way to install the package is to use pip, which will retrieve py_stringsimjoin from PyPI then install it:: pip install py_stringsimjoin Installing from Source Distribution ------------------------------------- Step 1: Download the source code of the py_stringsimjoin package from `here `_. (Download code in tar.gz format for Linux and OS X, and code in zip format for Windows.) Step 2: Untar or unzip the package and execute the following command from the package root:: python setup.py install .. note:: The above command will try to install py_stringsimjoin into the defaul Python directory on your machine. If you do not have installation permission for that directory then you can install the package in your home directory as follows:: python setup.py install --user For more information see the following StackOverflow `link `_.