Guides

The goal of this page is to give you some concrete examples for using py_entitymatching. These are examples with sample data that is already bundled along with the package. The examples are in the form of Jupyter notebooks.

A Quick Tour of Jupyter Notebook

This tutorial gives a quick tour on installing and using Jupyter notebook.

End-to-End EM Workflows

  • EM workflow with blocking using a overlap blocker and matching using Random Forest matcher: Jupyter notebook

  • EM workflow with blocking using a overlap blocker, selecting among multiple matchers, using the selected matcher to predict matches, and evaluating the predicted matches: Jupyter notebook

  • EM workflow with blocking using multiple blockers (overlap and attribute equivalence blocker), debugging the blocker output, selecting among multiple matchers, debugging the matcher output, using the selected matcher to predict matches, and evaluating the predicted matches: Jupyter notebook

Stepwise Guides