Metadata-Version: 2.1
Name: blockingpy
Version: 0.1.1
Summary: Blocking records for record linkage and data deduplication based on ANN algorithms.
Home-page: https://github.com/T-Strojny/BlockingPy
License: MIT
Keywords: record-linkage,deduplication,ANN,blocking,data-matching
Author: Tymoteusz Strojny
Author-email: tymek.strojny@gmail.com
Requires-Python: >=3.10,<4.0
Classifier: Development Status :: 3 - Alpha
Classifier: Intended Audience :: Science/Research
Classifier: License :: OSI Approved :: MIT License
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.10
Classifier: Programming Language :: Python :: 3.11
Classifier: Programming Language :: Python :: 3.12
Classifier: Programming Language :: Python :: 3.13
Classifier: Topic :: Scientific/Engineering :: Information Analysis
Classifier: Topic :: Software Development :: Libraries :: Python Modules
Requires-Dist: annoy (>=1.17.3,<2.0.0)
Requires-Dist: faiss-cpu (>=1.9.0,<2.0.0)
Requires-Dist: hnswlib (>=0.8.0,<0.9.0)
Requires-Dist: mlpack (>=4.4.0,<5.0.0)
Requires-Dist: networkx (>=3.1,<4.0)
Requires-Dist: nltk (>=3.9.1,<4.0.0)
Requires-Dist: numpy (>=1.21,<2.0)
Requires-Dist: pandas (>=2.2.0,<3.0.0)
Requires-Dist: pynndescent (>=0.5.13,<0.6.0)
Requires-Dist: scikit-learn (>=1.5.2,<2.0.0)
Requires-Dist: scipy (>=1.5.0)
Requires-Dist: voyager (>=2.0.9,<3.0.0)
Project-URL: Bug Tracker, https://github.com/T-Strojny/BlockingPy/issues
Project-URL: Repository, https://github.com/T-Strojny/BlockingPy
Description-Content-Type: text/markdown

[![License](https://img.shields.io/github/license/T-Strojny/BlockingPy)](https://github.com/T-Strojny/BlockingPy/blob/main/LICENSE)

Unfinished python port of [R's blocking package](https://github.com/ncn-foreigners/blocking) for blocking records for record linkage and data deduplication based on ANN algorithms.

