Sign in Sign up
pypi

goldencheck

Zero-config entity resolution that scales from a CSV to 100M+ rows on a Ray cluster (verified: 100M deduped in 213s, 0.30 GB driver). Fuzzy + exact + probabilistic dedupe, identity graph, PPRL, LLM boost. Python + full TypeScript port; SQL-native in PostgreSQL & DuckDB; MCP/REST servers, dbt + Airflow recipes.

Latest release
Jun 01, 2026
Releases
14
Known CVEs
0
First release
Mar 23, 2026
License
MIT
View on Pypi
Repository

Source

benseverndev-oss/goldenmatch
Stars
86
Forks
10
Open issues
2
Language
Python
  • data-engineering
  • data-quality
  • deduplication
  • entity-resolution
  • fuzzy-matching
  • llm
  • polars
  • python

Security score

6.5 / 10 OpenSSF
CII-Best-Practices
0
Code-Review
0
Contributors
0
Maintained
0
Token-Permissions
0
Branch-Protection
4

Packages from this repo

Insights

Activity

Total releases
14
Last 12 months
14
Cadence
~daily
Dependencies
19

Releases per month

last 12 months

Release mix

  • major 1
  • minor 8
  • patch 4
14 releases
Releases
Version Released
1.3.0 minor
1.2.0 minor
1.1.2 patch
1.1.1 patch
1.1.0 minor
1.0.2 patch
1.0.1 patch
1.0.0 major
0.9.0 minor
0.6.0 minor
0.5.0 minor
0.3.0 minor
0.2.0 minor
0.1.0 initial