goldencheck
Zero-config entity resolution that scales from a CSV to 100M+ rows on a Ray cluster (verified: 100M deduped in 213s, 0.30 GB driver). Fuzzy + exact + probabilistic dedupe, identity graph, PPRL, LLM boost. Python + full TypeScript port; SQL-native in PostgreSQL & DuckDB; MCP/REST servers, dbt + Airflow recipes.
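To make "fuzzy + exact dedupe" concrete, here is a minimal sketch using only the Python standard library. It is not goldencheck's API or algorithm: `normalize`, `dedupe`, and the 0.85 threshold are hypothetical names and values chosen for illustration, and the greedy comparison against each cluster's first record is a simplification that would not scale to large inputs.

```python
from difflib import SequenceMatcher


def normalize(name: str) -> str:
    """Lowercase and collapse whitespace so trivially different spellings match exactly."""
    return " ".join(name.lower().split())


def dedupe(records: list[str], threshold: float = 0.85) -> list[list[str]]:
    """Greedily group records that match exactly (after normalization) or fuzzily.

    Each record is compared against the first member of every existing cluster.
    The threshold is an illustrative assumption, not a recommended value.
    """
    clusters: list[list[str]] = []
    for rec in records:
        key = normalize(rec)
        for cluster in clusters:
            rep = normalize(cluster[0])
            # Exact match first, then fall back to a similarity ratio.
            if key == rep or SequenceMatcher(None, key, rep).ratio() >= threshold:
                cluster.append(rec)
                break
        else:
            # No existing cluster was close enough, so start a new one.
            clusters.append([rec])
    return clusters


records = ["Acme Corp", "acme  corp", "Acme Corp.", "Globex Inc"]
clusters = dedupe(records)
for cluster in clusters:
    print(cluster)
```

The three "Acme" spellings land in one cluster and "Globex Inc" in its own. Real entity-resolution tools typically add blocking and scoring across multiple fields instead of comparing every record to every cluster.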
- Latest release: Jun 01, 2026
- Releases: 14
- Known CVEs: 0
- First release: Mar 23, 2026
- License: MIT
Repository
- Stars: 86
- Forks: 10
- Open issues: 2
- Language: Python
- Topics: data-engineering, data-quality, deduplication, entity-resolution, fuzzy-matching, llm, polars, python
Security score: 6.5 / 10
OpenSSF
- CII-Best-Practices: 0
- Code-Review: 0
- Contributors: 0
- Maintained: 0
- Token-Permissions: 0
- Branch-Protection: 4
Insights
Activity
- Total releases: 14
- Last 12 months: 14
- Cadence: ~daily
- Dependencies: 19
[Chart: releases per month, last 12 months]
Release mix
- major: 1
- minor: 8
- patch: 4
- Total: 14 releases
Dependencies
Depends on (version 1.3.0), showing 1–10 of 19:
- aiohttp >=3.9
- anthropic >=0.30
- connectorx >=0.3
- goldencheck-types
- mcp >=1.0
- numpy >=1.26
- openai >=1.30
- openpyxl >=3.1
- polars >=1.0
- pydantic >=2.7
Used by: 4
Releases
| Version | Released | Type |
|---|---|---|
| 1.3.0 | | minor |
| 1.2.0 | | minor |
| 1.1.2 | | patch |
| 1.1.1 | | patch |
| 1.1.0 | | minor |
| 1.0.2 | | patch |
| 1.0.1 | | patch |
| 1.0.0 | | major |
| 0.9.0 | | minor |
| 0.6.0 | | minor |
| 0.5.0 | | minor |
| 0.3.0 | | minor |
| 0.2.0 | | minor |
| 0.1.0 | | initial |