kreuzberg
A polyglot document intelligence framework with a Rust core. Extract text, metadata, images, and structured information from PDFs, Office documents, images, and 97+ formats. Available for Rust, Python, Ruby, Java, Go, PHP, Elixir, C#, R, C, TypeScript (Node/Bun/Wasm/Deno)- or use via CLI, REST API, or MCP server.
- Latest release
- 14h ago
- Releases
- 108
- Known CVEs
- 0
- First release
- Dec 09, 2025
- License
- Elastic-2.0
Repository
Source
- Stars
- 8.5k
- Forks
- 497
- Open issues
- 8
- Language
- Rust
- text-extraction
- document-intelligence
- metadata-extraction
- pdf-extraction
- pdfium
- python
- rag
- table-extraction
Security score
No OpenSSF Scorecard available for this repository.
Packages from this repo
-
kreuzberg-cli
-
kreuzberg-paddle-ocr
-
kreuzberg-tesseract
-
kreuzberg
-
@kreuzberg/node-darwin-arm64
-
@kreuzberg/node-darwin-x64
-
@kreuzberg/node-linux-arm64-gnu
-
@kreuzberg/node-linux-arm64-musl
-
@kreuzberg/node-linux-x64-gnu
-
@kreuzberg/node-linux-x64-musl
-
@kreuzberg/node-win32-arm64-msvc
-
@kreuzberg/node-win32-x64-msvc
Insights
Activity
- Total releases
- 108
- Last 12 months
- 108
- Cadence
- ~daily
- Dependencies
- 118
Releases per month
last 12 monthsRelease mix
- minor 9
- patch 61
- pre 37
108
releases
Dependencies
Depends on
5.0.0-rc.10-
ahash ^0.8.12
-
anyhow ^1.0 dev
-
async-trait ^0.1.89
-
auto_enums ^0.8
-
axum ^0.8
-
base64 ^0.22.1
-
biblatex ^0.12
-
biblib ^0.4
-
bitvec ^1.0
-
blake3 ^1
1–10 of 118
Used by
1Releases
| Version | Released | |
|---|---|---|
5.0.0-rc.10
pre
| ||
5.0.0-rc.8
pre
| ||
5.0.0-rc.7
pre
| ||
5.0.0-rc.5
pre
| ||
5.0.0-rc.4
pre
| ||
4.9.9
patch
| ||
5.0.0-rc.3
pre
| ||
5.0.0-rc.2
pre
| ||
5.0.0-rc.1
pre
| ||
4.9.8
patch
| ||
4.9.7
patch
| ||
4.9.6
patch
| ||
4.10.0-rc.15
pre
| ||
4.10.0-rc.14
pre
| ||
4.10.0-rc.12
pre
| ||
4.10.0-rc.11
pre
| ||
4.10.0-rc.9
pre
| ||
4.9.4
patch
| ||
4.9.3
patch
| ||
4.9.2
patch
|
1–20 of 108