kreuzberg
A polyglot document intelligence framework with a Rust core. Extract text, metadata, images, and structured information from PDFs, Office documents, images, and 97+ formats. Available for Rust, Python, Ruby, Java, Go, PHP, Elixir, C#, R, C, TypeScript (Node/Bun/Wasm/Deno)- or use via CLI, REST API, or MCP server.
- Latest release
- 5d ago
- Releases
- 134
- Known CVEs
- 0
- First release
- Feb 01, 2025
- License
- Elastic-2.0
Repository
Source
- Stars
- 8.5k
- Forks
- 497
- Open issues
- 8
- Language
- Rust
- text-extraction
- document-intelligence
- metadata-extraction
- pdf-extraction
- pdfium
- python
- rag
- table-extraction
Security score
No OpenSSF Scorecard available for this repository.
Packages from this repo
-
kreuzberg
-
kreuzberg-cli
-
kreuzberg-paddle-ocr
-
kreuzberg-tesseract
-
github.com/kreuzberg-dev/kreuzberg
-
github.com/kreuzberg-dev/kreuzberg/packages/go/v4
-
kreuzberg
-
@kreuzberg/node-darwin-arm64
-
@kreuzberg/node-darwin-x64
-
@kreuzberg/node-linux-arm64-gnu
-
@kreuzberg/node-linux-arm64-musl
-
@kreuzberg/node-linux-x64-gnu
Insights
Activity
- Total releases
- 134
- Last 12 months
- 110
- Cadence
- ~daily
- Dependencies
- 3
Releases per month
last 12 monthsRelease mix
- major 3
- minor 38
- patch 91
- pre 1
134
releases
Dependencies
Depends on
4.9.9Used by
17-
ai-prishtina-agentic-rag
-
code-knowledge-graph-tool
-
crab-scholar
-
docrunr
-
iflow-mcp_yuanjua-chiken
-
kreuzberg-txtai
-
kreuzberg-haystack
-
kreuzberg
-
kreuzberg-crewai
-
kreuzberg-surrealdb
1–10 of 17
Releases
| Version | Released | |
|---|---|---|
4.9.9
patch
| ||
5.0.0rc3
pre
| ||
4.9.7
patch
| ||
4.9.5
patch
| ||
4.9.4
patch
| ||
4.9.3
patch
| ||
4.9.2
patch
| ||
4.9.1
minor
| ||
4.8.6
patch
| ||
4.8.5
patch
| ||
4.8.2
patch
| ||
4.8.0
minor
| ||
4.7.4
patch
| ||
4.7.3
patch
| ||
4.7.2
patch
| ||
4.7.1
patch
| ||
4.7.0
minor
| ||
4.6.3
patch
| ||
4.6.2
patch
| ||
4.6.1
patch
|
1–20 of 134