datatrove
Hugging Face library to process and filter large amounts of web data
- Latest release: Mar 04, 2026
- Releases: 10
- Known CVEs: 0
- First release: Dec 06, 2023
- License: Apache-2.0
Security score
No OpenSSF Scorecard available for this repository.
Activity
- Total releases: 10
- Last 12 months: 4
- Cadence: ~3 months
- Dependencies: 61
[Chart: releases per month, last 12 months]
Release mix (10 releases)
- minor: 8
- pre: 1
Dependencies
Depends on (version 0.9.0, showing 1–10 of 61)
- aiofiles
- aiosqlite
- bitsandbytes
- botok
- datasets >=3.1.0
- datatrove
- dill >=0.3.0
- fasteners
- fasttext-numpy2-wheel
- faust-cchardet
Used by: 3
Releases
| Version | Type |
|---|---|
| 0.9.0 | minor |
| 0.8.0 | minor |
| 0.7.0 | minor |
| 0.6.0 | minor |
| 0.5.0 | minor |
| 0.4.0 | minor |
| 0.3.0 | minor |
| 0.2.0 | minor |
| 0.0.1 | initial |
| 0.0.1.dev0 | pre |