auralith-data-pipeline
Production-grade data collection and processing pipeline for training LLMs and multimodal AI
- Latest release
- Mar 16, 2026
- Releases
- 8
- Known CVEs
- 0
- First release
- Mar 02, 2026
- License
- Apache-2.0
Repository
Source
- Stars
- —
- Forks
- —
- Open issues
- —
Security score
No OpenSSF Scorecard available for this repository.
Packages from this repo
No other tracked packages from this repository.
Insights
Activity
- Total releases
- 8
- Last 12 months
- 8
- Cadence
- ~daily
- Dependencies
- 51
Releases per month
last 12 monthsRelease mix
- patch 7
8
releases
Dependencies
Depends on
0.1.11-
astropy <7.0,>=5.3.0
-
azure-storage-blob <13.0,>=12.19.0
-
black >=23.7.0
-
boto3 <2.0,>=1.28.0
-
click <9.0,>=8.1.0
-
datasets <4.0,>=2.14.0
-
datasketch <2.0,>=1.6.0
-
decord <1.0,>=0.6.0
-
extract-msg <1.0,>=0.48.0
-
faiss-cpu <2.0,>=1.7.4
1–10 of 51
Used by
Nothing tracked depends on this yet.
Releases
| Version | Released | |
|---|---|---|
0.1.11
patch
| ||
0.1.10
patch
| ||
0.1.9
patch
| ||
0.1.8
patch
| ||
0.1.7
patch
| ||
0.1.6
patch
| ||
0.1.5
patch
| ||
0.1.4
initial
|