thecrawler
Web scraper, PDF/DOCX parser, LLM-ready markdown. Extract structured data from any URL.
- Latest release
- Apr 16, 2026
- Releases
- 2
- Known CVEs
- 0
- First release
- Apr 16, 2026
- License
- unknown
Repository
Source
- Stars
- —
- Forks
- —
- Open issues
- —
Security score
No OpenSSF Scorecard available for this repository.
Packages from this repo
No other tracked packages from this repository.
Insights
Activity
- Total releases
- 2
- Last 12 months
- 2
- Cadence
- ~daily
- Dependencies
- 10
Releases per month
last 12 monthsRelease mix
- patch 1
2
releases
Dependencies
Depends on
0.1.1-
cheerio ^1.0.0
-
commander ^12.0.0
-
crawlee ^3.9.0
-
google-search-results-nodejs ^2.1.0
-
mammoth ^1.8.0
-
@modelcontextprotocol/sdk ^1.29.0
-
pdf-parse ^1.1.4
-
playwright *
-
turndown ^7.2.0
-
turndown-plugin-gfm ^1.0.2
Used by
Nothing tracked depends on this yet.
Releases