vllm-tpu
A high-throughput and memory-efficient inference and serving engine for LLMs
- Latest release
- Jun 05, 2026
- Releases
- 22
- Known CVEs
- 0
- First release
- May 27, 2025
- License
- Apache-2.0
Repository
Source
- Stars
- —
- Forks
- —
- Open issues
- —
Security score
No OpenSSF Scorecard available for this repository.
Packages from this repo
Insights
Activity
- Total releases
- 22
- Last 12 months
- 20
- Cadence
- ~5 days
- Dependencies
- 79
Releases per month
last 12 monthsRelease mix
- minor 8
- pre 13
22
releases
Dependencies
Depends on
0.21.0-
aiohttp >=3.13.3
-
anthropic >=0.71.0
-
av
-
blake3
-
cachetools
-
cbor2
-
cloudpickle
-
cmake >=3.26.1
-
compressed-tensors ==0.15.0.1
-
datasets
1–10 of 79
Used by
Nothing tracked depends on this yet.
Releases
| Version | Released | |
|---|---|---|
0.21.0
minor
| ||
0.20.0
minor
| ||
0.19.0
minor
| ||
0.18.0
minor
| ||
0.18.0rc1
pre
| ||
0.13.3
minor
| ||
0.13.2.post6
pre
| ||
0.13.2rc4
pre
| ||
0.13.2rc4.post6
pre
| ||
0.13.2rc3
pre
| ||
0.13.2rc1
pre
| ||
0.12.0
minor
| ||
0.12.0rc2
pre
| ||
0.12.0rc1
pre
| ||
0.11.2rc1
pre
| ||
0.11.1
minor
| ||
0.11.1rc3
pre
| ||
0.11.1rc2
pre
| ||
0.11.1rc1
pre
| ||
0.10.1.1
minor
|
1–20 of 22