vllm-acc
A high-throughput and memory-efficient inference and serving engine for LLMs
- Latest release
- May 24, 2024
- Releases
- 8
- Known CVEs
- 0
- First release
- May 03, 2024
- License
- Apache-2.0
Repository
Source
- Stars
- —
- Forks
- —
- Open issues
- —
Security score
No OpenSSF Scorecard available for this repository.
Packages from this repo
Insights
Activity
- Total releases
- 8
- Last 12 months
- 0
- Cadence
- ~daily
- Dependencies
- 28
Releases per month
last 12 monthsRelease mix
- patch 7
8
releases
Dependencies
Depends on
0.4.21716571491.2888474-
aiohttp
-
cmake >=3.21
-
fastapi
-
filelock >=3.10.4
-
lm-format-enforcer ==0.10.1
-
ninja
-
numpy
-
nvidia-ml-py
-
openai
-
outlines ==0.0.34
1–10 of 28
Used by
Nothing tracked depends on this yet.
Releases
| Version | Released | |
|---|---|---|
0.4.21716571491.2888474
patch
| ||
0.4.11715037925.7634745
patch
| ||
0.4.11715032367.5221682
patch
| ||
0.4.11715028389.1049566
patch
| ||
0.4.11715025949.4067512
patch
| ||
0.4.11714783937.8746862
patch
| ||
0.4.11714782966.854302
patch
| ||
0.4.1
initial
|