inference-proxy
Inference Proxy is an OpenAI-compatible http proxy server for inferencing various LLMs capable of working with Google, Anthropic, OpenAI APIs, local PyTorch inference, etc.
- Latest release
- Apr 02, 2026
- Releases
- 21
- Known CVEs
- 0
- First release
- May 24, 2025
- License
- custom
Insights
Activity
- Total releases
- 21
- Last 12 months
- 18
- Cadence
- ~5 days
- Dependencies
- 12
Releases per month
last 12 monthsRelease mix
- major 3
- minor 7
- patch 9
- pre 1
21
releases
Dependencies
Depends on
3.2.2-
ai-microcore <7,>=5.1.2
-
anthropic <1,>=0.77
-
fastapi <1,>=0.121.3
-
google-genai <2,>=1.62.0
-
pydantic <2.13.0,>=2.12.5
-
pytest <8.5.0,>=8.4.2
-
pytest-asyncio <1.3.0,>=1.2.0
-
pytest-cov <7.1.0,>=7.0.0
-
requests <3,>=2.32.5
-
typer <1,>=0.24.0
1–10 of 12
Used by
Nothing tracked depends on this yet.
Releases
| Version | Released | |
|---|---|---|
3.2.2
patch
|
3.2.2
patch
Dependencies (12)
+ 4 more |
|
3.2.1
patch
|
3.2.1
patch
|
|
3.2.0
minor
|
3.2.0
minor
|
|
3.1.0
minor
|
3.1.0
minor
|
|
3.0.2
patch
|
3.0.2
patch
|
|
3.0.1
patch
|
3.0.1
patch
|
|
3.0.0
major
|
3.0.0
major
|
|
3.0.0.dev1
pre
|
3.0.0.dev1
pre
|
|
2.1.1
patch
|
2.1.1
patch
|
|
2.1.0
minor
|
2.1.0
minor
|
|
2.0.0
major
|
2.0.0
major
|
|
1.1.0
minor
|
1.1.0
minor
|
|
1.0.0
major
|
1.0.0
major
|
|
0.4.0
minor
|
0.4.0
minor
|
|
0.3.0
minor
|
0.3.0
minor
|
|
0.2.2
patch
|
0.2.2
patch
|
|
0.2.1
patch
|
0.2.1
patch
|
|
0.2.0
minor
|
0.2.0
minor
|
|
0.0.3
patch
|
0.0.3
patch
|
|
0.0.2
patch
|
0.0.2
patch
|
1–20 of 21