Sign in Sign up
cargo

shimmy

⚡ Pure-Rust WebGPU inference engine — OpenAI-API compatible, GGUF native, runs on any GPU. No Python. No llama.cpp. Single binary.

Latest release
4h ago
Releases
21
Known CVEs
0
First release
Sep 04, 2025
License
MIT
Downloads
12.2k
View on Cargo
Repository

Source

michael-a-kuykendall/shimmy
Stars
5.4k
Forks
510
Open issues
11
Language
Rust
  • llama
  • llamacpp
  • llm-inference
  • ollama-api
  • command-line-tool
  • gguf
  • inference-server
  • local-ai

Security score

No OpenSSF Scorecard available for this repository.

Packages from this repo

No other tracked packages from this repository.

Insights

Activity

Total releases
21
Last 12 months
21
Cadence
~2 days
Dependencies
36

Releases per month

last 12 months

Release mix

  • major 2
  • minor 9
  • patch 9
21 releases
Dependencies

Depends on

2.2.0
1–10 of 36

Used by

Nothing tracked depends on this yet.

Releases
Version Released
2.2.0 minor
2.0.1 patch
2.0.0 major
1.9.0 minor yanked
1.8.2 patch yanked
1.8.1 minor
1.7.4 patch
1.7.3 patch
1.7.0 minor
1.6.0 minor
1.4.2 patch
1.5.1 minor
1.4.1 patch
1.4.0 minor
1.3.5 patch
1.3.4 patch
1.3.3 minor
1.2.0 minor
1.1.0 major
0.1.1 patch
1–20 of 21