Sign in Sign up

pypi

GPTQModel

Production ready LLM model compression/quantization toolkit with hw accelerated inference support for both cpu/gpu via HF, vLLM, and SGLang.

Latest release: Jun 08, 2026
Releases: 55
Known CVEs: 0
First release: Aug 15, 2024
License: Apache-2.0

Repository

Source

modelcloud/gptqmodel

Stars: —
Forks: —
Open issues: —

Security score

No OpenSSF Scorecard available for this repository.

Packages from this repo

No other tracked packages from this repository.

Insights

Activity

Total releases: 55
Last 12 months: 20
Cadence: ~5 days
Dependencies: 45

Releases per month

last 12 months

Release mix

major 5
minor 19
patch 29
pre 1

55 releases

Dependencies

Depends on

7.1.0

accelerate >=1.13.0
bitblas ==0.1.0.post1
bitsandbytes >=0.49.3
datasets >=3.6.0
defuser >=0.0.21
device-smi >=0.5.5
dill >=0.3.8
evalution
fastapi
flashinfer-python >=0.3.1

1–10 of 45

Used by

Nothing tracked depends on this yet.

Releases

Version	Released
`7.1.0` minor	Jun 08, 2026
`7.0.0` major	Apr 28, 2026
`6.0.3` patch	Apr 03, 2026
`6.0.0` major	Apr 02, 2026
`5.8.0` minor	Mar 19, 2026
`5.7.0` minor	Feb 11, 2026
`5.6.12` patch	Dec 17, 2025
`5.6.10` patch	Dec 16, 2025
`5.6.8` patch	Dec 16, 2025
`5.6.6` patch	Dec 15, 2025
`5.6.2` patch	Dec 12, 2025
`5.6.0` minor	Dec 09, 2025
`5.4.2` patch	Nov 16, 2025
`5.4.0` minor	Nov 09, 2025
`5.2.0` minor	Nov 02, 2025
`5.0.0` major	Oct 24, 2025
`4.2.5` patch	Sep 16, 2025
`4.2.0` minor	Sep 12, 2025
`4.1.0` minor	Sep 08, 2025
`4.0.0` major	Aug 22, 2025

1–20 of 55

Release calendar

2026

S M T W T F S

Jan

Feb

Mar

Apr

May

Jun

Jul

Aug

Sep

Oct

Nov

Dec