Download Latest Version v0.7.1_ patch release source code.tar.gz (7.5 MB)
Email in envelope

Get an email when there's a new version of AutoGPTQ

Home / v0.5.0
Name Modified Size InfoDownloads / Week
Parent folder
README.md 2023-11-02 4.3 kB
v0.5.0_ Exllama v2 GPTQ kernels, RoCm 5.6_5.7 support, many bugfixes source code.tar.gz 2023-11-02 7.4 MB
v0.5.0_ Exllama v2 GPTQ kernels, RoCm 5.6_5.7 support, many bugfixes source code.zip 2023-11-02 7.5 MB
Totals: 3 Items   14.9 MB 1

Exllama v2 GPTQ kernel support

The more performant GPTQ kernels from @turboderp's exllamav2 library are now available directly in AutoGPTQ, and are the default backend choice.

A comprehensive benchmark is available here.

CPU inference support

This is experimental.

Loading from safetensors is now the default

Falcon, Mistral support

Other changes and bugfixes

New Contributors

Full Changelog: https://github.com/PanQiWei/AutoGPTQ/compare/v0.4.2...v0.5.0

Source: README.md, updated 2023-11-02