Download Latest Version v0.7.1_ patch release source code.tar.gz (7.5 MB)
Email in envelope

Get an email when there's a new version of AutoGPTQ

Home / v0.7.1
Name Modified Size InfoDownloads / Week
Parent folder
README.md 2024-03-01 1.0 kB
v0.7.1_ patch release source code.tar.gz 2024-03-01 7.5 MB
v0.7.1_ patch release source code.zip 2024-03-01 7.6 MB
Totals: 3 Items   15.0 MB 1

Support loading sharded quantized checkpoints

Sharded checkpoints can now be loaded in the from_quantized method.

Gemma GPTQ quantization

Gemma model can be quantized with AutoGPTQ.

Other changes and fixes

Full Changelog: https://github.com/AutoGPTQ/AutoGPTQ/compare/v0.7.0...v0.7.1

Source: README.md, updated 2024-03-01