AutoGPTQ - Browse /v0.7.1 at SourceForge.net

The interactive file manager requires Javascript. Please enable it or use sftp or scp.
You may still browse the files here.

Name	Modified	Size	InfoDownloads / Week
Parent folder
README.md	2024-03-01	1.0 kB	0
v0.7.1_ patch release source code.tar.gz	2024-03-01	7.5 MB	0
v0.7.1_ patch release source code.zip	2024-03-01	7.6 MB	1
Totals: 3 Items		15.0 MB	1

Support loading sharded quantized checkpoints

Sharded checkpoints can now be loaded in the from_quantized method.

Support loading sharded quantized checkpoints. by @LaaZa in https://github.com/AutoGPTQ/AutoGPTQ/pull/425

Gemma model can be quantized with AutoGPTQ.

Add back missing import by @fxmarty in https://github.com/AutoGPTQ/AutoGPTQ/pull/553
Fix bias materialization for Marlin by @fxmarty in https://github.com/AutoGPTQ/AutoGPTQ/pull/554
Fix shape check marlin by @fxmarty in https://github.com/AutoGPTQ/AutoGPTQ/pull/557
Explicitely check compute capability in marlin's QLinear by @fxmarty in https://github.com/AutoGPTQ/AutoGPTQ/pull/567
Compatibility with latest transformers by @fxmarty in https://github.com/AutoGPTQ/AutoGPTQ/pull/573

Full Changelog: https://github.com/AutoGPTQ/AutoGPTQ/compare/v0.7.0...v0.7.1

Source: README.md, updated 2024-03-01