Fully deprecate AutoGPTQ and AutoAWQ for GPT-QModel #2932

ZX-ModelCloud · 2025-12-01T09:24:13Z

Remove AutoGPTQ clutter and AutoGPTQ-related configs for which backward compatibility is not worth adding.
See
huggingface/transformers#41567
huggingface/optimum#2385

Signed-off-by: ZX-ModelCloud <[email protected]>

Qubitium · 2025-12-02T09:10:01Z

@BenjaminBossan This PR is now synced with the pending Optimum/Transformers PRs and is ready for final review of this portion. All relevant tests pass when paired with the Transformers companion PR and the pending GPT-QModel 5.4.4 release (later today).

Signed-off-by: ZX-ModelCloud <[email protected]>

BenjaminBossan · 2025-12-04T13:05:17Z

Thanks for all the work @ZX-ModelCloud and @Qubitium. Let's wait for the transformers PR to be merged and then do the final testing on PEFT.

There is now a small merge conflict in the Dockerfile. It's only because we use conda run now, so it should be easy to fix. Could you please take care of it?

ZX-ModelCloud · 2025-12-05T01:35:38Z

> Thanks for all the work @ZX-ModelCloud and @Qubitium. Let's wait for the transformers PR to be merged and then do the final testing on PEFT.
>
> There is a small merge conflict now in the Dockerfile. It's just because we use conda run now, it should be easy to fix. Could you please take care?

Thank you for your reply. The Dockerfile conflict has been resolved.

fix gptq test

69789af

Signed-off-by: ZX-ModelCloud <[email protected]>

ZX-ModelCloud mentioned this pull request Dec 1, 2025

Fully deprecate AutoGPTQ for GPT-QModel huggingface/optimum#2385

Open

3 tasks

ZX-ModelCloud added 2 commits December 1, 2025 09:49

remove auto_gptq

53a66fd

Signed-off-by: ZX-ModelCloud <[email protected]>

call hf_select_quant_linear_v2()

67449b7

Signed-off-by: ZX-ModelCloud <[email protected]>

ZX-ModelCloud changed the title ~~[WIP] Fully deprecate AutoGPTQ for GPT-QModel~~ [WIP] Fully deprecate AutoGPTQ and AutoAWQ for GPT-QModel Dec 1, 2025

ZX-ModelCloud added 2 commits December 2, 2025 03:04

remove auto_awq

7f94e7c

Signed-off-by: ZX-ModelCloud <[email protected]>

cleanup

099ac0d

Signed-off-by: ZX-ModelCloud <[email protected]>

ZX-ModelCloud marked this pull request as ready for review December 2, 2025 09:07

ZX-ModelCloud changed the title ~~[WIP] Fully deprecate AutoGPTQ and AutoAWQ for GPT-QModel~~ Fully deprecate AutoGPTQ and AutoAWQ for GPT-QModel Dec 2, 2025

ZX-ModelCloud added 2 commits December 2, 2025 10:03

format

c31fe66

Signed-off-by: ZX-ModelCloud <[email protected]>

fix PeftAwqGPUTests

8173e33

Signed-off-by: ZX-ModelCloud <[email protected]>

Merge branch 'main' into gptqmodel

d914291
