Fully deprecate AutoGPTQ and AutoAWQ for GPT-QModel #2932
Signed-off-by: ZX-ModelCloud <[email protected]>
@BenjaminBossan The PR is now synced with the pending Optimum/Transformers PRs and is ready for final review for this portion. All relevant tests pass when paired with the Transformers companion PR and the pending GPT-QModel 5.4.4 release (later today).
Thanks for all the work @ZX-ModelCloud and @Qubitium. Let's wait for the transformers PR to be merged and then do the final testing on PEFT. There is a small merge conflict now in the Dockerfile. It's just because we use
Thank you for your reply. The Dockerfile conflict has been resolved.
Remove AutoGPTQ clutter and AutoGPTQ-related configs for which backward compatibility is not worth maintaining.
See
huggingface/transformers#41567
huggingface/optimum#2385