[bugfix] saving model without model.name_or_path #2061

HDCharles · 2025-11-20T20:08:20Z

Summary:

the way models were saved assumed that model.name_or_path was populated but this is not always the case. I think that if you generate a model and then immediately quantize it, you'll run into this problem.

This PR relaxes this assumption so that the model can be saved.

TEST PLAN:
ran https://gist.github.com/HDCharles/78c55da388076aa046ee893edfbd9df5

previously you'd get:

huggingface_hub.errors.HFValidationError: Repo id must use alphanumeric chars, '-', '_' or '.'. The name cannot start or end with '-' or '.' and the maximum length is 96: ''.

now:

no error

github-actions · 2025-11-20T20:08:29Z

👋 Hi! Thank you for contributing to llm-compressor. Please add the ready label when the PR is ready for review.

Note: This is required to complete the testing suite, please only add the label once the PR is code complete and local testing has been performed.

gemini-code-assist · 2025-11-20T20:08:37Z

Summary of Changes

Hello @HDCharles, I'm Gemini Code Assist¹! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request addresses a critical bug that prevented models from being saved correctly when their configuration's _name_or_path attribute was present but empty. The changes ensure that the model saving process is more robust and reliable. Additionally, the pull request improves the stability of recipe inference by implementing error handling for Hugging Face Hub validation issues, contributing to a more resilient overall system for model and recipe management.

Highlights

Model Saving Logic Refinement: The pull request modifies the model saving mechanism to ensure that the _name_or_path attribute of a model's configuration is not only present but also contains a non-empty string before being utilized. This resolves a bug where models could not be saved if _name_or_path was an empty string, relaxing a previous assumption.
Robust Recipe Inference: The infer_recipe_from_model_path function has been enhanced to gracefully handle HFValidationError exceptions that may occur when attempting to load recipes from the Hugging Face Hub cache. This prevents potential crashes and improves the resilience of the recipe inference process.
Debugging Additions: Several print statements have been introduced in key functions like copy_python_files_from_model_cache and infer_recipe_from_model_path to provide better visibility into configuration values and model path resolution during execution, aiding in future debugging efforts.

Using Gemini Code Assist

The full guide for Gemini Code Assist can be found on our documentation page, here are some quick tips.

Invoking Gemini

You can request assistance from Gemini at any point by creating a comment using either /gemini <command> or @gemini-code-assist <command>. Below is a summary of the supported commands on the current page.

Feature	Command	Description
Code Review	`/gemini review`	Performs a code review for the current pull request in its current state.
Pull Request Summary	`/gemini summary`	Provides a summary of the current pull request in its current state.
Comment	@gemini-code-assist	Responds in comments when explicitly tagged, both in pull request comments and review comments.
Help	`/gemini help`	Displays a list of available commands.

Customization

To customize Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by creating and adding files to a .gemini/ folder in the base of the repository. Detailed instructions can be found here.

Limitations & Feedback

Gemini Code Assist may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counter productive. You can react with 👍 and 👎 on @gemini-code-assist comments. If you're interested in giving your feedback about your experience with Gemini Code Assist for Github and other Google products, sign up here.

You can also get AI-powered code generation, chat, as well as code reviews directly in the IDE at no cost with the Gemini Code Assist IDE Extension.

Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double check it and use code with caution. ↩

gemini-code-assist

Code Review

This pull request addresses a bug where model saving would fail if model.name_or_path was not populated. The changes correctly relax this assumption by adding checks for an empty path and handling potential errors when resolving recipes from what might be an invalid Hugging Face Hub path. My review focuses on cleaning up some leftover debugging code and improving logging consistency. I've identified a couple of print statements that should be removed and suggested changing a log level from info to debug for an expected error case to avoid unnecessary noise in the logs. The core logic of the fix is sound.

src/llmcompressor/pytorch/model_load/helpers.py

src/llmcompressor/transformers/utils/helpers.py

Summary: the way models were saved assumed that model.name_or_path was populated but this is not always the case. This PR relaxes this assumption so that the model can be saved. Signed-off-by: HDCharles <[email protected]>

Summary Signed-off-by: HDCharles <[email protected]>

gemini-code-assist bot reviewed Nov 20, 2025

View reviewed changes

src/llmcompressor/pytorch/model_load/helpers.py Outdated Show resolved Hide resolved

src/llmcompressor/transformers/utils/helpers.py Outdated Show resolved Hide resolved

src/llmcompressor/transformers/utils/helpers.py Outdated Show resolved Hide resolved

HDCharles force-pushed the 97_fix_saving branch from 13909d4 to 189c48e Compare November 20, 2025 20:09

HDCharles requested review from dsikka and kylesayrs November 20, 2025 20:12

HDCharles added bug Something isn't working ready When a PR is ready for review labels Nov 20, 2025

HDCharles requested review from fynnsu, rahul-tuli and shanjiaz November 20, 2025 20:29

HDCharles mentioned this pull request Nov 20, 2025

[Bug]: Llama-4-Maverick-17B-128E-Instruct quantization skips all MoE experts → missing expert weights → vLLM load failure #2060

Open

HDCharles force-pushed the 97_fix_saving branch from a454f0f to 0e676e2 Compare November 20, 2025 21:37

HDCharles added 5 commits November 20, 2025 21:38

[bugfix] saving model without model.name_or_path

e7608e2

Summary: the way models were saved assumed that model.name_or_path was populated but this is not always the case. This PR relaxes this assumption so that the model can be saved. Signed-off-by: HDCharles <[email protected]>

formatting

a460f02

Summary Signed-off-by: HDCharles <[email protected]>

logger debug

91ee4ba

Summary Signed-off-by: HDCharles <[email protected]>

change to exit early

7904137

Summary Signed-off-by: HDCharles <[email protected]>

remove old soln

2e07360

Summary Signed-off-by: HDCharles <[email protected]>

HDCharles force-pushed the 97_fix_saving branch from 2aa6c59 to 2e07360 Compare November 20, 2025 21:39

HDCharles and others added 2 commits November 20, 2025 21:39

remove unneeded import

393fed0

Summary Signed-off-by: HDCharles <[email protected]>

Merge branch 'main' into 97_fix_saving

e4060f2

HDCharles enabled auto-merge (squash) November 20, 2025 22:01

fynnsu approved these changes Nov 20, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[bugfix] saving model without model.name_or_path #2061

[bugfix] saving model without model.name_or_path #2061

HDCharles commented Nov 20, 2025 •

edited

Loading

Uh oh!

github-actions bot commented Nov 20, 2025

Uh oh!

gemini-code-assist bot commented Nov 20, 2025

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

[bugfix] saving model without model.name_or_path #2061

Are you sure you want to change the base?

[bugfix] saving model without model.name_or_path #2061

Conversation

HDCharles commented Nov 20, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions bot commented Nov 20, 2025

Uh oh!

gemini-code-assist bot commented Nov 20, 2025

Summary of Changes

Highlights

Footnotes

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

HDCharles commented Nov 20, 2025 •

edited

Loading