Replies: 4 comments
Take the refactoring of the gpt-oss replacement as an example: a4a4118
This need is fairly general, especially for MoE-related models. Let's include more people in the review: @wenhuach21, @n1ck-guo, @WeiweiZhang1, @mengniwang95, @xinhe3. Please give your feedback so that we can move to the next step, such as raising a PR.
Consider adding a function to verify the model's configuration or related information. Relying solely on the module name may not be robust enough.
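To make the suggestion concrete, here is a minimal sketch of such a check. It assumes a hypothetical `should_replace` helper and a hypothetical `REPLACEMENT_RULES` table, neither of which is an existing AutoRound API. A module is replaced only when both its class name and the config's `model_type` match.

```python
from types import SimpleNamespace

# Hypothetical registry: module class name -> model_type values it is valid for.
# The entries are illustrative assumptions, not AutoRound's actual table.
REPLACEMENT_RULES = {
    "GptOssExperts": {"gpt_oss"},
}


def should_replace(module, config, rules=REPLACEMENT_RULES):
    """Return True only if the module class name AND the model config both match."""
    allowed_model_types = rules.get(type(module).__name__)
    if allowed_model_types is None:
        return False  # class name is not registered for replacement
    # Second check: the config must identify a supported architecture.
    return getattr(config, "model_type", None) in allowed_model_types


class GptOssExperts:  # stand-in for the real module class
    pass


module = GptOssExperts()
print(should_replace(module, SimpleNamespace(model_type="gpt_oss")))
print(should_replace(module, SimpleNamespace(model_type="llama")))
```

With this shape, an unrelated model that happens to reuse the same module name is skipped rather than silently patched.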
We also need to consider how to apply this in AutoRound inference on the Transformers backend, where the replacement must occur before loading the weights. However, we could assume that the weights are aligned with the replaced module.
Design for #899
Usage
@wenhuach21 @n1ck-guo Please help review this design, thanks.
cc @thuang6