Megatron format conversion fails while loading the model #5856

@looput

Describe the bug

When converting the model to mcore format with the following command:

CUDA_VISIBLE_DEVICES=0 swift export \
    --model /root/.cache/models/Qwen3-30B-A3B-Instruct-2507/ \
    --to_mcore true \
    --torch_dtype bfloat16 \
    --output_dir /root/.cache/models/Qwen3-30B-A3B-Instruct-2507-mcore \
    --test_convert_precision true

the following error occurs:

 in _wrapper
[rank0]:     return func(*args, **kwargs)
[rank0]:            ^^^^^^^^^^^^^^^^^^^^^
[rank0]:   File "/root/miniconda3/envs/mcore/lib/python3.11/site-packages/transformers/modeling_utils.py", line 5176, in from_pretrained
[rank0]:     ) = cls._load_pretrained_model(
[rank0]:         ^^^^^^^^^^^^^^^^^^^^^^^^^^^
[rank0]: TypeError: _new__load_pretrained_model() missing 1 required positional argument: 'pretrained_model_name_or_path'
[rank0]:[W917 16:53:31.821412936 ProcessGroupNCCL.cpp:1496] Warning: WARNING: destroy_process_group() was not called before program exit, which can leak resources. For more info, please see https://pytorch.org/docs/stable/distributed.html#shutdown (function operator())
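One possible reading of this traceback, offered as an assumption and not a confirmed diagnosis: the name `_new__load_pretrained_model` suggests that `_load_pretrained_model` has been replaced by a patched function, and that the patch expects an argument which the installed `transformers` call site no longer passes. The sketch below reproduces that class of failure with a hypothetical `Loader` class. It does not use the real `transformers` or swift code, and the signatures are invented for illustration.

```python
class Loader:
    """Hypothetical stand-in for a library class; not the transformers API."""

    @classmethod
    def _load_pretrained_model(cls, model, state_dict, checkpoint_files):
        return f"loaded {len(checkpoint_files)} file(s)"

    @classmethod
    def from_pretrained(cls, path):
        # Newer-style call site: the path is not passed as its own argument.
        return cls._load_pretrained_model("model", None, [path])


def _new__load_pretrained_model(cls, model, state_dict, checkpoint_files,
                                pretrained_model_name_or_path, *args, **kwargs):
    # Hypothetical patch written against an older signature that still
    # expected pretrained_model_name_or_path as a positional argument.
    return "patched load"


def reproduce():
    """Apply the patch, trigger the call, and return the TypeError message."""
    original = Loader.__dict__["_load_pretrained_model"]
    Loader._load_pretrained_model = classmethod(_new__load_pretrained_model)
    try:
        Loader.from_pretrained("/path/to/checkpoint")
    except TypeError as exc:
        return str(exc)
    finally:
        # Restore the unpatched method so the sketch has no lasting effect.
        Loader._load_pretrained_model = original
    return None


if __name__ == "__main__":
    print(reproduce())
```

If this reading is right, the error would depend on the combination of installed package versions and not on the model or the export flags.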

Your hardware and system info
CUDA: 12.4
GPU: H20
Torch: 2.6.0
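The list above does not include the Python package versions, which may matter for an error raised inside `transformers`. A minimal sketch for collecting them with the standard library is shown below. The distribution names `ms-swift`, `transformers`, `torch`, and `megatron-core` are assumptions about how the packages were installed in this environment; any that are missing are reported as such.

```python
from importlib.metadata import PackageNotFoundError, version


def collect_versions(packages):
    """Return a mapping of distribution name to installed version string."""
    report = {}
    for name in packages:
        try:
            report[name] = version(name)
        except PackageNotFoundError:
            # The distribution is not installed under this name.
            report[name] = "not installed"
    return report


if __name__ == "__main__":
    # Assumed distribution names; adjust to match the actual environment.
    names = ["ms-swift", "transformers", "torch", "megatron-core"]
    for name, ver in collect_versions(names).items():
        print(f"{name}: {ver}")
```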

Labels: bug