-
Notifications
You must be signed in to change notification settings - Fork 878
Open
Labels
bugSomething isn't workingSomething isn't working
Description
Describe the bug
进行模型格式转换时
CUDA_VISIBLE_DEVICES=0 swift export \
--model /root/.cache/models/Qwen3-30B-A3B-Instruct-2507/ \
--to_mcore true \
--torch_dtype bfloat16 \
--output_dir /root/.cache/models/Qwen3-30B-A3B-Instruct-2507-mcore \
--test_convert_precision true
出现以下错误
in _wrapper
[rank0]: return func(*args, **kwargs)
[rank0]: ^^^^^^^^^^^^^^^^^^^^^
[rank0]: File "/root/miniconda3/envs/mcore/lib/python3.11/site-packages/transformers/modeling_utils.py", line 5176
, in from_pretrained
[rank0]: ) = cls._load_pretrained_model(
[rank0]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^
[rank0]: TypeError: _new__load_pretrained_model() missing 1 required positional argument: 'pretrained_model_name_or_
path'
[rank0]:[W917 16:53:31.821412936 ProcessGroupNCCL.cpp:1496] Warning: WARNING: destroy_process_group() was not called
before program exit, which can leak resources. For more info, please see https://pytorch.org/docs/stable/distribute
d.html#shutdown (function operator())
Your hardware and system info
CUDA: 12.4
GPU: H20
Torch: 2.6.0
Metadata
Metadata
Assignees
Labels
bugSomething isn't workingSomething isn't working