-
Notifications
You must be signed in to change notification settings - Fork 181
Pull requests: ByteDance-Seed/VeOmni
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[parallel, ci] feat: support loading large tensor for fsdp2
ci
#688
opened Apr 23, 2026 by
JorgenWan
Collaborator
Loading…
[ci] refactor: consolidate workflows and reorganize tests
ci
#687
opened Apr 23, 2026 by
FoolPlayer
Collaborator
Loading…
5 tasks done
[parallel] fix: set gather_outputs no grad scale by default
fix
#681
opened Apr 21, 2026 by
JorgenWan
Collaborator
Loading…
[ci] fix: update npu ci workflow yml
ascend
everything about Ascend support
ci
fix
#657
opened Apr 15, 2026 by
FoolPlayer
Collaborator
Loading…
[parallel] fix: NPU Hang/Deadlock during DTensor parameter loading in FSDP2
ascend
everything about Ascend support
fix
#642
opened Apr 10, 2026 by
First-Frost-code
Loading…
[docker] feat: update to torch2.10 + cu130
docker
#629
opened Apr 2, 2026 by
FoolPlayer
Collaborator
Loading…
[model]feat: add NPU support for Qwen3.5
ascend
everything about Ascend support
#628
opened Apr 2, 2026 by
yanghw116
Loading…
[model]feat: Qwen3.5 is compatible with NPU
ascend
everything about Ascend support
#600
opened Mar 23, 2026 by
wang-hua-2019
Contributor
Loading…
[model] feat: [transformers-v5] Introduce new registration based kernel replacement.
#569
opened Mar 16, 2026 by
piyifan123
Collaborator
Loading…
[parallel] feat: Vision Data Parallel — O(1) communication alternative to patch-level SP
#505
opened Feb 24, 2026 by
aoshen524
Loading…
2 of 3 tasks
[models] chore: Change transformers v5 support for qwen3_moe to use HF v5 style expert weight layout and add a converter impl.
hf_v5
Related for transformers v5
misc
Every misc
#500
opened Feb 24, 2026 by
piyifan123
Collaborator
•
Draft
[task] feat: support sequence classification tasks
#470
opened Feb 11, 2026 by
yiwzhao
Collaborator
Loading…
[model] fix: Incorrect usage of the 'check_model_inputs' decorator
fix
#457
opened Feb 5, 2026 by
HSYZhang
Contributor
Loading…
6 tasks done
[misc] chore: add_copy_right
misc
Every misc
#438
opened Jan 30, 2026 by
FoolPlayer
Collaborator
Loading…
Draft [models] feat: Add a modeling patch gen sample for qwen3
#424
opened Jan 26, 2026 by
piyifan123
Collaborator
Loading…
6 tasks
[docs]Update ascend_quick_start doc
ascend
everything about Ascend support
#225
opened Nov 27, 2025 by
Alter-A1ways
Loading…
4 of 6 tasks
[optim, config] feat: add support for Muon optimizer via dion
doc
Improvements or additions to documentation
#216
opened Nov 25, 2025 by
clarkipeng
Loading…
[fix] [model] auto-patch all Attention layers to ensure cu_seq_lens stays on CPU for NPU fused-attention.
ascend
everything about Ascend support
#199
opened Nov 17, 2025 by
A1waysBeenHere
Contributor
Loading…
4 of 6 tasks
Add TensorBoard support for training metrics logging
#195
opened Nov 14, 2025 by
iqiancheng
Contributor
Loading…
train qwen3-vl-moe on ShareGPT4V-small with quick-start
#194
opened Nov 14, 2025 by
iqiancheng
Contributor
Loading…
Previous Next
ProTip!
Add no:assignee to see everything that’s not assigned.