Skip to content

Pull requests: ByteDance-Seed/VeOmni

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[parallel, ci] feat: support loading large tensor for fsdp2 ci
#688 opened Apr 23, 2026 by JorgenWan Collaborator Loading…
[ci] refactor: consolidate workflows and reorganize tests ci
#687 opened Apr 23, 2026 by FoolPlayer Collaborator Loading…
5 tasks done
[parallel] fix: set gather_outputs no grad scale by default fix
#681 opened Apr 21, 2026 by JorgenWan Collaborator Loading…
[ci] chore: drop v4 tests for models already migrated to v5 ci misc Every misc
#676 opened Apr 18, 2026 by TimYangst Collaborator Draft
3 of 4 tasks
[model] feat: migrate seed_oss to transformers v5 hf_v5 Related for transformers v5
#662 opened Apr 15, 2026 by TimYangst Collaborator Draft
4 tasks done
[model, ci] feat: migrate deepseek_v3 to transformers v5 ci hf_v5 Related for transformers v5
#661 opened Apr 15, 2026 by TimYangst Collaborator Draft
4 tasks
[ci] fix: update npu ci workflow yml ascend everything about Ascend support ci fix
#657 opened Apr 15, 2026 by FoolPlayer Collaborator Loading…
[docker] feat: update to torch2.10 + cu130 docker
#629 opened Apr 2, 2026 by FoolPlayer Collaborator Loading…
[model]feat: add NPU support for Qwen3.5 ascend everything about Ascend support
#628 opened Apr 2, 2026 by yanghw116 Loading…
[model]feat: Qwen3.5 is compatible with NPU ascend everything about Ascend support
#600 opened Mar 23, 2026 by wang-hua-2019 Contributor Loading…
[data] feat: add MultiSourceDataset for weighted sampling
#522 opened Feb 28, 2026 by hjshi84 Collaborator Draft
6 tasks
[misc] fix: use dedicated Gloo process group for HF safetensor save to avoid NCCL timeouts ckpt Checkpoint related. fix misc Every misc
#492 opened Feb 18, 2026 by Ziyi-Wang Collaborator Loading…
[task] feat: support sequence classification tasks
#470 opened Feb 11, 2026 by yiwzhao Collaborator Loading…
[model] fix: Incorrect usage of the 'check_model_inputs' decorator fix
#457 opened Feb 5, 2026 by HSYZhang Contributor Loading…
6 tasks done
[misc] chore: add_copy_right misc Every misc
#438 opened Jan 30, 2026 by FoolPlayer Collaborator Loading…
Draft [models] feat: Add a modeling patch gen sample for qwen3
#424 opened Jan 26, 2026 by piyifan123 Collaborator Loading…
6 tasks
[docs]Update ascend_quick_start doc ascend everything about Ascend support
#225 opened Nov 27, 2025 by Alter-A1ways Loading…
4 of 6 tasks
[optim, config] feat: add support for Muon optimizer via dion doc Improvements or additions to documentation
#216 opened Nov 25, 2025 by clarkipeng Loading…
[fix] [model] auto-patch all Attention layers to ensure cu_seq_lens stays on CPU for NPU fused-attention. ascend everything about Ascend support
#199 opened Nov 17, 2025 by A1waysBeenHere Contributor Loading…
4 of 6 tasks
Add TensorBoard support for training metrics logging
#195 opened Nov 14, 2025 by iqiancheng Contributor Loading…
train qwen3-vl-moe on ShareGPT4V-small with quick-start
#194 opened Nov 14, 2025 by iqiancheng Contributor Loading…
ProTip! Add no:assignee to see everything that’s not assigned.