-
Notifications
You must be signed in to change notification settings - Fork 3.3k
Pull requests: NVIDIA/Megatron-LM
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[Community][Main] feat(moe): Adding context parallel support to eager attention implementation
#2295
opened Nov 19, 2025 by
yuzhongw-nvidia
•
Draft
6 tasks
build: Upgrade deps
Expert Review
Apply this label to indicate that your PR is ready for expert review.
Run tests
Fix TemporalAsyncCaller pin_memory lifetime in async checkpointing
#2288
opened Nov 18, 2025 by
lvdunlin
Loading…
6 tasks
[HOT FIX] Fix bug of hybrid-ep backend in flex-dispatcher
bug
Something isn't working
Final Review
Apply this label to indicate that your PR is ready for final review.
[Draft][Dev] fix(moe): minor refactor for fine-grained activation offloading
dev branch
Dev branch related issues and development
Hybrid Context Parallel Feature
Expert Review
Apply this label to indicate that your PR is ready for expert review.
Add support for fake distributed process groups.
Expert Review
Apply this label to indicate that your PR is ready for expert review.
Clean up DP coord code & unit test
Expert Review
Apply this label to indicate that your PR is ready for expert review.
Run tests
Remove dependency on Apply this label to indicate that your PR is ready for expert review.
Run tests
megatron.training within megatron.core
Expert Review
Add assertion for mxfp8 params without dp overlap
Expert Review
Apply this label to indicate that your PR is ready for expert review.
fix: Pass the timeout argument for the EP group
bug
Something isn't working
Expert Review
Apply this label to indicate that your PR is ready for expert review.
Revert active-buffer-size-gb arg name.
Expert Review
Apply this label to indicate that your PR is ready for expert review.
[Dev] Feature: linear cross entropy fusion
dev branch
Dev branch related issues and development
Expert Review
Apply this label to indicate that your PR is ready for expert review.
#2256
opened Nov 14, 2025 by
Jianbing-D
Loading…
4 of 6 tasks
[DEV] fix layerwise torch_dist checkpointing fails due to empty rank
dev branch
Dev branch related issues and development
Expert Review
Apply this label to indicate that your PR is ready for expert review.
[Dev] fix(megatron-fsdp): Resolve hang caused by non-deterministic reduce-scatter
dev branch
Dev branch related issues and development
Previous Next
ProTip!
What’s not been updated in a month: updated:<2025-10-18.