Pull requests: NVIDIA/Megatron-LM

Pull requests list

Changes to support latent MoEs
#2296 opened Nov 19, 2025 by deepakn94
Core 0.16
Bugfix for Mamba with Chunked-Prefill
#2293 opened Nov 18, 2025 by sidsingh-nvidia
6 tasks
Core 0.16
add FIM dataset support
Labels: Run tests
#2291 opened Nov 18, 2025 by dimapihtar
6 tasks
Core 0.16
build: Upgrade deps
Labels: Expert Review, Run tests
#2289 opened Nov 18, 2025 by ko3n1g
6 tasks
Core 0.16
[HOT FIX] Fix bug of hybrid-ep backend in flex-dispatcher
Labels: bug, Final Review
#2286 opened Nov 18, 2025 by Autumn1998
6 tasks
Core 0.16
[Draft][Dev] fix(moe): minor refactor for fine-grained activation offloading
Labels: dev branch
#2285 opened Nov 18, 2025 by lhb8125 Draft
6 tasks
Hybrid Context Parallel Feature
Labels: Expert Review
#2282 opened Nov 17, 2025 by parthmannan
6 tasks
Core 0.16
Add support for fake distributed process groups.
Labels: Expert Review
#2280 opened Nov 17, 2025 by Victarry
9 of 15 tasks
Core 0.16
Automatically choose available ports in ZMQ
#2278 opened Nov 17, 2025 by tdene Draft
6 tasks
Clean up DP coord code & unit test
Labels: Expert Review, Run tests
#2277 opened Nov 17, 2025 by tdene
6 tasks
Core 0.16
Change ZMQ communication to use async ZMQ
#2276 opened Nov 17, 2025 by tdene Draft
6 tasks
Remove dependency on megatron.training within megatron.core
Labels: Expert Review, Run tests
#2274 opened Nov 17, 2025 by ananthsub
1 of 6 tasks
Core 0.16
Add assertion for mxfp8 params without dp overlap
Labels: Expert Review
#2271 opened Nov 17, 2025 by kunlunl
6 tasks
Core 0.16
fix: Pass the timeout argument for the EP group
Labels: bug, Expert Review
#2268 opened Nov 17, 2025 by yanring
6 tasks
Core 0.16
Add MambaInferenceStateConfig dataclass
#2265 opened Nov 15, 2025 by santhnm2
6 tasks
Core 0.16
Added top n log probs
#2262 opened Nov 15, 2025 by shanmugamr1992
6 tasks
Top n dynamic
#2260 opened Nov 14, 2025 by shanmugamr1992
6 tasks
Revert active-buffer-size-gb arg name.
Labels: Expert Review
#2257 opened Nov 14, 2025 by lmcafee-nvidia
6 tasks
Core 0.15
[Dev] Feature: linear cross entropy fusion
Labels: dev branch, Expert Review
#2256 opened Nov 14, 2025 by Jianbing-D
4 of 6 tasks
[DEV] fix layerwise torch_dist checkpointing fails due to empty rank
Labels: dev branch, Expert Review
#2255 opened Nov 14, 2025 by FDecaYed
6 tasks
Core 0.16
[Dev] fix(megatron-fsdp): Resolve hang caused by non-deterministic reduce-scatter
Labels: dev branch
#2252 opened Nov 14, 2025 by xuwchen
6 tasks
Core 0.16
feat: check: api backwards compatibility
#2251 opened Nov 14, 2025 by pablo-garay
6 tasks
Core 0.16