Pull requests: argonne-lcf/Megatron-DeepSpeed


Pull requests list

Update training.py
#98 opened Jan 30, 2026 by mngom2
Lb optimizers MuP
#97 opened Jan 27, 2026 by AGupta41
fix: Dont explicitly source AWS plugin on Polaris
#95 opened Oct 20, 2025 by saforem2
Pull new upstream changes into microsoft-main-fpdt
#91 opened Jul 16, 2025 by saforem2
Pull upstream into saforem2/fix-formatting
#90 opened Jul 16, 2025 by saforem2
Word Embedding init std adjustment
#85 opened May 6, 2025 by hatanp
Adding the new feature of FPDT (#441)
#70 opened Dec 6, 2024 by saforem2
adding my changes to main repo
#60 opened Oct 2, 2024 by chian
add agpt inference scripts
#54 opened Sep 3, 2024 by vksastry
Pull in DPO loss
#48 opened Jul 15, 2024 by saforem2
[WIP] Async checkpointing support
#12 opened May 10, 2024 by zhenghh04 Draft