[wip] Distributed Scion/Muon #1630

rakkit · 2025-08-25T02:05:45Z

This is a distributed version of Scion or Modular Norm, muon is considered to be a variant of this by using explicit AdamW for LLM's embedding/output.

Works:

Embedding/head
FSDP/DP/TP/EP/CP/PP parameters
Bias (mainly for norm)
weight decay

Missing

Conv

Need some extra work to adjust the EP changes for EP-[shard(1)] and ETP?

At the moment, we need a long configuration to initialize it, will fix later by passing the list

CC @janEbert @ofivite

tianyu-l

Thanks for the PR on cutting-edge features!

I didn't read the papers so please forgive me if what I comment doesn't make sense.

I guess for "core" changes such as this one on optimizers, the recommended path is to first land in pytorch/pytorch, and then expose minimal interfaces to torchtitan. torchtitan shouldn't be a place to host core features.

cc @janeyx99 on interesting optimizer work

rakkit requested review from tianyu-l, fegin, wwwjn and wconstab as code owners August 25, 2025 02:05

meta-cla bot added the CLA Signed This label is managed by the Meta Open Source bot. label Aug 25, 2025

rakkit force-pushed the dist-scion branch from a67f1c3 to eae4684 Compare August 25, 2025 02:08

rakkit mentioned this pull request Aug 25, 2025

[RFC] distributed scion/muon #1636

Open

init scion

40e9cbb

rakkit force-pushed the dist-scion branch from eae4684 to 40e9cbb Compare August 26, 2025 00:53

tianyu-l requested changes Aug 26, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[wip] Distributed Scion/Muon #1630

[wip] Distributed Scion/Muon #1630

Uh oh!

rakkit commented Aug 25, 2025 •

edited

Loading

Uh oh!

tianyu-l left a comment

Uh oh!

Uh oh!

[wip] Distributed Scion/Muon #1630

Are you sure you want to change the base?

[wip] Distributed Scion/Muon #1630

Uh oh!

Conversation

rakkit commented Aug 25, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

tianyu-l left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

rakkit commented Aug 25, 2025 •

edited

Loading