[llama4] add apply_compile for moe, where fullgraph=False for moe layers #1519

danielvegamyhre · 2025-08-02T00:02:12Z

We should add an `apply_compile` function for llama4 that uses `fullgraph=False` for MoE layers and `fullgraph=True` for dense layers.

I keep manually applying this hack during development to test compile composability, but IMO we should have this merged and update to use fullgraph=True everywhere once that is supported.
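For illustration, here is a minimal sketch of the per-layer choice described above. It is not the merged torchtitan code. It assumes the model exposes its transformer blocks under `model.layers` and that each block carries a boolean `moe_enabled` attribute. `Block` and the `compile_fn` parameter are hypothetical, added only so the sketch is self-contained and can be exercised without running a real compile.

```python
from typing import Callable

import torch
import torch.nn as nn


class Block(nn.Module):
    """Hypothetical stand-in for a transformer block (not torchtitan's class)."""

    def __init__(self, dim: int, moe_enabled: bool):
        super().__init__()
        # Assumption: each block exposes a flag saying whether it is an MoE block.
        self.moe_enabled = moe_enabled
        self.proj = nn.Linear(dim, dim)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.proj(x)


def apply_compile(model: nn.Module, compile_fn: Callable = torch.compile) -> None:
    """Compile every block in model.layers, one block at a time.

    Dense blocks use fullgraph=True, so graph breaks raise an error.
    MoE blocks use fullgraph=False, so graph breaks are tolerated.
    """
    # Snapshot the children first, since each entry is replaced in the loop.
    for layer_id, block in list(model.layers.named_children()):
        fullgraph = not getattr(block, "moe_enabled", False)
        compiled = compile_fn(block, fullgraph=fullgraph)
        # Swap the compiled wrapper in under the same name.
        model.layers.register_module(layer_id, compiled)
```

Once MoE layers can be traced without graph breaks, the `fullgraph` expression could be replaced with a constant `True`.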

cc @xmfan @tianyu-l any thoughts?

tianyu-l

fyi @xmfan had #1365

we should work together to figure out what to do

danielvegamyhre · 2025-08-04T21:44:47Z

@xmfan any thoughts on when #1365 will be landable? if you plan to land it soon we can close this. this is a temporary solution to avoid having to add this change to every feature branch in torchtitan i make while doing MoE related work across torchao and torchtitan

xmfan · 2025-08-04T21:55:08Z

@tianyu-l i think fullgraph=False is better than compile just not working on main, so good with landing this

tianyu-l

should do this to dsv3 too

@xmfan

…ers (pytorch#1519) We should add an `apply_compile` function for llama4 that uses fullgraph=False for MoE layers and fullgraph=True for dense layers. I keep manually applying this hack during development to test compile composability, but IMO we should have this merged and update to use fullgraph=True everywhere once that is supported. cc @xmfan @tianyu-l any thoughts?

add apply_compile for moe, where fullgraph=False for moe layers

86efe09

meta-cla bot added the CLA Signed label Aug 2, 2025

tianyu-l reviewed Aug 2, 2025

xmfan approved these changes Aug 4, 2025

tianyu-l approved these changes Aug 4, 2025

danielvegamyhre merged commit 2844029 into pytorch:main Aug 4, 2025
5 checks passed
