Conversation

xmfan (Member) commented Aug 20, 2025

Stack from ghstack (oldest at bottom):

Fixes silent incorrectness for autograd function tracing, where we rely on FakeTensor metadata (`requires_grad`) to determine whether to use the HOP or not:

if requires_grad and torch.is_grad_enabled():

Stared at this with @anijain2305 yesterday: `Tensor.__setitem__` can update tensor metadata, and we can just run the fake prop and extract the output metadata from the updated FakeTensor.
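To make the metadata mutation concrete, here is a minimal standalone sketch (not this PR's test) of `Tensor.__setitem__` flipping `requires_grad` on the target in eager mode:

```python
import torch

x = torch.randn(4)                      # plain leaf, requires_grad=False
y = torch.randn(4, requires_grad=True)

assert not x.requires_grad
x[0] = y[0]                             # Tensor.__setitem__ with a grad-requiring source
assert x.requires_grad                  # metadata was mutated in place
assert type(x.grad_fn).__name__ == "CopySlices"
```

Under `torch.compile`, running the fake prop for the same op mutates the FakeTensor's metadata the same way, which is why reading `requires_grad` off the updated FakeTensor gives the right answer.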

FIXES #160901

It should also be the root cause behind the issue in pytorch/torchtitan#1604 @bdhirsh @ruisizhang123

cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @chenyang78 @kadeng @chauhang @amjames @Lucaskabela @mlazos

pytorch-bot (bot) commented Aug 20, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/161036

Note: Links to docs will display an error until the docs builds have been completed.

❌ 1 New Failure

As of commit 33ecc00 with merge base 95e456f:

NEW FAILURE - The following job has failed:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

xmfan added a commit that referenced this pull request Aug 20, 2025
xmfan added a commit that referenced this pull request Aug 20, 2025
@xmfan xmfan marked this pull request as ready for review August 20, 2025 14:41
@xmfan xmfan requested review from anijain2305 and bdhirsh August 20, 2025 14:44
        torch.compile(fn, backend="eager")(x2, y2).sum().backward()
        self.assertTrue(x2.requires_grad)

        self.assertEqual(x1.grad, x2.grad)

Contributor:
I tried running your test locally without your changes, and it looks like this test passes without the changes in this PR. It turns out that `x` is a non-leaf tensor after the mutation (`x.grad_fn` is `CopySlices`), so `x.grad` is `None` in both examples.

You probably need to assert on `self.assertEqual(y1.grad, y2.grad)` instead, since the "wrong gradients" that we are computing today should propagate into `y2.grad`.
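A quick standalone check of the non-leaf point (names are illustrative, not from the PR's test):

```python
import torch

x = torch.randn(4)
y = torch.randn(4, requires_grad=True)
x[0] = y[0]                    # the mutation turns x into a non-leaf tensor
assert not x.is_leaf
assert type(x.grad_fn).__name__ == "CopySlices"

# backward() only populates .grad on leaf tensors, so x.grad stays None
# whether or not the computed gradients are correct; the signal lands on y.
(x * y).sum().backward()
assert x.grad is None
assert y.grad is not None
```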


xmfan (Member, Author):

Thanks for the catch; let me just add the original repro's permutations...

        target_cls, tx, example_value, infer_subclass_type(example_value)
    )
    for k, v in specialized_props.items():
        setattr(self, k, v)

Contributor:
We might need to audit a few other places in dynamo too. For example, this will also change the requires_grad-ness of `x`:

    x = torch.randn(4)
    y = torch.randn(4, requires_grad=True)
    x.add_(y)
    # x.requires_grad is now True

so any tensor data mutation ops will need the same treatment.
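A runnable version of that sketch (standalone, assuming eager mode with grad enabled):

```python
import torch

x = torch.randn(4)                      # starts as a requires_grad=False leaf
y = torch.randn(4, requires_grad=True)

x.add_(y)                               # in-place data mutation with a grad-requiring operand
assert x.requires_grad                  # same metadata flip as Tensor.__setitem__
assert x.grad_fn is not None            # x is now a non-leaf with a backward node
```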


Contributor:
Confirmed locally. I tried tweaking your `input_fn` to use `add_` and hit the same correctness issue when asserting on `y.grad`:

        def fn(x, y):
            x.add_(y)
            return MyFn.apply(x)


Contributor:
Hmm, does this PR also fix the problem for `add_`? It's not clear to me where. (If not, no worries, but can you file another issue?)


bdhirsh (Contributor) left a comment:

Left a few comments - nice find!

xmfan added a commit that referenced this pull request Aug 20, 2025
xmfan added a commit that referenced this pull request Aug 21, 2025
@xmfan xmfan requested a review from bdhirsh August 21, 2025 04:00
xmfan (Member, Author) commented Aug 21, 2025

The hf_Bart failure reproduces on the base commit.

xmfan (Member, Author) commented Aug 22, 2025

@pytorchbot merge -i

pytorch-bot added the `ciflow/trunk` label (Trigger trunk jobs on your pull request) Aug 22, 2025
pytorchmergebot (Collaborator) commented:

Merge started

Your change will be merged while ignoring the following 1 checks: inductor / linux-jammy-cpu-py3.9-gcc11-inductor / test (cpu_inductor_torchbench, 1, 2, linux.8xlarge.amx)

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

