[Frontend] Refactor how frontend handles node datatypes #641

stomfaig · 2025-12-13T16:48:36Z

This PR gives Graph a more direct reference system for the different node types (e.g. Input, Param).

Some notes about the current version of the draft.

GraphImporter still receives only inputs and params in TensorMeta form (see the ambiguity described below). I've left it this way because we do not expect this to change again in GraphImporter, so there is no reason to introduce more complication, though the notation will be improved to reflect this.
One point of friction is that throughout the frontend there seem to be two notions of tensor_meta: one is the type TensorMeta, the other is a plain dict, either dict[str, Any] or dict[str, list[Any]]. For now I have left a convenience script in for the sake of this draft, but let's discuss what the general direction should be.
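
To make the two notions concrete, here is a minimal sketch of what a normalizing helper could look like. The `TensorMeta` class below is a hypothetical stand-in, and the `"shape"` / `"dtype"` keys and both dict layouts are assumptions for illustration, not the frontend's actual definitions.

```python
from dataclasses import dataclass
from typing import Any, Union


@dataclass(frozen=True)
class TensorMeta:
    """Hypothetical stand-in for the frontend's TensorMeta type."""

    shape: tuple
    dtype: Any


def to_tensor_meta_list(meta: Union[TensorMeta, dict]) -> list:
    """Normalize either notion of tensor_meta into a list of TensorMeta.

    Assumed dict layouts (illustrative only):
      {"shape": [2, 3], "dtype": "f32"}                  -> one tensor
      {"shape": [[2, 3], [4]], "dtype": ["f32", "i64"]}  -> several tensors
    """
    if isinstance(meta, TensorMeta):
        return [meta]

    shape, dtype = meta["shape"], meta["dtype"]

    # Multi-tensor layout: dtype is a list with one entry per tensor.
    if isinstance(dtype, list):
        if len(shape) != len(dtype):
            raise ValueError("shape and dtype lists differ in length")
        return [TensorMeta(tuple(s), d) for s, d in zip(shape, dtype)]

    # Single-tensor layout: shape is a flat list of dimensions.
    return [TensorMeta(tuple(shape), dtype)]
```

Whether such a helper should live at the boundary (GraphImporter) or be removed entirely once everything uses TensorMeta is part of the open question above.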

Closes #639

stomfaig · 2025-12-13T16:48:58Z

cc: @zhanghb97 @R-Tars

R-Tars · 2025-12-18T11:06:13Z

Thanks for pointing this out — I agree it is an important issue.

For now, I don’t think we should rush to unify tensor_meta, as doing so would likely require changes across a large amount of existing operator and frontend code. Given the scope of this PR, deferring that work seems reasonable.

In the longer term, I do think converging on TensorMeta as the unified representation would be preferable, but this can be revisited once the new reference system stabilizes.

R-Tars · 2025-12-23T06:30:58Z

I ran into a runtime issue when testing this PR locally with Torch 2.8. The build fails while generating the DeepSeek-R1 example, and the traceback points to eliminate_weight_transpose.py in eliminate_transpose, with the following error:

AttributeError: 'int' object has no attribute 'shape'

In this case, tensor_meta seems to be resolved as an int instead of the expected TensorMeta (or at least an object with a shape attribute), so this part likely needs to be fixed or unified in this PR.
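
As a debugging aid, a small guard in front of the `.shape` access would make this kind of failure report which node carried the unexpected value. The sketch below is hypothetical: `require_shape` is an illustrative name, not an existing frontend function, and `SimpleNamespace` only stands in for an object with a `shape` attribute.

```python
from types import SimpleNamespace
from typing import Any


def require_shape(tensor_meta: Any, node_name: str) -> tuple:
    """Return tensor_meta.shape as a tuple, or raise a descriptive TypeError."""
    shape = getattr(tensor_meta, "shape", None)
    if shape is None:
        raise TypeError(
            f"node {node_name!r}: expected tensor_meta with a 'shape' attribute, "
            f"got {type(tensor_meta).__name__} ({tensor_meta!r})"
        )
    return tuple(shape)


# Usage: an object with a shape passes through unchanged.
ok = require_shape(SimpleNamespace(shape=[4, 8]), "weight")

# Usage: an int, as in the traceback above, is rejected with the node name.
try:
    require_shape(3, "weight")
except TypeError as err:
    message = str(err)
```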

stomfaig · 2025-12-23T09:34:33Z

Thanks for pointing that out. I have been struggling to validate whether the change causes any breakages.

I'll try to figure out something and maybe add frontend tests.

R-Tars · 2025-12-23T09:44:34Z

Thanks for looking into this. Just to add some context: a recent commit has caused most models to fail at runtime, so the frontend is in a rather unstable state, and we are working on a fix. As a result, even if this specific issue is fixed, models may still not run correctly at the moment.

R-Tars · 2026-01-07T10:43:45Z

It seems that this PR has not been updated for some time. I pulled the latest changes today and tested them locally against the current main branch. At the moment, the DeepSeek-R1 example still cannot run successfully, so the runtime issue does not appear to be fully resolved yet.

For this PR to be ready for merging, I think a minimum requirement is that the DeepSeek-R1 example runs correctly from start to finish.

If you encounter any difficulties fixing the issue, please feel free to let me know — I am happy to help with debugging.

feat: draft of revised node reference system

4e0a1ee

stomfaig changed the title ~~feat: draft of revised node reference system~~ [Frontend] Refactor how frontend handles node datatypes Dec 13, 2025

stomfaig marked this pull request as draft December 13, 2025 16:53

chore: remove leftover comments

ff13b87

stomfaig added 2 commits December 18, 2025 23:10

rename shape getters and add fixmes

dd974dc

rename GraphImporter fields to _shapes for more clarity

ee2d1d1

stomfaig marked this pull request as ready for review December 18, 2025 23:21

stomfaig added 2 commits December 23, 2025 22:28

add type annotation to params

4b02a49

use newly exposed interfaces in eliminate_weight_transpose

041c57c

stomfaig mentioned this pull request Dec 24, 2025

[Frontend] Add quantisation pass #651

Draft

stomfaig added 2 commits December 28, 2025 20:49

adjust node indices when deleting nodes

790987e

remove deprecated fn

92ef092
