Conversation

@stomfaig
Contributor

@stomfaig stomfaig commented Dec 24, 2025

This PR implements a new pass for quantising graphs. The pass repeatedly sweeps over the nodes: for each node it checks whether quantisability can be decided, attempts to quantise the node if so, and repeats the sweep until no further progress is made. For more details see the code; it is extensively documented.
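The sweep described above can be sketched as a fixed-point iteration. The snippet below is only an illustration, not the actual pass: `Node`, `can_quantise`, and the rule "a node is quantisable once all of its parents are quantised" are stand-ins for the real logic in frontend/Python/graph/transform/quantise.py.

```python
class Node:
    """Toy stand-in for a graph op node (illustrative only)."""

    def __init__(self, name, parents=()):
        self.name = name
        self.parents = list(parents)
        self.quantised = False


def can_quantise(node):
    # Stand-in decision rule: a node can be quantised once every parent
    # has been quantised (trivially true for input nodes with no parents).
    return all(p.quantised for p in node.parents)


def quantise_graph(nodes):
    """Sweep the node list until a fixed point: a full pass with no change."""
    changed = True
    while changed:
        changed = False
        for node in nodes:
            if not node.quantised and can_quantise(node):
                node.quantised = True
                changed = True
    return nodes


# Example: a small chain x -> linear -> output, deliberately listed in
# reverse order to show the sweep converges regardless of node order.
x = Node("x")
linear = Node("linear", parents=[x])
out = Node("output", parents=[linear])
quantise_graph([out, linear, x])
print([n.quantised for n in (x, linear, out)])  # → [True, True, True]
```

Because each sweep only ever flips nodes from unquantised to quantised, the loop terminates after at most one pass per node.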

Note: this branch also includes the code from #641 so that it is runnable locally. The PoC only supports simple models like the one below and is intended as a demonstration of the approach. The changes strictly associated with this PR are confined to frontend/Python/graph/transform/quantise.py.

class SimpleNet(nn.Module):
    def __init__(self):
        super().__init__()
        self.linear = torch.nn.Linear(28, 28)

    def forward(self, x):
        return self.linear(x)
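For reference, the standard per-tensor affine scheme (not necessarily the exact scheme this pass uses) maps a float tensor onto an int8 grid via a scale and zero point. A minimal self-contained sketch, using a toy list of weight values:

```python
def quantise_tensor(values, num_bits=8):
    """Per-tensor affine quantisation sketch (illustrative, not this PR's code)."""
    qmin, qmax = -(2 ** (num_bits - 1)), 2 ** (num_bits - 1) - 1
    lo, hi = min(values), max(values)
    scale = (hi - lo) / (qmax - qmin)
    # Zero point aligns the float minimum with qmin, rounded to the grid.
    zero_point = round(qmin - lo / scale)
    q = [max(qmin, min(qmax, round(v / scale + zero_point))) for v in values]
    return q, scale, zero_point


def dequantise(q, scale, zero_point):
    return [(x - zero_point) * scale for x in q]


weights = [-0.8, -0.1, 0.0, 0.35, 0.9]  # toy weight values
q, scale, zp = quantise_tensor(weights)
recon = dequantise(q, scale, zp)
err = max(abs(a - b) for a, b in zip(weights, recon))
print(err <= scale)  # reconstruction error bounded by one quantisation step
```

The round-trip error stays within one quantisation step, which is why the pass can rewrite nodes without materially changing simple models like `SimpleNet`.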

Test driver to see the changes:

def main():

    model = SimpleNet()

    dynamo_compiler = DynamoCompiler(
        primary_registry=tosa.ops_registry,
        aot_autograd_decomposition=inductor_decomp,
        func_name="simple",
    )

    graphs = dynamo_compiler.importer(
        model,
        x=torch.rand((1, 1, 28, 28)),
    )

    assert len(graphs) == 1

    for graph in graphs:
        graph.init_op_group()

    graphs[0].perform(
        [quantise_graph]
    )

    g_body = graphs[0]._body
    for op in g_body:
        print("\n---")
        print(f"{op._name} : {type(op)}")
        print(f"children: {op._children}")
        print(f"parents: {op._parents}")
        print(f"tensor_meta: {op._tensor_meta}")

Depends on #641, #644

@stomfaig stomfaig marked this pull request as draft December 24, 2025 16:35
@stomfaig
Contributor Author

cc: @R-Tars

@stomfaig stomfaig changed the title Add quantisation pass [Frontend] Add quantisation pass Dec 26, 2025
@zhanghb97
Member

Looks like progress is going well. Just a reminder: we upgraded to LLVM 22. Don't forget to rebase to avoid potential compatibility issues.
