Conversation

@matthewfl (Contributor)

This PR adds new raisings from `tensorrt.elementwise` Ops to `tensorrt.activation` for GELU Tanh (as emitted by torch-mlir) and GELU Erf. It also adds raising from `min(a, max(x, y))` to the CLIP activation (which torch-mlir uses for clip and ReLU6). Merging these elementwise ops into the `tensorrt.activation` Op enables TensorRT to fuse the activation into the preceding linear layer's kernel (matrix multiply + elementwise sum).
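
For reference, these are the standard closed forms the raised patterns correspond to (textbook definitions, not copied from the diff):

$$\mathrm{GELU}_{\tanh}(x) = \tfrac{1}{2}\,x\left(1 + \tanh\!\left(\sqrt{2/\pi}\,\bigl(x + 0.044715\,x^{3}\bigr)\right)\right)$$

$$\mathrm{GELU}_{\mathrm{erf}}(x) = \tfrac{1}{2}\,x\left(1 + \operatorname{erf}\!\left(x/\sqrt{2}\right)\right)$$

$$\mathrm{clip}(x, \alpha, \beta) = \min\bigl(\beta, \max(x, \alpha)\bigr), \qquad \mathrm{ReLU6}(x) = \mathrm{clip}(x, 0, 6)$$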

@shelkesagar29 added the mlir-tensorrt label on Jul 28, 2025
@matthewfl force-pushed the mfl/raise-activations branch from 0685a68 to 221b51a on August 7, 2025 at 20:47
…on to tensorrt.activation Op

Signed-off-by: Matthew Francis-Landau <[email protected]>

make the activation raising patterns use the same getSplatConstantElementAttribute utility function

Signed-off-by: Matthew Francis-Landau <[email protected]>
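
As a side note for reviewers unfamiliar with the utility named above: extracting the element attribute behind a splat constant operand is a common MLIR idiom. Below is a minimal sketch with an assumed signature (`Value` in, element `Attribute` out); the actual helper in mlir-tensorrt may differ:

```cpp
#include "mlir/IR/BuiltinAttributes.h"
#include "mlir/IR/Matchers.h"
#include "mlir/IR/Value.h"

using namespace mlir;

// Hypothetical sketch; the real getSplatConstantElementAttribute in
// mlir-tensorrt may have a different signature. Returns the element
// attribute of a splat constant feeding `v`, or null if `v` is not one.
static Attribute getSplatConstantElementAttribute(Value v) {
  DenseElementsAttr attr;
  // m_Constant binds the value attribute of any ConstantLike defining op.
  if (!matchPattern(v, m_Constant(&attr)) || !attr.isSplat())
    return {};
  return attr.getSplatValue<Attribute>();
}
```

A raising pattern can then check that operand against the expected constant (e.g. 0.044715 for the tanh GELU, or the bounds 0 and 6 for ReLU6) before rewriting to `tensorrt.activation`.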
@matthewfl force-pushed the mfl/raise-activations branch from 221b51a to ba10b6d on August 7, 2025 at 20:48
@matthewfl (Contributor, Author)

@shelkesagar29, can you merge the PR? (I don't have a merge button that I can click.)

@shelkesagar29 merged commit d8f3603 into NVIDIA:main on Aug 8, 2025
1 check passed