Commit 13fbd43
authored
[TRTLLM-9370][feat] Integration of CuteDSL NVFP4 grouped GEMM (Part 2: SwiGLU Fusion and Finalize Fusion) (#9288)
Signed-off-by: Enwei Zhu <[email protected]>1 parent 9b2abb8 commit 13fbd43
File tree
16 files changed
+6229
-270
lines changed- cpp/tensorrt_llm
- kernels
- cuteDslKernels
- trtllmGenKernels/blockScaleMoe
- thop
- tensorrt_llm
- _torch
- custom_ops
- cute_dsl_kernels/blackwell
- modules/fused_moe
- tools/layer_wise_benchmarks
- tests
- integration/test_lists
- unittest/_torch
- modules
- thop/parallel
16 files changed
+6229
-270
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
142 | 142 | | |
143 | 143 | | |
144 | 144 | | |
145 | | - | |
| 145 | + | |
146 | 146 | | |
147 | 147 | | |
148 | 148 | | |
| |||
383 | 383 | | |
384 | 384 | | |
385 | 385 | | |
386 | | - | |
| 386 | + | |
387 | 387 | | |
388 | 388 | | |
389 | 389 | | |
| |||
Lines changed: 1 addition & 2 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
647 | 647 | | |
648 | 648 | | |
649 | 649 | | |
650 | | - | |
651 | | - | |
| 650 | + | |
652 | 651 | | |
653 | 652 | | |
654 | 653 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
120 | 120 | | |
121 | 121 | | |
122 | 122 | | |
123 | | - | |
124 | | - | |
| 123 | + | |
| 124 | + | |
125 | 125 | | |
126 | 126 | | |
127 | 127 | | |
| |||
Large diffs are not rendered by default.
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
52 | 52 | | |
53 | 53 | | |
54 | 54 | | |
| 55 | + | |
55 | 56 | | |
56 | | - | |
| 57 | + | |
| 58 | + | |
57 | 59 | | |
58 | 60 | | |
59 | 61 | | |
| |||
88 | 90 | | |
89 | 91 | | |
90 | 92 | | |
91 | | - | |
| 93 | + | |
92 | 94 | | |
93 | 95 | | |
94 | 96 | | |
| |||
2052 | 2054 | | |
2053 | 2055 | | |
2054 | 2056 | | |
2055 | | - | |
2056 | | - | |
2057 | | - | |
2058 | 2057 | | |
2059 | 2058 | | |
2060 | 2059 | | |
| |||
2138 | 2137 | | |
2139 | 2138 | | |
2140 | 2139 | | |
2141 | | - | |
| 2140 | + | |
2142 | 2141 | | |
| 2142 | + | |
2143 | 2143 | | |
2144 | 2144 | | |
2145 | 2145 | | |
| |||
2198 | 2198 | | |
2199 | 2199 | | |
2200 | 2200 | | |
2201 | | - | |
| 2201 | + | |
2202 | 2202 | | |
2203 | 2203 | | |
2204 | 2204 | | |
2205 | 2205 | | |
2206 | 2206 | | |
2207 | | - | |
2208 | | - | |
2209 | | - | |
| 2207 | + | |
2210 | 2208 | | |
2211 | 2209 | | |
2212 | 2210 | | |
2213 | | - | |
| 2211 | + | |
2214 | 2212 | | |
2215 | 2213 | | |
2216 | 2214 | | |
2217 | 2215 | | |
2218 | | - | |
| 2216 | + | |
2219 | 2217 | | |
2220 | 2218 | | |
2221 | 2219 | | |
| |||
2238 | 2236 | | |
2239 | 2237 | | |
2240 | 2238 | | |
2241 | | - | |
| 2239 | + | |
2242 | 2240 | | |
2243 | 2241 | | |
2244 | 2242 | | |
| |||
2266 | 2264 | | |
2267 | 2265 | | |
2268 | 2266 | | |
2269 | | - | |
2270 | 2267 | | |
2271 | 2268 | | |
2272 | 2269 | | |
| |||
0 commit comments