Commit ee941ac
authored
[https://nvbugs/5456493][feat] add fp8 dense for sm120 (#9174)
Signed-off-by: CarstyYou <[email protected]>1 parent a79c0df commit ee941ac
File tree
6 files changed
+969
-8
lines changed- cpp/tensorrt_llm
- kernels/cutlass_kernels/fp8_blockscale_gemm
- 6kd_blockwise_gemm
- thop
- tensorrt_llm/_torch/modules
- tests/unittest/_torch/thop/parallel
6 files changed
+969
-8
lines changed
0 commit comments