Add Grouped GEMM for Mixed Dtype #457

muhammad-tanvir-1211 · 2025-07-04T15:59:31Z

This PR adds Grouped GEMM support for mixed precision GEMM.

t4c1 · 2025-07-07T07:45:46Z

examples/sycl/10_bmg_grouped_gemm_mixed_dtype/10_bmg_grouped_gemm_mixed_dtype.cpp

Can you rename this example to list the types instead of "mixed_dtype"?

t4c1 · 2025-07-07T07:48:37Z

examples/sycl/10_bmg_grouped_gemm_mixed_dtype/bmg_grouped_gemm_mixed_dtype_runner.hpp

+          using ret_type = cute::conditional_t<sizeof_bits_v<ElementZero> >= 8, ElementZero, int8_t>;
+          ret_type a = [&]() {
+            if constexpr (sizeof_bits_v<QuantizedElement> >= 8) {
+              return  (ret_type)(src_tensor(n, k, l));


Suggested change

return (ret_type)(src_tensor(n, k, l));

return static_cast<ret_type>(src_tensor(n, k, l));

And a few more examples of this below.

t4c1 · 2025-07-07T08:56:49Z

include/cutlass/gemm/collective/xe_array_mma_mixed_input.hpp

+    Tensor<EngineScales, LayoutScales>& tCrS_input,
+    Tensor<EngineZeros, LayoutZeros> tCrZ_input
+  ) {
+    // TODO: add assert here because such cases not support for int4 now


Suggested change

// TODO: add assert here because such cases not support for int4 now

// TODO (Codeplay): add assert here because int4 is not currently supported

…tlass-fork into mixed_group_gemm

Add Grouped GEMM for mixed type

260b3e3

muhammad-tanvir-1211 requested a review from a team July 4, 2025 15:59

t4c1 reviewed Jul 7, 2025

View reviewed changes

muhammad-tanvir-1211 added 6 commits July 9, 2025 15:06

Address feedback

eff23c8

Merge branch 'sycl-develop' of https://github.com/codeplaysoftware/cu…

923eea0

…tlass-fork into mixed_group_gemm

Added tests

a5b4b11

Fixed u4 example build

1541f98

Merge branch 'sycl-develop' of https://github.com/codeplaysoftware/cu…

a028a73

…tlass-fork into mixed_group_gemm

Merge branch 'sycl-develop' into mixed_group_gemm

2f21cfc

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add Grouped GEMM for Mixed Dtype #457

Add Grouped GEMM for Mixed Dtype #457

Uh oh!

muhammad-tanvir-1211 commented Jul 4, 2025

Uh oh!

t4c1 Jul 7, 2025

Uh oh!

t4c1 Jul 7, 2025

Uh oh!

t4c1 Jul 7, 2025

Uh oh!

Uh oh!

	return (ret_type)(src_tensor(n, k, l));
	return static_cast<ret_type>(src_tensor(n, k, l));

	// TODO: add assert here because such cases not support for int4 now
	// TODO (Codeplay): add assert here because int4 is not currently supported

Add Grouped GEMM for Mixed Dtype #457

Are you sure you want to change the base?

Add Grouped GEMM for Mixed Dtype #457

Uh oh!

Conversation

muhammad-tanvir-1211 commented Jul 4, 2025

Uh oh!

t4c1 Jul 7, 2025

Choose a reason for hiding this comment

Uh oh!

t4c1 Jul 7, 2025

Choose a reason for hiding this comment

Uh oh!

t4c1 Jul 7, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!