QMoE CUDA EP — FP4/FP8/WFP4AFP8 Quantized Mixture-of-Experts + MoE GEMM Refactor#28467

Merged
tianleiwu merged 20 commits into main from tlwu/20260511/qmoe_cuda on May 20, 2026

Commits

Commits on May 12, 2026

Commits on May 13, 2026

Commits on May 14, 2026

Commits on May 17, 2026

Commits on May 19, 2026