
Conversation

@yuchengliu1
Contributor

pytorch/pytorch#140972 adds `scaled_mm` support to PyTorch.
Once that lands, this fallback would cause a duplicate registration.
Remove this fallback after pytorch/pytorch#140972 is merged.
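
For context, here is a minimal Python sketch of the kind of conflict being avoided. It is illustrative only: the real torch-xpu-ops fallback is implemented in C++ as a generic boxed fallback, and the stub name below (`_scaled_mm_via_cpu`) is hypothetical.

```python
# Illustrative sketch only, not the torch-xpu-ops code. It mimics registering
# a CPU-roundtrip fallback for aten::_scaled_mm under the XPU dispatch key.
import torch

_lib = torch.library.Library("aten", "IMPL")  # attach extra kernels to existing aten ops

def _scaled_mm_via_cpu(self, mat2, scale_a, scale_b, bias=None,
                       scale_result=None, out_dtype=None, use_fast_accum=False):
    # Fall back by running the CPU kernel and copying the result back to XPU.
    out = torch._scaled_mm(
        self.cpu(), mat2.cpu(), scale_a.cpu(), scale_b.cpu(),
        bias=None if bias is None else bias.cpu(),
        out_dtype=out_dtype,
    )
    return out.to(self.device)

# Once pytorch/pytorch#140972 registers a native XPU kernel for
# aten::_scaled_mm, a second registration like this one is flagged as a
# duplicate for the same operator / dispatch-key pair, hence the removal here.
_lib.impl("_scaled_mm", _scaled_mm_via_cpu, "XPU")
```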

Copilot AI review requested due to automatic review settings June 16, 2025 07:40
Contributor

Copilot AI left a comment


Pull Request Overview

This PR removes the "_scaled_mm" fallback registration from the XPU backend, addressing duplicate registration issues as detailed in the PR description.

  • Remove the "_scaled_mm" fallback entry to prevent duplicate registrations
  • Align the XPU fallback registration with the updates in PyTorch core

@guangyey
Contributor

guangyey commented Aug 5, 2025

Hi, let's file a separate PR against a temporary branch such as viable/strict, land that PR into that branch, and then you can update the commit pin within your PR. Once your stock PyTorch PR has landed, we'll merge this PR and update the commit pin again...

We do these things to avoid breaking CI both at PyTorch and torch-xpu-ops.

Contributor

@guangyey guangyey left a comment


LGTM.
Let's merge this PR after pytorch/pytorch#140972 lands, to avoid blocking the internal CI.

pytorchmergebot pushed a commit to pytorch/pytorch that referenced this pull request Nov 14, 2025
This PR implements `scaled_mm` for XPU. It enables the following data types:
1. TensorWise Scaling: `fp8_e4m3` and `fp8_e5m2`
2. RowWise Scaling: `fp8_e4m3` and `fp8_e5m2`

It leaves BlockWise Scaling to the next PR so that the review stays smaller.

This first PR only adds `scaled_mm_xpu` but does not register it; we separate this out to reduce the reviewing effort.

Secondly, there is a `scaled_mm_v2` API in #164141. We will align with it once v1 is cleaned up.
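
A hedged usage sketch of the two scaling modes described above follows. The exact `torch._scaled_mm` argument layout has changed across PyTorch releases and `scaled_mm_v2` is still in flight, so the call signature, the `"xpu"` device string, and the scale shapes below are assumptions, not the authoritative API.

```python
# Hedged sketch of TensorWise vs RowWise scaled FP8 matmul; the argument layout
# is assumed to match recent torch._scaled_mm builds and may differ on older ones.
import torch

device = "xpu"  # assumes an XPU build with a registered (or fallback) _scaled_mm
M, K, N = 64, 128, 32

a = torch.randn(M, K, device=device).to(torch.float8_e4m3fn)
# mat2 is usually expected column-major, so build (N, K) and transpose to (K, N).
b = torch.randn(N, K, device=device).to(torch.float8_e4m3fn).t()

# TensorWise scaling: a single fp32 scale per tensor.
scale_a = torch.tensor(1.0, device=device)
scale_b = torch.tensor(1.0, device=device)
out_tw = torch._scaled_mm(a, b, scale_a, scale_b, out_dtype=torch.bfloat16)

# RowWise scaling: one fp32 scale per row of `a` and per column of `b`.
scale_a_row = torch.ones(M, 1, device=device)
scale_b_col = torch.ones(1, N, device=device)
out_rw = torch._scaled_mm(a, b, scale_a_row, scale_b_col, out_dtype=torch.bfloat16)

print(out_tw.shape, out_rw.shape)  # torch.Size([64, 32]) torch.Size([64, 32])
```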

**Co-authors:** @yuchengliu1, @carsonwang

## PR stack:

- -> #165978 : implementation of XPU scaled_mm and oneDNN kernel
- #167518 : implementation of XPU scaled_mm_v2
- #166056 : Op registration

## Test Status:

1. Relies on the changes in intel/torch-xpu-ops#1746; otherwise the op will fall back to CPU (a quick dispatcher check is sketched after this list).
2. This PR does not include tests; the tests are enabled in #166056.
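
A quick way to tell whether the native XPU kernel or the CPU fallback will be hit is to inspect the dispatcher. The helper below is a private `torch._C` API, so treat this as a debugging aid under that assumption, not a stable interface.

```python
# Debugging aid only: _dispatch_dump is an internal PyTorch helper whose output
# format is not a stable contract.
import torch

dump = torch._C._dispatch_dump("aten::_scaled_mm")
print(dump)  # lists the kernels registered per dispatch key
print("XPU kernel registered:", "XPU" in dump)
```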

## Credit:

This work is based on @yuchengliu1's work in #140972. We created a new PR to align the API and checks with CUDA, so that there will be less porting effort.

## FP8 Task tracker:
We will track all scaled_mm-related tasks in #167170.

Pull Request resolved: #165978
Approved by: https://github.com/liangan1, https://github.com/EikanWang

Co-authored-by: Eikan Wang <[email protected]>
@carsonwang

Are we ready to merge this one now that pytorch/pytorch#165978 has been merged? @guangyey @Stonepia

@Stonepia
Contributor

Stonepia commented Nov 18, 2025

Hi @carsonwang, I think this could be merged, but since the op is not registered yet (see pytorch/pytorch#166056), I would like to delete the torch-xpu-ops fallback after that PR is merged. The ETA is within this week, or no later than next week.

Silv3S pushed a commit to Silv3S/pytorch that referenced this pull request Nov 18, 2025
@chuanqi129 chuanqi129 merged commit 56851c9 into main Nov 21, 2025
31 of 32 checks passed
@chuanqi129 chuanqi129 deleted the remove_scaled_mm_scaled branch November 21, 2025 03:26
@intel intel deleted a comment from github-actions bot Nov 21, 2025