[XLA:GPU][oneAPI] Add support for oneCCL bazel build by nhatleSummer22 · Pull Request #42595 · openxla/xla

nhatleSummer22 · 2026-05-14T05:45:15Z

📝 Summary of Changes
This PR enables building oneCCL from source using XLA bazel build system. oneCCL enables optimized communication pattern on Intel's GPUs.
🎯 Justification
This PR is first step to support scale-up functionality on Intel's GPUs. Subsequent PRs will add full support for scale-up.

🚀 Kind of Contribution
✨ New Feature

steeve · 2026-05-14T07:57:25Z

Hi, thank you for this PR !

I was trying to use it earlier but ran into problem due to oneCCL "one process per GPU" model (according to Codex), the collectives would fail to initialize.

Apparently this is only possible with oneAPI 2026.0. Do you have more info?

Also, we are maintaining a fork which now ran llama on a b70 at https://github.com/zml/xla/commits/zml/oneapi2/

We found 2 problems:

implementing sycl blas gemm
fix event recording which was broken

Thank you!

dimitar-asenov · 2026-05-14T16:09:48Z

@MichaelHudgins Can we simply add a directory in third_party or does this need a bit more work on our side?

bhavani-subramanian · 2026-05-19T01:10:29Z

@steeve Thanks for the note. Just a heads-up that I have opened a PR to fix the issue in event recording: #42806

MichaelHudgins · 2026-06-02T14:11:09Z

@MichaelHudgins Can we simply add a directory in third_party or does this need a bit more work on our side?

Apologies, i missed this one. In general we can, let me message you internally with more specifics.

Imported from GitHub PR #42595 📝 Summary of Changes This PR enables building oneCCL from source using XLA bazel build system. [oneCCL](https://github.com/uxlfoundation/oneCCL) enables optimized communication pattern on Intel's GPUs. 🎯 Justification This PR is first step to support scale-up functionality on Intel's GPUs. Subsequent PRs will add full support for scale-up. 🚀 Kind of Contribution ✨ New Feature Copybara import of the project: -- 8e59923 by Nhat Le <nhat.le@intel.com>: Add support for oneCCL bazel build -- 6762807 by Nhat Le <nhat.le@intel.com>: Add dependency to trigger oneCCL build -- 644072a by Nhat Le <nhat.le@intel.com>: Fix EOF -- 42901eb by Nhat Le <nhat.le@intel.com>: Make all the necessary headers visible to dependents -- f4c481a by nhatle <nhat.le@intel.com>: Fix for file not found errors in XLA Linux X86 GPU ONEAPI CI Merging this change closes #42595 FUTURE_COPYBARA_INTEGRATE_REVIEW=#42595 from Intel-tensorflow:nhatle/xla_oneccl_bazel_build f4c481a PiperOrigin-RevId: 927245566

Imported from GitHub PR #42595 📝 Summary of Changes This PR enables building oneCCL from source using XLA bazel build system. [oneCCL](https://github.com/uxlfoundation/oneCCL) enables optimized communication pattern on Intel's GPUs. 🎯 Justification This PR is first step to support scale-up functionality on Intel's GPUs. Subsequent PRs will add full support for scale-up. 🚀 Kind of Contribution ✨ New Feature Copybara import of the project: -- 8e59923 by Nhat Le <nhat.le@intel.com>: Add support for oneCCL bazel build -- 6762807 by Nhat Le <nhat.le@intel.com>: Add dependency to trigger oneCCL build -- 644072a by Nhat Le <nhat.le@intel.com>: Fix EOF -- 42901eb by Nhat Le <nhat.le@intel.com>: Make all the necessary headers visible to dependents -- f4c481a by nhatle <nhat.le@intel.com>: Fix for file not found errors in XLA Linux X86 GPU ONEAPI CI Merging this change closes #42595 FUTURE_COPYBARA_INTEGRATE_REVIEW=#42595 from Intel-tensorflow:nhatle/xla_oneccl_bazel_build f4c481a PiperOrigin-RevId: 929784497

Imported from GitHub PR openxla/xla#42595 📝 Summary of Changes This PR enables building oneCCL from source using XLA bazel build system. [oneCCL](https://github.com/uxlfoundation/oneCCL) enables optimized communication pattern on Intel's GPUs. 🎯 Justification This PR is first step to support scale-up functionality on Intel's GPUs. Subsequent PRs will add full support for scale-up. 🚀 Kind of Contribution ✨ New Feature Copybara import of the project: -- 8e599233097ee4da17aea66b2ddf844ee160c471 by Nhat Le <nhat.le@intel.com>: Add support for oneCCL bazel build -- 6762807a4d7b0b6392fdfa493cdb3270ea50cefa by Nhat Le <nhat.le@intel.com>: Add dependency to trigger oneCCL build -- 644072af588c031f96342cf15cf6e65967225660 by Nhat Le <nhat.le@intel.com>: Fix EOF -- 42901eb57429c14c416636f7b5a68ce87f9c895b by Nhat Le <nhat.le@intel.com>: Make all the necessary headers visible to dependents -- f4c481a006fbd38f8eca26e04d0f75b108bb8618 by nhatle <nhat.le@intel.com>: Fix for file not found errors in XLA Linux X86 GPU ONEAPI CI Merging this change closes #42595 Reverts changelist 929420822 FUTURE_COPYBARA_INTEGRATE_REVIEW=openxla/xla#42595 from Intel-tensorflow:nhatle/xla_oneccl_bazel_build f4c481a006fbd38f8eca26e04d0f75b108bb8618 PiperOrigin-RevId: 929784497

Imported from GitHub PR #42595 📝 Summary of Changes This PR enables building oneCCL from source using XLA bazel build system. [oneCCL](https://github.com/uxlfoundation/oneCCL) enables optimized communication pattern on Intel's GPUs. 🎯 Justification This PR is first step to support scale-up functionality on Intel's GPUs. Subsequent PRs will add full support for scale-up. 🚀 Kind of Contribution ✨ New Feature Copybara import of the project: -- 8e59923 by Nhat Le <nhat.le@intel.com>: Add support for oneCCL bazel build -- 6762807 by Nhat Le <nhat.le@intel.com>: Add dependency to trigger oneCCL build -- 644072a by Nhat Le <nhat.le@intel.com>: Fix EOF -- 42901eb by Nhat Le <nhat.le@intel.com>: Make all the necessary headers visible to dependents -- f4c481a by nhatle <nhat.le@intel.com>: Fix for file not found errors in XLA Linux X86 GPU ONEAPI CI Merging this change closes #42595 FUTURE_COPYBARA_INTEGRATE_REVIEW=#42595 from Intel-tensorflow:nhatle/xla_oneccl_bazel_build f4c481a PiperOrigin-RevId: 929784497

Imported from GitHub PR openxla/xla#42595 📝 Summary of Changes This PR enables building oneCCL from source using XLA bazel build system. [oneCCL](https://github.com/uxlfoundation/oneCCL) enables optimized communication pattern on Intel's GPUs. 🎯 Justification This PR is first step to support scale-up functionality on Intel's GPUs. Subsequent PRs will add full support for scale-up. 🚀 Kind of Contribution ✨ New Feature Copybara import of the project: -- 8e599233097ee4da17aea66b2ddf844ee160c471 by Nhat Le <nhat.le@intel.com>: Add support for oneCCL bazel build -- 6762807a4d7b0b6392fdfa493cdb3270ea50cefa by Nhat Le <nhat.le@intel.com>: Add dependency to trigger oneCCL build -- 644072af588c031f96342cf15cf6e65967225660 by Nhat Le <nhat.le@intel.com>: Fix EOF -- 42901eb57429c14c416636f7b5a68ce87f9c895b by Nhat Le <nhat.le@intel.com>: Make all the necessary headers visible to dependents -- f4c481a006fbd38f8eca26e04d0f75b108bb8618 by nhatle <nhat.le@intel.com>: Fix for file not found errors in XLA Linux X86 GPU ONEAPI CI Merging this change closes #42595 Reverts 028c67b FUTURE_COPYBARA_INTEGRATE_REVIEW=openxla/xla#42595 from Intel-tensorflow:nhatle/xla_oneccl_bazel_build f4c481a006fbd38f8eca26e04d0f75b108bb8618 PiperOrigin-RevId: 929784497

dimitar-asenov · 2026-06-11T12:47:10Z

Hi @nhatleSummer22 . We are merging this without the change to oneccl_collectives.cc. Hope that's OK.

Reverts 028c67b FUTURE_COPYBARA_INTEGRATE_REVIEW=openxla/xla#42595 from Intel-tensorflow:nhatle/xla_oneccl_bazel_build f4c481a006fbd38f8eca26e04d0f75b108bb8618 PiperOrigin-RevId: 930185853

To add support for other operations like all-gather/collective dots to the collective kernel thunk we should make it agnostic to custom kernels first. Reverts 028c67b FUTURE_COPYBARA_INTEGRATE_REVIEW=openxla/xla#42595 from Intel-tensorflow:nhatle/xla_oneccl_bazel_build f4c481a006fbd38f8eca26e04d0f75b108bb8618 PiperOrigin-RevId: 930464574

nhatleSummer22 added 2 commits May 13, 2026 14:52

Add support for oneCCL bazel build

8e59923

Add dependency to trigger oneCCL build

6762807

dimitar-asenov requested a review from MichaelHudgins May 14, 2026 16:09

Fix EOF

644072a

dimitar-asenov requested a review from penpornk May 19, 2026 08:53

Make all the necessary headers visible to dependents

42901eb

dimitar-asenov approved these changes Jun 5, 2026

View reviewed changes

This was referenced Jun 5, 2026

PR #42595: [XLA:GPU][oneAPI] Add support for oneCCL bazel build #43806

Closed

PR #42595: [XLA:GPU][oneAPI] Add support for oneCCL bazel build tensorflow/tensorflow#120449

Draft

mraunak mentioned this pull request Jun 8, 2026

[XLA:GPU][oneAPI] Add oneAPI BUILD template and update redist versions for 2026.0 google-ml-infra/rules_ml_toolchain#271

Open

Fix for file not found errors in XLA Linux X86 GPU ONEAPI CI

f4c481a

nhatleSummer22 requested a review from dimitar-asenov June 9, 2026 00:31

neudinger added a commit to zml/xla that referenced this pull request Jun 9, 2026

Basicaly openxla#42595

6f3969c

dimitar-asenov approved these changes Jun 9, 2026

View reviewed changes

This was referenced Jun 10, 2026

PR #42595: [XLA:GPU][oneAPI] Add support for oneCCL bazel build #44033

Merged

PR #42595: [XLA:GPU][oneAPI] Add support for oneCCL bazel build tensorflow/tensorflow#120796

Merged

copybara-service Bot closed this in 7f98d00 Jun 11, 2026

copybara-service Bot mentioned this pull request Jun 11, 2026

Automated Code Change tensorflow/tensorflow#120930

Draft

copybara-service Bot mentioned this pull request Jun 11, 2026

Automated Code Change tensorflow/tensorflow#120923

Draft

This was referenced Jun 11, 2026

[XLA:GPU] Remove support for custom all reduce kernels tensorflow/tensorflow#120926

Draft

PR #42595: [XLA:GPU][oneAPI] Add support for oneCCL bazel build tensorflow/tensorflow#120932

Draft

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[XLA:GPU][oneAPI] Add support for oneCCL bazel build#42595

[XLA:GPU][oneAPI] Add support for oneCCL bazel build#42595
nhatleSummer22 wants to merge 5 commits into
openxla:mainfrom
Intel-tensorflow:nhatle/xla_oneccl_bazel_build

nhatleSummer22 commented May 14, 2026 •

edited

Loading

Uh oh!

steeve commented May 14, 2026 •

edited

Loading

Uh oh!

dimitar-asenov commented May 14, 2026

Uh oh!

bhavani-subramanian commented May 19, 2026

Uh oh!

MichaelHudgins commented Jun 2, 2026

Uh oh!

dimitar-asenov commented Jun 11, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Conversation

nhatleSummer22 commented May 14, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

steeve commented May 14, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

dimitar-asenov commented May 14, 2026

Uh oh!

bhavani-subramanian commented May 19, 2026

Uh oh!

MichaelHudgins commented Jun 2, 2026

Uh oh!

dimitar-asenov commented Jun 11, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

nhatleSummer22 commented May 14, 2026 •

edited

Loading

steeve commented May 14, 2026 •

edited

Loading