Added Multi-Host TPU tutorial #9507

vfdev-5 · 2025-07-24T09:39:32Z

This is a draft of multi-host tutorial, based on this gist: https://gist.github.com/vfdev-5/70f695e462443685a0922e79ce0ee899 and Chris Jones' mnist_xla.py code.

cc @melissawm

melissawm

Thank you @vfdev-5 ! A few very straightforward comments and one question (should we use TensorBoard or XProf for profiling?0

docs/source/learn/xla-multi-host-tpu.md

melissawm · 2025-07-24T22:43:56Z

docs/source/learn/xla-multi-host-tpu.md

+0 Training finished!
+```
+
+#### Profiler logs in TensorBoard


I believe we want to use XProf instead of TensorBoard, but we should confirm.

docs/source/learn/xla-multi-host-tpu.md

melissawm · 2025-07-29T17:07:33Z

Hello @pgmoka @bhavya01 - would you mind taking a look for correctness and scope of this tutorial? If you are happy with the general idea, we can remove this from draft and address any other feedback. Thank you!

melissawm · 2025-08-11T16:56:34Z

Hi folks - gentle ping. If you have any feedback, we're happy to address. Thanks!

melissawm · 2025-12-01T17:14:10Z

Hi all - is this something you are still interested in? I'm happy to help bring it over the finish line if so. Thanks!

zhanyong-wan · 2025-12-02T15:59:14Z

@bhavya01 , could you take a look and recommend whether we should proceed with this? Thanks!

vfdev-5 force-pushed the docs-learn-multi-host-tpu branch from 4527a9f to 8c0d217 Compare July 24, 2025 09:54

melissawm reviewed Jul 24, 2025

View reviewed changes

vfdev-5 force-pushed the docs-learn-multi-host-tpu branch from 8c0d217 to aac84ec Compare August 28, 2025 08:18

Added Multi-Host TPU tutorial

9f8c996

vfdev-5 force-pushed the docs-learn-multi-host-tpu branch from aac84ec to 9f8c996 Compare August 28, 2025 08:21

vfdev-5 marked this pull request as ready for review August 28, 2025 08:21

vfdev-5 requested review from mikegre-google, qihqi and zhanyong-wan as code owners August 28, 2025 08:21

zhanyong-wan requested review from bhavya01 and pgmoka August 28, 2025 15:22

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Added Multi-Host TPU tutorial #9507

Added Multi-Host TPU tutorial #9507

Uh oh!

vfdev-5 commented Jul 24, 2025 •

edited

Loading

Uh oh!

melissawm left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

melissawm Jul 24, 2025

Uh oh!

Uh oh!

melissawm commented Jul 29, 2025

Uh oh!

melissawm commented Aug 11, 2025

Uh oh!

melissawm commented Dec 1, 2025

Uh oh!

zhanyong-wan commented Dec 2, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Added Multi-Host TPU tutorial #9507

Are you sure you want to change the base?

Added Multi-Host TPU tutorial #9507

Uh oh!

Conversation

vfdev-5 commented Jul 24, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

melissawm left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

melissawm Jul 24, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

melissawm commented Jul 29, 2025

Uh oh!

melissawm commented Aug 11, 2025

Uh oh!

melissawm commented Dec 1, 2025

Uh oh!

zhanyong-wan commented Dec 2, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

vfdev-5 commented Jul 24, 2025 •

edited

Loading