JAX inference offloading bridge #5029
Triggered via pull request
November 12, 2025 05:52
Status
Failure
Total duration
3h 38m 57s
Artifacts
55
ci.yaml
on: pull_request
metadata
4s
Matrix: amd64 / test-distribution
Matrix: arm64 / test-distribution
amd64
/
...
/
build-mpi-operator-compatible-base
2m 11s
arm64
/
...
/
build-mpi-operator-compatible-base
Matrix: amd64 / test-jax-cutlass-h100 / jax-cutlass-test-h100
Matrix: amd64 / test-jax / run-unit-test
Matrix: amd64 / test-te-a100 / run-unit-test
Matrix: amd64 / test-te-h100 / te-test-h100
amd64
/
build-torchax
8m 41s
amd64
/
...
/
launch-slurm-runner
2h 40m
amd64
/
test-nsys-jax-eks
29m 57s
amd64
/
...
/
launch-slurm-runner
2h 0m
Matrix: amd64 / test-nsys-jax / run-unit-test
Matrix: amd64 / test-nccl / nccl-test
Matrix: amd64 / test-nccl / nccl-test-gke / nccl-gke
Matrix: arm64 / test-jax-cutlass-h100 / jax-cutlass-test-h100
Waiting for pending jobs
Matrix: arm64 / test-jax / run-unit-test
Waiting for pending jobs
Matrix: arm64 / test-te-a100 / run-unit-test
Waiting for pending jobs
Matrix: arm64 / test-te-h100 / te-test-h100
Waiting for pending jobs
arm64
/
build-torchax
8m 7s
arm64
/
test-nsys-jax-eks
arm64
/
...
/
launch-slurm-runner
arm64
/
...
/
launch-slurm-runner
Matrix: arm64 / test-nsys-jax / run-unit-test
Waiting for pending jobs
Matrix: arm64 / test-nccl / nccl-test
Waiting for pending jobs
Matrix: arm64 / test-nccl / nccl-test-gke / nccl-gke
Waiting for pending jobs
Matrix: amd64 / test-maxtext / maxtext-multinode
Matrix: amd64 / test-maxtext / single-process-multi-device
amd64
/
test-axlearn-eks
17m 16s
amd64
/
test-axlearn-fuji-models-eks
5m 23s
Matrix: amd64 / test-nsys-jax-archive
Matrix: arm64 / test-maxtext / maxtext-multinode
Waiting for pending jobs
Matrix: arm64 / test-maxtext / single-process-multi-device
Waiting for pending jobs
arm64
/
test-axlearn-eks
arm64
/
test-axlearn-fuji-models-eks
Matrix: arm64 / test-nsys-jax-archive
Matrix: amd64 / test-rosetta-t5x / vit-multi-gpu-multi-node
Matrix: arm64 / test-rosetta-t5x / vit-multi-gpu-multi-node
Waiting for pending jobs
Matrix: publish-containers
finalize
/
publish-badge
5s
Annotations
6 errors
|
amd64 / test-te-h100 / te-test-h100 (unittest, 8)
Process completed with exit code 1.
|
|
amd64 / test-te-a100 / te-A100-unit-test
The self-hosted runner lost communication with the server. Verify the machine is running and has a healthy network connection. Anything in your workflow that terminates the runner process, starves it for CPU/Memory, or blocks its network access can cause this error.
|
|
amd64 / test-nsys-jax / nsys-jax-A100-unit-test
Process completed with exit code 1.
|
|
amd64 / test-maxtext / test-maxtext-outcome
Process completed with exit code 1.
|
|
amd64 / test-rosetta-t5x / test-t5x-rosetta-metrics
Process completed with exit code 1.
|
|
amd64 / test-rosetta-t5x / test-t5x-rosetta-outcome
Process completed with exit code 1.
|
Artifacts
Produced during runtime
| Name | Size | Digest | |
|---|---|---|---|
|
artifact-axlearn-build-amd64
|
566 Bytes |
sha256:be473a88ae184056940b3fdbe05588693335f20ea8afef5df50f9b6a565033e3
|
|
|
artifact-axlearn-build-arm64
|
567 Bytes |
sha256:ab49c2f957445df9f62fbcb61ef8eb0ec08e20c5b2f3bca3794394f41d50c5a6
|
|
|
artifact-axlearn-test
|
182 KB |
sha256:d159bd033fbd9ad3da901358e07ce62e71fd34e1353c88824328cee77ac01b4a
|
|
|
artifact-base-build-amd64
|
566 Bytes |
sha256:8f37a8586dd43dfab733465874a72748d36c3c40f3a1b0e4e39ce2b5520b45fe
|
|
|
artifact-base-build-arm64
|
566 Bytes |
sha256:7b7a86556dbe525199830576007a140bd39d4090e6b43a96ee19aabb47a38834
|
|
|
artifact-equinox-build-amd64
|
567 Bytes |
sha256:b4f3845992e8b16e9877185b958af0fd2a4f0f6f8878127e128ccad8e290f943
|
|
|
artifact-equinox-build-arm64
|
569 Bytes |
sha256:7b8de5d0ae5a4acfadeebaebd3a435505d82e1b38bb93a83deab92109a0a101e
|
|
|
artifact-final-report
|
4.09 KB |
sha256:b2ec8b68d134b5c7ed7a831da1a82a428222f7766031334345626a3389ea1ba7
|
|
|
artifact-jax-build-amd64
|
553 Bytes |
sha256:93ecaeee62a71a0ace9dca6cd7ce5a1055a2431781eee9bdf2026e1701e54706
|
|
|
artifact-jax-build-arm64
|
554 Bytes |
sha256:98679e28c194bcbe096fa3ad00c88789505da0c2afae4ea2343036beba604c5f
|
|
|
artifact-maxtext-build-amd64
|
567 Bytes |
sha256:9d33027cfe0d9cb7f28129c80b1ca86a01b9b781df161108fe496776a8c9de89
|
|
|
artifact-maxtext-build-arm64
|
569 Bytes |
sha256:1e915be92ed5304dd5a9d0db94564a3b102c6ce6e831a55a83ba01d5df80be4f
|
|
|
artifact-maxtext-test
|
1.46 KB |
sha256:2cd2a2f5d848c67d006da66940d73c923720b359b038eb5b729b98ea8fff7430
|
|
|
artifact-mpi-operator-compatible-base-build-amd64
|
637 Bytes |
sha256:70ce50d22a545bcc788f515a9f3c5924b2510466a30e082825ae2a414907a20b
|
|
|
artifact-nccl-gke-build-amd64
|
571 Bytes |
sha256:fb3ebbefb2c600c01d853983b5c316d78551fb8f3282e69396f83a9a738ba5cc
|
|
|
artifact-rosetta-build-t5x-amd64
|
585 Bytes |
sha256:a5b2f75e90f9b4d62fe140980357f0d9a4b09fab90d3980e24709d6308eca84b
|
|
|
artifact-rosetta-build-t5x-arm64
|
584 Bytes |
sha256:29d8a1c7d30f2049e3651c729e569faf76cf01d6f8fc67b098d639eab8d68824
|
|
|
artifact-rosetta-t5x-mgmn-test
|
624 Bytes |
sha256:a11cbd5a94e2a2438ba61e493e83ff3c995311477646fc0942459e572e640e87
|
|
|
artifact-t5x-build-amd64
|
569 Bytes |
sha256:1a7291e0b469530a7d5ab4b8691260cce6b673fd6794f6f89f9eec39972b8713
|
|
|
artifact-t5x-build-arm64
|
567 Bytes |
sha256:8b3122d00eb72463381969f5ab33793a999f1e4c3b1343c3a521dd5b8953413f
|
|
|
artifact-torchax-build-amd64
|
568 Bytes |
sha256:b21f5780b054ecd7a1e03f1aca191f4fdda50332652bc96bd0c133d4df6723fb
|
|
|
artifact-torchax-build-arm64
|
568 Bytes |
sha256:a4524deb531e4d4946bb54a8936caf5311541fcf1f0c5eb7379dca7b763491d0
|
|
|
artifact-workflow-metadata
|
277 Bytes |
sha256:573e41acef400699323d00d5b7d268efa4da37529d6236a5f359b6ad33886b2c
|
|
|
bumped-manifest
|
51.6 KB |
sha256:481312e64c111110f54044590612fcf069e79d8f0a7e4b279f14747058930155
|
|
|
final-axlearn
|
263 Bytes |
sha256:7244ca0a4639b80032849e86b45a28ae2c926a85d0c57b8e82a33ac9ccbac3b0
|
|
|
final-base
|
254 Bytes |
sha256:b2c33962d03d5f898c688d9b7ffee1dbc726e0e7c34325bcd35f6f3af19551d4
|
|
|
final-equinox
|
263 Bytes |
sha256:a1234d10bab7f44ab2031dfdc338543f342dceac521dc1381755373c54e7d215
|
|
|
final-jax
|
251 Bytes |
sha256:455755de0cd8edb5cd9bccd472194a03330df278a75fe11b644f4acd751c5118
|
|
|
final-maxtext
|
263 Bytes |
sha256:b22a977e5eeb6e6bc79748a7878135a1d3a6b3a47d23258933c461d381154022
|
|
|
final-t5x
|
251 Bytes |
sha256:962b778e8fb22ec4e438e0039a013adbf581cbbe2fa7de0c64cfb2968b8fb1ce
|
|
|
final-upstream-t5x
|
276 Bytes |
sha256:d46ca2a89a2bfde1df92b7b508c7793339f712fcf06d3104b4afd5c575cc81b7
|
|
|
gke-maxtext-train
|
371 MB |
sha256:341d9c3fb2f44a83519b5427cfaec236c4ea9e29bc88706b4d832b87944d7cc2
|
|
|
gke-maxtext-train-sitrep
|
228 Bytes |
sha256:14ec6975da658969872c7e5b54a0b5af72a47fcc763443d8ac742f0fb3d9fb73
|
|
|
jax-cutlass-test-H100
|
4.68 KB |
sha256:4539a881ffcc6abb124c9f823135d3264faa1a5436d83974ccb775e145d34ea2
|
|
|
jax-unit-test-A100
|
22.5 KB |
sha256:425e9058d03f4d54d087e1fdfc05ce17119da23e93984526eb0cf2647e068c10
|
|
|
mealkit-axlearn
|
272 Bytes |
sha256:75895f3c9454e782d75350018d2441dcaebac12f2d8400d9ae94aeb12f154d52
|
|
|
mealkit-equinox
|
272 Bytes |
sha256:3a96fb688138320bc24a7900b1e30963aece8242e37d27e9bcf2dee3b627ef96
|
|
|
mealkit-jax
|
261 Bytes |
sha256:42ca5fae4d798f0d3646dd298d4954f0f0e6ee6e2d23590c745ca77b03633b0e
|
|
|
mealkit-maxtext
|
271 Bytes |
sha256:320b975ce867c9dbec524e7d7f5e33ab1338a9bddc6c434ee4661c2aa7705855
|
|
|
mealkit-t5x
|
261 Bytes |
sha256:c00ee405422259b8237d765e088eac3d510dd60f5c5eae084db78128daa76b1a
|
|
|
mealkit-upstream-t5x
|
286 Bytes |
sha256:4d6dae8e79f8000991fe89494881c5a2e33a124b4d8cc789a123ab3e900c5a4f
|
|
|
nccl-gke-all-gather
|
15.4 KB |
sha256:dcd704ec3a08803207f6bf8e60fad88ca3c62d23eb2178a7e6bd1cf1b159b2fa
|
|
|
nccl-gke-all-gather-sitrep
|
231 Bytes |
sha256:dffc50a2727d88010aacc9e89a048a149a3e81cb9ac1c64337778a2afe7ec427
|
|
|
nccl-gke-all-reduce
|
15.6 KB |
sha256:15fc7529d0040feb099d21fe72c2a121d65a7163ba6b3f34945a33dcae8e9eec
|
|
|
nccl-gke-all-reduce-sitrep
|
231 Bytes |
sha256:dfb8c990d8e89767817c1871460b4e1853a4de187b839cecd19c04fe19242cd4
|
|
|
nccl-gke-broadcast
|
15.2 KB |
sha256:389262cec09390394ddad9641a60bc03fd39852d89f91f71228ec6d092257508
|
|
|
nccl-gke-broadcast-sitrep
|
229 Bytes |
sha256:f9144377789bdfb48fd2754a385c8f8bf53367c5500a5d657b5cda5af84d83d5
|
|
|
nccl-gke-reduce-scatter
|
15.4 KB |
sha256:9c4b6185846716c598f653c234914b30f3bda05bf9c1bb5f5f165bcfea9ce2b6
|
|
|
nccl-gke-reduce-scatter-sitrep
|
234 Bytes |
sha256:a6690215d2143d81f3718fa045991765fe0c77cbd1a6bb10f589f23b1ff37404
|
|
|
nsys-jax-unit-test-A100
|
127 MB |
sha256:1422bdb5bee1d22d5817cf3490e4fb2625107e78a116365f62e41f86481fc638
|
|
|
rosetta-t5x-vit-19287850868-VIT8G1N
|
16.3 KB |
sha256:7374b7422295b29f76bcfe023ec6329b4f8fd8b54b340148fd85426bc9cf46bb
|
|
|
te-unit-test-H100
|
2.08 MB |
sha256:c24e5eae92cb16a6a077c84fd7a62a55fbb5c089e574e3bda2749d4bdb9d917d
|
|
|
upstream-maxtext-19287850868-1DP2FSDP4TP1PP_single_process
|
23.7 KB |
sha256:afa668c6aeb8d2a177dae4273fe3bf9430283aed031c1630d26989e4b6f5c226
|
|
|
upstream-maxtext-19287850868-2DP2FSDP2TP1PP
|
32 KB |
sha256:ece65cc53c41628436d2bad01adbfa8546c22e9a811e4eab52600950e959c92b
|
|
|
upstream-maxtext-metrics-test-log
|
2.52 KB |
sha256:4ec4700f1979fc2289837021ae1dc0913603e732b2d5dbe30b4ec0daa38914f0
|
|