~NGC release testing #388

Manually triggered October 6, 2025 10:18
Status Failure
Total duration 1h 58m 36s
Artifacts 5

ngc-release-testing.yaml

on: workflow_dispatch
test-maxtext-gke / maxtext-gke-xpk (1m 33s)
test-nccl / nccl-test-gke / build-nccl-gke (2m 42s)
Matrix: test-nccl / nccl-test-gke / nccl-gke
finalize / workflow-badge (8s)
finalize / report (5s)
finalize / upload-badge (7s)
finalize / publish-badge (6s)

Annotations

5 errors
test-nccl / nccl-test-gke / nccl-gke (broadcast_perf_mpi)
Process completed with exit code 1.
test-nccl / nccl-test-gke / nccl-gke (all_gather_perf_mpi)
The strategy configuration was canceled because "test-nccl.nccl-test-gke.nccl-gke.broadcast_perf_mpi" failed
test-nccl / nccl-test-gke / nccl-gke (all_reduce_perf_mpi)
The strategy configuration was canceled because "test-nccl.nccl-test-gke.nccl-gke.broadcast_perf_mpi" failed
test-nccl / nccl-test-gke / nccl-gke (reduce_scatter_perf_mpi)
The strategy configuration was canceled because "test-nccl.nccl-test-gke.nccl-gke.broadcast_perf_mpi" failed
test-maxtext-gke / maxtext-gke-xpk
Process completed with exit code 1.

Artifacts

Produced during runtime
Name                            Size       Digest
artifact-final-report           740 Bytes  sha256:6c21baa41293f10a32433e906c62b72350db5f8f1a732e4f88ce9aa52a9dd9e4
artifact-nccl-gke-build-amd64   571 Bytes  sha256:4728198a6445927e4161294904961f5478f8d9abc1e90f7e4e74c22001e54e0a
artifact-workflow-metadata      265 Bytes  sha256:cf8ee31b5c1a1a0b0ecd04d464ccb2f01e4a7558343b826474977eb7324ae136
gke-maxtext-train-sitrep        228 Bytes  sha256:80768e756a6be48fa317102b0f6e1851ccedafedd4413f43ef61bd3b09c1a6bf
nccl-gke-broadcast-sitrep       224 Bytes  sha256:0bc9312b80de217e11788b46f82ca392ac38daba9cfa29a8c57f1ed444b05e8b
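
The digests listed above can be used to check a downloaded artifact for corruption or tampering. The following is a minimal Python sketch, assuming the digest is computed over the raw bytes of the downloaded artifact archive; the helper name `matches_digest` is hypothetical and is not part of the workflow.

```python
import hashlib


def matches_digest(data: bytes, digest: str) -> bool:
    """Return True if `data` hashes to a digest of the form 'sha256:<hex>'.

    Hypothetical helper for illustration; only the sha256 prefix is handled.
    """
    algo, _, expected = digest.partition(":")
    if algo != "sha256" or not expected:
        raise ValueError(f"unsupported digest format: {digest!r}")
    # Compare hex strings case-insensitively.
    return hashlib.sha256(data).hexdigest() == expected.lower()
```

To use it, read the downloaded file in binary mode and pass its contents together with the digest string from the listing, for example `matches_digest(open(path, "rb").read(), "sha256:...")`, where `path` is wherever the artifact was saved.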