Skip to content

CI

CI #1323

Manually triggered December 5, 2023 22:33
Status Failure
Total duration 7h 32m 42s
Artifacts 9
This run and associated checks have been archived and are scheduled for deletion. Learn more about checks retention

ci.yaml

on: workflow_dispatch
Matrix: amd64 / test-distribution / test-create-distribution
amd64  /  ...  /  build-base
4m 47s
amd64 / build-base / build-base
Matrix: arm64 / test-distribution / test-create-distribution
arm64  /  ...  /  build-base
5m 2s
arm64 / build-base / build-base
amd64  /  ...  /  build-jax
8m 35s
amd64 / build-jax / build-jax
arm64  /  ...  /  build-jax
21m 39s
arm64 / build-jax / build-jax
amd64  /  ...  /  build-pax
6m 0s
amd64 / build-pax / build-pax
amd64  /  ...  /  build-t5x
6m 36s
amd64 / build-t5x / build-t5x
Matrix: amd64 / test-jax / jax-unit-test
amd64  /  ...  /  launch-slurm-runner
5s
amd64 / test-jax / runner / launch-slurm-runner
arm64  /  ...  /  build-pax
arm64 / build-pax / build-pax
arm64  /  ...  /  build-t5x
arm64 / build-t5x / build-t5x
Matrix: arm64 / test-jax / jax-unit-test
Waiting for pending jobs
arm64  /  ...  /  launch-slurm-runner
arm64 / test-jax / runner / launch-slurm-runner
Matrix: amd64 / test-pax / pax-multi-node
Matrix: amd64 / test-pax / single-process-evaluation
Matrix: amd64 / test-pax / single-process-multi-device
Matrix: amd64 / test-te / te-multi-gpu
amd64  /  ...  /  te-unit-tests
23m 31s
amd64 / test-te / te-unit-tests
amd64  /  ...  /  summary
0s
amd64 / test-pax / summary
amd64  /  ...  /  build-rosetta
6m 4s
amd64 / build-rosetta-pax / build-rosetta
amd64  /  ...  /  build-rosetta
8m 27s
amd64 / build-rosetta-t5x / build-rosetta
Matrix: amd64 / test-t5x / t5x-multi-gpu
Matrix: amd64 / test-t5x / t5x-multi-node
amd64  /  ...  /  summary
0s
amd64 / test-t5x / summary
Matrix: arm64 / test-pax / pax-multi-node
Waiting for pending jobs
Matrix: arm64 / test-pax / single-process-evaluation
Waiting for pending jobs
Matrix: arm64 / test-pax / single-process-multi-device
Waiting for pending jobs
Matrix: arm64 / test-te / te-multi-gpu
Waiting for pending jobs
arm64  /  ...  /  summary
arm64 / test-pax / summary
arm64  /  ...  /  te-unit-tests
arm64 / test-te / te-unit-tests
arm64  /  ...  /  build-rosetta
arm64 / build-rosetta-pax / build-rosetta
arm64  /  ...  /  build-rosetta
arm64 / build-rosetta-t5x / build-rosetta
Matrix: arm64 / test-t5x / t5x-multi-gpu
Waiting for pending jobs
Matrix: arm64 / test-t5x / t5x-multi-node
Waiting for pending jobs
arm64  /  ...  /  summary
arm64 / test-t5x / summary
amd64  /  ...  /  metrics
0s
amd64 / test-pax / metrics
amd64  /  ...  /  summary
0s
amd64 / test-vit / summary
Matrix: amd64 / test-vit / multi-gpu-multi-node
Matrix: amd64 / test-vit / single-process-multi-device
amd64  /  ...  /  metrics
0s
amd64 / test-t5x / metrics
arm64  /  ...  /  metrics
arm64 / test-pax / metrics
arm64  /  ...  /  summary
arm64 / test-vit / summary
Matrix: arm64 / test-vit / multi-gpu-multi-node
Waiting for pending jobs
Matrix: arm64 / test-vit / single-process-multi-device
Waiting for pending jobs
arm64  /  ...  /  metrics
arm64 / test-t5x / metrics
amd64  /  ...  /  publish
12s
amd64 / test-pax / publish-test / publish
amd64  /  ...  /  publish
3s
amd64 / test-vit / publish-test / publish
amd64  /  ...  /  publish
5s
amd64 / test-t5x / publish-test / publish
arm64  /  ...  /  publish
arm64 / test-pax / publish-test / publish
arm64  /  ...  /  publish
arm64 / test-vit / publish-test / publish
arm64  /  ...  /  publish
arm64 / test-t5x / publish-test / publish
amd64  /  ...  /  outcome
0s
amd64 / test-pax / outcome
amd64  /  ...  /  outcome
0s
amd64 / test-vit / outcome
amd64  /  ...  /  outcome
0s
amd64 / test-t5x / outcome
arm64  /  ...  /  outcome
arm64 / test-pax / outcome
arm64  /  ...  /  outcome
arm64 / test-vit / outcome
arm64  /  ...  /  outcome
arm64 / test-t5x / outcome
finalize  /  upload-badge
10s
finalize / upload-badge
finalize  /  report
10s
finalize / report
finalize  /  publish-badge
0s
finalize / publish-badge
Fit to window
Zoom out
Zoom in

Annotations

61 errors
arm64 / build-jax / build-jax
The self-hosted runner: arm-large-59fpj-bl8q4 lost communication with the server. Verify the machine is running and has a healthy network connection. Anything in your workflow that terminates the runner process, starves it for CPU/Memory, or blocks its network access can cause this error.
amd64 / test-t5x / t5x-multi-gpu (1)
The job running on runner GitHub Actions 176 has exceeded the maximum execution time of 360 minutes.
amd64 / test-t5x / t5x-multi-gpu (1)
The operation was canceled.
amd64 / test-t5x / t5x-multi-node (8, 2)
The job running on runner GitHub Actions 187 has exceeded the maximum execution time of 360 minutes.
amd64 / test-t5x / t5x-multi-node (8, 2)
The operation was canceled.
amd64 / test-t5x / t5x-multi-node (4, 2)
The job running on runner GitHub Actions 184 has exceeded the maximum execution time of 360 minutes.
amd64 / test-t5x / t5x-multi-node (4, 2)
The operation was canceled.
amd64 / test-t5x / t5x-multi-node (2, 1)
The job running on runner GitHub Actions 182 has exceeded the maximum execution time of 360 minutes.
amd64 / test-t5x / t5x-multi-node (2, 1)
The operation was canceled.
amd64 / test-t5x / t5x-multi-node (1, 2)
The job running on runner GitHub Actions 181 has exceeded the maximum execution time of 360 minutes.
amd64 / test-t5x / t5x-multi-node (1, 2)
The operation was canceled.
amd64 / test-t5x / t5x-multi-gpu (2)
The job running on runner GitHub Actions 177 has exceeded the maximum execution time of 360 minutes.
amd64 / test-t5x / t5x-multi-gpu (2)
The operation was canceled.
amd64 / test-t5x / t5x-multi-gpu (4)
The job running on runner GitHub Actions 178 has exceeded the maximum execution time of 360 minutes.
amd64 / test-t5x / t5x-multi-gpu (4)
The operation was canceled.
amd64 / test-t5x / t5x-multi-node (2, 2)
The job running on runner GitHub Actions 183 has exceeded the maximum execution time of 360 minutes.
amd64 / test-t5x / t5x-multi-node (2, 2)
The operation was canceled.
amd64 / test-t5x / t5x-multi-gpu (8)
The job running on runner GitHub Actions 179 has exceeded the maximum execution time of 360 minutes.
amd64 / test-t5x / t5x-multi-gpu (8)
The operation was canceled.
amd64 / test-t5x / t5x-multi-node (8, 1)
The job running on runner GitHub Actions 186 has exceeded the maximum execution time of 360 minutes.
amd64 / test-t5x / t5x-multi-node (8, 1)
The operation was canceled.
amd64 / test-t5x / t5x-multi-node (4, 1)
The job running on runner GitHub Actions 185 has exceeded the maximum execution time of 360 minutes.
amd64 / test-t5x / t5x-multi-node (4, 1)
The operation was canceled.
amd64 / test-t5x / t5x-multi-node (1, 1)
The job running on runner GitHub Actions 180 has exceeded the maximum execution time of 360 minutes.
amd64 / test-t5x / t5x-multi-node (1, 1)
The operation was canceled.
amd64 / test-t5x / publish-test / publish
Process completed with exit code 2.
amd64 / test-t5x / outcome
Process completed with exit code 2.
amd64 / test-pax / pax-multi-node (4, 2, 1, 1)
The job running on runner GitHub Actions 197 has exceeded the maximum execution time of 360 minutes.
amd64 / test-pax / pax-multi-node (4, 2, 1, 1)
The operation was canceled.
amd64 / test-pax / pax-multi-node (1, 8, 1, 1)
The job running on runner GitHub Actions 196 has exceeded the maximum execution time of 360 minutes.
amd64 / test-pax / pax-multi-node (1, 8, 1, 1)
The operation was canceled.
amd64 / test-pax / single-process-multi-device (1, 1, 2, 4)
The job running on runner GitHub Actions 189 has exceeded the maximum execution time of 360 minutes.
amd64 / test-pax / single-process-multi-device (1, 1, 2, 4)
The operation was canceled.
amd64 / test-pax / pax-multi-node (1, 16, 1, 1)
The job running on runner GitHub Actions 195 has exceeded the maximum execution time of 360 minutes.
amd64 / test-pax / pax-multi-node (1, 16, 1, 1)
The operation was canceled.
amd64 / test-pax / single-process-multi-device (1, 8, 1, 1)
The job running on runner GitHub Actions 190 has exceeded the maximum execution time of 360 minutes.
amd64 / test-pax / single-process-multi-device (1, 8, 1, 1)
The operation was canceled.
amd64 / test-pax / pax-multi-node (4, 2, 1, 2)
The job running on runner GitHub Actions 198 has exceeded the maximum execution time of 360 minutes.
amd64 / test-pax / pax-multi-node (4, 2, 1, 2)
The operation was canceled.
amd64 / test-pax / pax-multi-node (1, 1, 8, 1)
The job running on runner GitHub Actions 194 has exceeded the maximum execution time of 360 minutes.
amd64 / test-pax / pax-multi-node (1, 1, 8, 1)
The operation was canceled.
amd64 / test-pax / pax-multi-node (1, 4, 1, 2)
The job running on runner GitHub Actions 193 has exceeded the maximum execution time of 360 minutes.
amd64 / test-pax / pax-multi-node (1, 4, 1, 2)
The operation was canceled.
amd64 / test-pax / single-process-evaluation (1, 8, 1, 1)
The job running on runner GitHub Actions 188 has exceeded the maximum execution time of 360 minutes.
amd64 / test-pax / single-process-evaluation (1, 8, 1, 1)
The operation was canceled.
amd64 / test-pax / pax-multi-node (1, 1, 1, 1)
The job running on runner GitHub Actions 192 has exceeded the maximum execution time of 360 minutes.
amd64 / test-pax / pax-multi-node (1, 1, 1, 1)
The operation was canceled.
amd64 / test-pax / publish-test / publish
Process completed with exit code 2.
amd64 / test-pax / outcome
Process completed with exit code 2.
amd64 / test-vit / multi-gpu-multi-node (1, 1)
The job running on runner GitHub Actions 210 has exceeded the maximum execution time of 360 minutes.
amd64 / test-vit / multi-gpu-multi-node (1, 1)
The operation was canceled.
amd64 / test-vit / multi-gpu-multi-node (1, 2)
The job running on runner GitHub Actions 211 has exceeded the maximum execution time of 360 minutes.
amd64 / test-vit / multi-gpu-multi-node (1, 2)
The operation was canceled.
amd64 / test-vit / multi-gpu-multi-node (8, 1)
The job running on runner GitHub Actions 212 has exceeded the maximum execution time of 360 minutes.
amd64 / test-vit / multi-gpu-multi-node (8, 1)
The operation was canceled.
amd64 / test-vit / multi-gpu-multi-node (8, 2)
The job running on runner GitHub Actions 213 has exceeded the maximum execution time of 360 minutes.
amd64 / test-vit / multi-gpu-multi-node (8, 2)
The operation was canceled.
amd64 / test-vit / single-process-multi-device (8)
The job running on runner GitHub Actions 209 has exceeded the maximum execution time of 360 minutes.
amd64 / test-vit / single-process-multi-device (8)
The operation was canceled.
amd64 / test-vit / publish-test / publish
Process completed with exit code 2.
amd64 / test-vit / outcome
Process completed with exit code 2.

Artifacts

Produced during runtime
Name Size
artifact-base-build-amd64 Expired
385 Bytes
artifact-base-build-arm64 Expired
385 Bytes
artifact-jax-build-amd64 Expired
363 Bytes
artifact-jax-unit-test-A100 Expired
38.6 KB
artifact-jax-unit-test-V100 Expired
34.2 KB
artifact-pax-build-amd64 Expired
372 Bytes
artifact-t5x-build-amd64 Expired
372 Bytes
integration-test-logs Expired
270 KB
unit-test-logs Expired
5.83 MB