
Nightly Containers on CUDA 12.1 (JAX pinned) (schedule) #38

Triggered via schedule November 16, 2023 09:33
Status Failure
Total duration 6h 45m 10s
Artifacts 27
This run and associated checks have been archived and are scheduled for deletion.

cuda-121-jax-pin.yaml

on: schedule
Job                                             Duration
metadata                                        0s
Matrix: test-jax / unit-test
test-jax / runner / launch                      8s
Matrix: build-pax / build
Matrix: test-t5x / multi-gpu-multi-node
Matrix: test-t5x / single-process-multi-device
Matrix: build-rosetta-t5x / build
test-t5x / summary                              0s
build-pax / merge                               12s
test-t5x / metrics                              15s
build-rosetta-t5x / merge                       0s
Matrix: test-pax / multi-process-multi-device
Matrix: test-pax / single-process-evaluation
Matrix: test-pax / single-process-multi-device
test-pax / summary                              0s
Matrix: build-rosetta-pax / build
test-t5x / publish-test / publish               11s
test-pax / metrics                              0s
build-rosetta-pax / merge                       8s
test-t5x / outcome                              0s
test-pax / publish-test / publish               3m 17s
build-summary                                   0s
test-pax / outcome                              0s
finalize / upload-badge / action                2m 27s
finalize / report / action                      2m 27s
finalize / publish-badge / action

Annotations

4 errors
build-rosetta-t5x / build (amd64)
buildx failed with: ERROR: failed to solve: process "/bin/sh -c <<\"EOF\" bash -e\nbash create-distribution.sh \\\n -p patchlist-t5x.txt \\\n -m https://github.com/nvjax-svc-0/t5x.git \\\n -d $(dirname $(python -c \"import t5x; print(*t5x.__path__)\")) \\\n -e /opt/t5x-mirror\nbash create-distribution.sh \\\n -p patchlist-flax.txt \\\n -m https://github.com/nvjax-svc-0/flax.git \\\n -d $(dirname $(python -c \"import flax; print(*flax.__path__)\")) \\\n -e /opt/flax-mirror\nrm -rf $(find /opt -name \"__pycache__\")\nEOF" did not complete successfully: exit code: 1
test-pax / multi-process-multi-device (4, 2, 1, 2)
The job running on runner GitHub Actions 80 has exceeded the maximum execution time of 360 minutes.
test-pax / multi-process-multi-device (4, 2, 1, 2)
The operation was canceled.
test-pax / outcome
Process completed with exit code 1.

Artifacts

Produced during runtime
Name                                      Size      Status
6888837735-16DP1FSDP1TP1PP                15.3 MB   Expired
6888837735-1DP1FSDP1TP1PP                 1.89 MB   Expired
6888837735-1DP2FSDP4TP1PP_single_process  1.91 MB   Expired
6888837735-1DP8FSDP1TP1PP                 8.16 MB   Expired
6888837735-1G1N                           599 KB    Expired
6888837735-1G2N                           782 KB    Expired
6888837735-1P1G                           599 KB    Expired
6888837735-1P2G                           599 KB    Expired
6888837735-1P4G                           599 KB    Expired
6888837735-1P8G                           599 KB    Expired
6888837735-2DP1FSDP1TP4PP                 5.41 MB   Expired
6888837735-2G1N                           782 KB    Expired
6888837735-2G2N                           1.12 MB   Expired
6888837735-4DP1FSDP2TP1PP                 8.26 MB   Expired
6888837735-4G1N                           1.12 MB   Expired
6888837735-4G2N                           1.84 MB   Expired
6888837735-8DP1FSDP1TP1PP                 8.14 MB   Expired
6888837735-8DP1FSDP1TP1PP_eval            444 MB    Expired
6888837735-8DP1FSDP1TP1PP_single_process  1.35 MB   Expired
6888837735-8G1N                           1.83 MB   Expired
6888837735-8G2N                           3.26 MB   Expired
artifact-jax-unit-test-A100               42.2 KB   Expired
artifact-jax-unit-test-V100               42.5 KB   Expired
image-name-pax-amd64                      57 Bytes  Expired
image-name-upstream-pax-amd64             66 Bytes  Expired
image-name-upstream-pax-arm64             66 Bytes  Expired
metrics-test-log                          61.8 KB   Expired
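
Most artifact names above follow a visible pattern: a numeric prefix (6888837735), a hyphen, a run of count-plus-tag pairs such as 8DP1FSDP1TP1PP or 4G2N, and an optional suffix such as _eval or _single_process. The page does not define the tags, so the sketch below only splits names into their parts and does not interpret them. Treating the prefix as a run ID is an assumption, and the helper name parse_artifact_name is hypothetical, not part of the workflow.

```python
import re

# Assumed layout (inferred from the names on this page, not documented here):
#   <digits>-<count><TAG><count><TAG>...[_<suffix>]
_NAME = re.compile(
    r"^(?P<run_id>\d+)-(?P<config>(?:\d+[A-Z]+)+)(?:_(?P<suffix>\w+))?$"
)
_PAIR = re.compile(r"(\d+)([A-Z]+)")


def parse_artifact_name(name):
    """Split an artifact name into prefix, tag counts, and suffix.

    Returns None for names that do not match the assumed layout,
    for example "metrics-test-log".
    """
    match = _NAME.match(name)
    if match is None:
        return None
    # Map each uppercase tag to the integer that precedes it.
    counts = {tag: int(count) for count, tag in _PAIR.findall(match.group("config"))}
    return {
        "run_id": match.group("run_id"),
        "counts": counts,
        "suffix": match.group("suffix"),
    }


if __name__ == "__main__":
    for example in ("6888837735-8DP1FSDP1TP1PP_eval", "6888837735-4G2N"):
        print(example, parse_artifact_name(example))
```

Names that do not fit the pattern, such as artifact-jax-unit-test-A100 or image-name-pax-amd64, return None rather than a partial guess.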