Skip to content

~NGC release testing #87

~NGC release testing

~NGC release testing #87

Manually triggered September 4, 2024 23:48
Status Failure
Total duration 1h 57m 53s
Artifacts 29

ngc-release-testing.yaml

on: workflow_dispatch
Matrix: test-maxtext / maxtext-multinode
Matrix: test-maxtext / single-process-multi-device
Matrix: test-jax / run-unit-test
Matrix: test-rosetta-pax / rosetta-pax-multi-node-te
Matrix: test-rosetta-pax / rosetta-pax-multi-node
Matrix: test-rosetta-pax / rosetta-pax-single-node-dropout-te
Matrix: test-rosetta-pax / single-process-evaluation-te
Matrix: test-rosetta-pax / single-process-multi-device-te
test-jax  /  ...  /  launch-slurm-runner
1h 53m
test-jax / runner / launch-slurm-runner
test-maxtext  /  test-maxtext-summary
0s
test-maxtext / test-maxtext-summary
test-maxtext  /  test-maxtext-metrics
10s
test-maxtext / test-maxtext-metrics
test-rosetta-pax  /  test-pax-rosetta-summary
0s
test-rosetta-pax / test-pax-rosetta-summary
test-rosetta-pax  /  test-pax-rosetta-metrics
13s
test-rosetta-pax / test-pax-rosetta-metrics
test-maxtext  /  ...  /  sitrep
15s
test-maxtext / test-maxtext-sitrep / sitrep
test-rosetta-pax  /  ...  /  sitrep
6s
test-rosetta-pax / test-pax-rosetta-sitrep / sitrep
test-maxtext  /  test-maxtext-outcome
0s
test-maxtext / test-maxtext-outcome
test-rosetta-pax  /  test-pax-rosetta-outcome
0s
test-rosetta-pax / test-pax-rosetta-outcome
finalize  /  workflow-badge
7s
finalize / workflow-badge
finalize  /  report
7s
finalize / report
finalize  /  upload-badge
3s
finalize / upload-badge
finalize  /  publish-badge
2s
finalize / publish-badge
Fit to window
Zoom out
Zoom in

Annotations

6 errors
test-jax / jax-V100-unit-test
Process completed with exit code 1.
test-jax / jax-A100-unit-test
Process completed with exit code 1.
test-maxtext / test-maxtext-metrics
Process completed with exit code 1.
test-rosetta-pax / test-pax-rosetta-metrics
Process completed with exit code 1.
test-maxtext / test-maxtext-outcome
Process completed with exit code 1.
test-rosetta-pax / test-pax-rosetta-outcome
Process completed with exit code 1.

Artifacts

Produced during runtime
Name Size
artifact-final-report Expired
1.86 KB
artifact-maxtext-test Expired
657 Bytes
artifact-rosetta-pax-mgmn-test Expired
729 Bytes
artifact-workflow-metadata Expired
268 Bytes
jax-unit-test-A100 Expired
863 Bytes
jax-unit-test-V100 Expired
861 Bytes
rosetta-pax-10711244461-16DP1FSDP1TP1PP_TE Expired
1.46 KB
rosetta-pax-10711244461-1DP1FSDP1TP1PP_TE Expired
1.42 KB
rosetta-pax-10711244461-1DP2FSDP4TP1PP_single_process_TE Expired
1.59 KB
rosetta-pax-10711244461-1DP8FSDP1TP1PP_TE Expired
1.45 KB
rosetta-pax-10711244461-2DP1FSDP1TP4PP Expired
1.42 KB
rosetta-pax-10711244461-2DP1FSDP2TP4PP Expired
1.43 KB
rosetta-pax-10711244461-4DP1FSDP2TP1PP Expired
1.41 KB
rosetta-pax-10711244461-4DP1FSDP2TP1PP_TE Expired
1.45 KB
rosetta-pax-10711244461-5B_fused_attn_0 Expired
1.43 KB
rosetta-pax-10711244461-5B_fused_attn_1 Expired
1.43 KB
rosetta-pax-10711244461-8DP1FSDP1TP1PP Expired
1.41 KB
rosetta-pax-10711244461-8DP1FSDP1TP1PP_TE Expired
1.45 KB
rosetta-pax-10711244461-8DP1FSDP1TP1PP_eval_TE Expired
1.51 KB
rosetta-pax-10711244461-8DP1FSDP1TP1PP_single_process_TE Expired
1.59 KB
rosetta-pax-10711244461-8DP_TE_dropout Expired
1.45 KB
rosetta-pax-10711244461-LLaMA_eval_TE Expired
1.41 KB
upstream-maxtext-10711244461-1DP1FSDP1TP1PP Expired
880 Bytes
upstream-maxtext-10711244461-1DP1FSDP8TP1PP Expired
909 Bytes
upstream-maxtext-10711244461-1DP2FSDP4TP1PP_single_process Expired
940 Bytes
upstream-maxtext-10711244461-1DP4FSDP2TP1PP Expired
909 Bytes
upstream-maxtext-10711244461-1DP8FSDP1TP1PP Expired
906 Bytes
upstream-maxtext-10711244461-2DP2FSDP2TP1PP Expired
909 Bytes
upstream-maxtext-10711244461-4DP2FSDP2TP1PP Expired
902 Bytes