Skip to content

~NGC release testing #81

~NGC release testing

~NGC release testing #81

Manually triggered May 1, 2024 12:43
Status Failure
Total duration 3h 56m 57s
Artifacts 31

ngc-release-testing.yaml

on: workflow_dispatch
Matrix: test-maxtext / maxtext-multinode
Matrix: test-maxtext / single-process-multi-device
Matrix: test-jax / run-unit-test
Matrix: test-rosetta-pax / rosetta-pax-multi-node-te
Matrix: test-rosetta-pax / rosetta-pax-multi-node
Matrix: test-rosetta-pax / rosetta-pax-single-node-dropout-te
Matrix: test-rosetta-pax / single-process-evaluation-te
Matrix: test-rosetta-pax / single-process-multi-device-te
test-jax  /  ...  /  launch-slurm-runner
3h 15m
test-jax / runner / launch-slurm-runner
test-maxtext  /  summary
0s
test-maxtext / summary
test-maxtext  /  metrics
15s
test-maxtext / metrics
test-rosetta-pax  /  summary
0s
test-rosetta-pax / summary
test-rosetta-pax  /  metrics
18s
test-rosetta-pax / metrics
test-maxtext  /  ...  /  sitrep
10s
test-maxtext / sitrep / sitrep
test-rosetta-pax  /  ...  /  sitrep
10s
test-rosetta-pax / sitrep / sitrep
test-maxtext  /  outcome
0s
test-maxtext / outcome
test-rosetta-pax  /  outcome
0s
test-rosetta-pax / outcome
finalize  /  workflow-badge
4s
finalize / workflow-badge
finalize  /  report
6s
finalize / report
finalize  /  upload-badge
20s
finalize / upload-badge
finalize  /  publish-badge
0s
finalize / publish-badge
Fit to window
Zoom out
Zoom in

Annotations

4 errors
test-jax / jax-V100-unit-test
Process completed with exit code 1.
test-jax / jax-A100-unit-test
Process completed with exit code 1.
test-rosetta-pax / outcome
Process completed with exit code 1.
finalize / upload-badge
Unable to download artifact(s): Failed to GetSignedArtifactURL: Unable to make request: ECONNRESET If you are using self-hosted runners, please make sure your runner has access to all GitHub endpoints: https://docs.github.com/en/actions/hosting-your-own-runners/managing-self-hosted-runners/about-self-hosted-runners#communication-between-self-hosted-runners-and-github

Artifacts

Produced during runtime
Name Size
artifact-final-report Expired
615 Bytes
artifact-maxtext-test Expired
1.67 KB
artifact-rosetta-pax-mgmn-test Expired
2.62 KB
artifact-workflow-metadata Expired
267 Bytes
jax-unit-test-A100 Expired
19.4 KB
jax-unit-test-V100 Expired
22.4 KB
rosetta-pax-8909473523-16DP1FSDP1TP1PP_TE Expired
592 KB
rosetta-pax-8909473523-1DP1FSDP1TP1PP_TE Expired
88.5 KB
rosetta-pax-8909473523-1DP2FSDP4TP1PP_single_process_TE Expired
106 KB
rosetta-pax-8909473523-1DP8FSDP1TP1PP_TE Expired
327 KB
rosetta-pax-8909473523-2DP1FSDP1TP4PP Expired
293 KB
rosetta-pax-8909473523-2DP1FSDP2TP4PP Expired
534 KB
rosetta-pax-8909473523-4DP1FSDP2TP1PP Expired
370 KB
rosetta-pax-8909473523-4DP1FSDP2TP1PP_TE Expired
321 KB
rosetta-pax-8909473523-5B_fused_attn_0 Expired
391 KB
rosetta-pax-8909473523-5B_fused_attn_1 Expired
395 KB
rosetta-pax-8909473523-8DP1FSDP1TP1PP Expired
370 KB
rosetta-pax-8909473523-8DP1FSDP1TP1PP_TE Expired
323 KB
rosetta-pax-8909473523-8DP1FSDP1TP1PP_eval_TE Expired
76.7 KB
rosetta-pax-8909473523-8DP1FSDP1TP1PP_single_process_TE Expired
106 KB
rosetta-pax-8909473523-8DP_TE_dropout Expired
328 KB
rosetta-pax-8909473523-LLaMA_eval_TE Expired
227 KB
rosetta-pax-metrics-test-log Expired
10.1 KB
upstream-maxtext-8909473523-1DP1FSDP1TP1PP Expired
9.75 KB
upstream-maxtext-8909473523-1DP1FSDP8TP1PP Expired
12.4 KB
upstream-maxtext-8909473523-1DP2FSDP4TP1PP_single_process Expired
9.87 KB
upstream-maxtext-8909473523-1DP4FSDP2TP1PP Expired
12.6 KB
upstream-maxtext-8909473523-1DP8FSDP1TP1PP Expired
12.6 KB
upstream-maxtext-8909473523-2DP2FSDP2TP1PP Expired
12.5 KB
upstream-maxtext-8909473523-4DP2FSDP2TP1PP Expired
15.1 KB
upstream-maxtext-metrics-test-log Expired
4.53 KB