Add KFTO pytorch multi-node multi-gpu tests for GPUs with AMD ROCm and NVIDIA Cuda
abhijeet-dhumal committed Jan 9, 2025
1 parent 1c4c2be commit d8d75d4
Showing 1 changed file with 12 additions and 0 deletions.
@@ -75,3 +75,15 @@ Run Training operator KFTO_MNIST multi-node test with AMD ROCm image
... Training
... TrainingOperator
Run Training Operator KFTO Test TestPyTorchJobMnistMultiNodeWithROCm ${ROCM_TRAINING_IMAGE}

Run Training operator KFTO_MNIST multi-node multi-gpu test with NVIDIA CUDA image
[Documentation] Run Go KFTO_MNIST multi-node multi-gpu test for Training operator using PyTorch job with NVIDIA CUDA image - It requires 2 cluster-nodes with 2 GPUs each

Check warning (Code scanning / Robocop): Line is too long (176/120)
[Tags] Kfto-MultiNodeMultiGpu
... Training
Run Training Operator KFTO Test TestPyTorchJobMnistMultiNodeMultiGpuWithCuda ${CUDA_TRAINING_IMAGE}

Run Training operator KFTO_MNIST multi-node multi-gpu test with AMD ROCm image
[Documentation] Run Go KFTO_MNIST multi-node multi-gpu test for Training operator using PyTorch job with AMD ROCm image - It requires 2 cluster-nodes with 2 GPUs each

Check warning (Code scanning / Robocop): Line is too long (174/120)
[Tags] Kfto-MultiNodeMultiGpu
... Training
Run Training Operator KFTO Test TestPyTorchJobMnistMultiNodeMultiGpuWithROCm ${ROCM_TRAINING_IMAGE}
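
Both Robocop warnings above point at the `[Documentation]` lines, which exceed the 120-character limit. One possible way to address them, which is not part of this commit, is to wrap the documentation text using Robot Framework's `...` continuation marker. The sketch below shows this for the CUDA test case only. The four-space column separators and the indentation are assumptions, since the page rendering collapsed the original whitespace. The keyword `Run Training Operator KFTO Test` and the variable `${CUDA_TRAINING_IMAGE}` are taken from the diff and are expected to be defined elsewhere in the suite.

```robotframework
*** Test Cases ***
# Sketch only: same test case as in the diff, with the documentation wrapped
# so that every line stays under the 120-character limit reported by Robocop.
Run Training operator KFTO_MNIST multi-node multi-gpu test with NVIDIA CUDA image
    [Documentation]    Run Go KFTO_MNIST multi-node multi-gpu test for Training operator
    ...    using PyTorch job with NVIDIA CUDA image.
    ...    It requires 2 cluster-nodes with 2 GPUs each.
    [Tags]    Kfto-MultiNodeMultiGpu
    ...    Training
    Run Training Operator KFTO Test    TestPyTorchJobMnistMultiNodeMultiGpuWithCuda    ${CUDA_TRAINING_IMAGE}
```

The ROCm test case could be wrapped the same way. Note that recent Robot Framework versions join continued documentation lines with newlines rather than spaces, so the rendered documentation may show line breaks where the text was wrapped.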
