-
Notifications
You must be signed in to change notification settings - Fork 74
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Update train_ssd.py to support multiple GPUs #4
base: master
Are you sure you want to change the base?
Conversation
…tiple GPUs to work on default single GPU i.e Jetson Co-authored-by: @Mystique-orca <sumeshrmeppadath@gmail.com>
@Mystique-orca and I have enabled multiple GPUs to support for training SSD based Object Detection Model in PyTorch
…tiple GPUs to work on default single GPU i.e Jetson Co-authored-by: @Mystique-orca <sumeshrmeppadath@gmail.com>
Thanks @NISHANTSHRIVASTAV - can you make this work on a single GPU (i.e. Jetson) just the same that it did previously? If it required no changes in CLI arguments/ect on the single-GPU use-case I would merge it. |
@dusty-nv Yes, it will work on a single GPU using the same CLI argument i.e For e.g: For single GPU
For 2 GPUs
For n GPUs
|
The default should be |
Hi @dusty-nv, We have modified the SSD-based Object Detection Training implementation using Multiple GPUs to work on the default single GPU i.e Jetson according to your suggestions in the latest commit. For training with multiple GPUs, it will use the Please let us know if we could provide more information. Thanks |
Hi @dusty-nv Can you let us know if this request can be merged? Many thanks! |
Hello, i've been trying to apply these changes into my 1_train_ssd as i also want to apply a MultiGPU training, but have been facing the recurrent error: Did you had a similar issue or know where i'm making a mistake? This is my first Computer Vision project and i would really appreciate your input! Thanks |
Hello @dusty-nv,
According to your suggestion here Support multiple GPU and the issue referenced here
@Mystique-orca and I have enabled multiple GPUs to support training SSD-based Object Detection Model using PyTorch Framework.
We have tested the modified
train_ssd.py
on our environment for Object Detection using 3 Nvidia Tesla T4 GPUs. We can pass the number of GPUs we need using the argument--gpu-devices
.For e.g:
python train_ssd.py --dataset-type=voc --data=<path-to-dataset-dir> --model-dir=<path-to-model-dir> --batch-size=12 --epochs=400 --workers=0 --use-cuda=True --gpu-devices 0 1 2
Please let us know if we could provide more information.
Hope this will help the community!
Thanks