Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Mimimum NVIDA chip for training #301

Open
reedkotler opened this issue Sep 29, 2023 · 7 comments
Open

Mimimum NVIDA chip for training #301

reedkotler opened this issue Sep 29, 2023 · 7 comments

Comments

@reedkotler
Copy link

What kind of NVIDIA chip would I need for training?

TIA

@mtrofin
Copy link
Collaborator

mtrofin commented Sep 29, 2023

We didn't use any kind of acceleration, but if you want to, I'd assume anything that Tensorflow supports.

@boomanaiden154
Copy link
Collaborator

Anything that Tensorflow supports should work, but note (that at least with how the code is setup currently) that copying the data over to the GPU/doing the training iterations on the GPU/copying everything back over to the CPU takes longer than just doing the training on the CPU, at least when I last tested it. There has been some experimentation with much larger models where acceleration (GPU/TPU) has definitely helped, but none of those models are upstream in this repository.

@reedkotler
Copy link
Author

So even then a modern mac m1 should work?

@reedkotler
Copy link
Author

I see, just a modern CPU with 96 HW threads? "for local training, which is currently the only supported mode, we recommend a high-performance workstation (e.g. 96 hardware threads)."

@mtrofin
Copy link
Collaborator

mtrofin commented Sep 29, 2023

Yup. The bottleneck is currently compile time.

@reedkotler
Copy link
Author

Do you know if anyone is using a mac for this?

@boomanaiden154
Copy link
Collaborator

There was some experimentation with using a Mac (see patches like #260), but it's not really a platform where all the tooling here is guaranteed to work. The tooling in this repository is almost exclusively developed and run on Linux, although running it on a Mac should theoretically work, maybe minus seem slight issues.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants