We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Howdy,
this is more of question to validate what I have been doing as a error. The setup I am using runs just fine.
Apple Silicone M3 - macBook - OS X Sequoia latest 15.3 (24D60) version - 18GB ram
Using "parler-tts/parler-tiny-v1-jenny" voice for testing. torch_device = "mps"
Yet my inference times on a 100 char sentence is around 90seconds. It averages to about 10x per second of audio generated.
Is this the expected speed, or is something totally wrong at my end ?
The text was updated successfully, but these errors were encountered:
No branches or pull requests
Howdy,
this is more of question to validate what I have been doing as a error. The setup I am using runs just fine.
Apple Silicone M3 - macBook - OS X Sequoia latest 15.3 (24D60) version - 18GB ram
Using "parler-tts/parler-tiny-v1-jenny" voice for testing.
torch_device = "mps"
Yet my inference times on a 100 char sentence is around 90seconds.
It averages to about 10x per second of audio generated.
Is this the expected speed, or is something totally wrong at my end ?
The text was updated successfully, but these errors were encountered: