Pyannote.audio toolkit with MFCC #1680
sumansamui
started this conversation in
Development
Replies: 1 comment 1 reply
-
|
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
I have a few doubts:
How to use pyannote.audio setup if we want to extract MFCC, i.e., using the speech segmentation model with the MFCC feature. Is there any pre-trained model available for those settings? Or we have to train from scratch.
What is the impact of sample frequency on Sincnet? I know all the input audio is downsampled or upsampled to 16k.
We observed that Pyannote provides the same result for 8k and 16k versions of a WAV file in the case of SincNet architecture. Is it because of the same number of Sinc filters in the low-frequency range for both 8k and 16k.
Beta Was this translation helpful? Give feedback.
All reactions