Suggestions on how to best separate speaker identities #923
Unanswered
Rahul-Brito
asked this question in
Q&A
Replies: 1 comment
-
I recommend using Speechbrain ECAPA-TDNN pretrained model for that purpose. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hello! Your work is awesome, and I have been playing with your pipeline for a few months (a few months last year, now back to using it this year).
We have a population of speakers who are all reading the same passage. We want to be able to determine the relative distance between speakers i.e. figure out which speakers have similar voice and which are different. Ideally there is some form of a gradient (speaker 1 might be most similar to speaker 0, speaker 2 less so, and speaker 3 far away. Of course speaker 1, 2, and 3 would have their own interrelationships).
I was curious if you had a suggestion on the best way to do so with your pre-trained pipelines/models (those seem to be effective enough so far an we don't have a big training dataset so we have not done any retraining).
What I have tried so far:
Beta Was this translation helpful? Give feedback.
All reactions