-
I am also facing another major issue. With the pipeline, the segmentation and detection of speech segments are extremely good, with very fine-grained segments. However, the number of detected speakers seems to be limited to 2 or 3 at most. For audio files with many speakers (say, more than 5), the results from the pre-trained pipeline still show only 2 or 3 speakers. Have I applied the pre-trained pipeline incorrectly?
-
You have to tune the hyperparameters of the pipeline on your own validation data. See the related tutorial.
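To make the idea concrete, here is a toy sketch of why a clustering hyperparameter can cap the number of detected speakers, and how a small grid search on annotated validation data can fix it. This is not the pipeline's actual code or API: `count_clusters`, `tune_threshold`, the 1-D "embeddings", and the candidate thresholds are all made up for illustration.

```python
# Toy illustration only: not the real pipeline's code or API.

def count_clusters(values, threshold):
    """Single-linkage clustering of 1-D points: start a new cluster
    whenever the gap to the previous sorted point exceeds `threshold`."""
    ordered = sorted(values)
    clusters = 1
    for prev, cur in zip(ordered, ordered[1:]):
        if cur - prev > threshold:
            clusters += 1
    return clusters


def tune_threshold(validation_set, candidates):
    """Pick the candidate threshold whose predicted speaker counts are
    closest, in total absolute error, to the annotated counts."""
    def total_error(threshold):
        return sum(abs(count_clusters(values, threshold) - true_count)
                   for values, true_count in validation_set)
    return min(candidates, key=total_error)


# Hypothetical validation data: (1-D "speaker embeddings", annotated speaker count).
validation_set = [
    ([0.0, 0.1, 1.0, 1.1, 2.0, 2.1, 3.0, 3.1, 4.0, 4.1], 5),
    ([0.0, 0.05, 1.0, 1.05, 2.0, 2.05], 3),
]

# A threshold that is too permissive merges distinct speakers together.
merged = count_clusters(validation_set[0][0], 1.5)

# Grid search over hypothetical candidate thresholds.
best = tune_threshold(validation_set, [0.01, 0.5, 1.5])
print(merged, best, count_clusters(validation_set[0][0], best))
```

With a threshold that is too loose, the five-speaker file collapses into a single cluster; the threshold selected on the validation set recovers all five. The real pipeline has its own hyperparameters, so follow the tutorial for the actual tuning procedure.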