-
-
Notifications
You must be signed in to change notification settings - Fork 808
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
These voice can not split correctly #1524
Comments
Thank you for your issue.You might want to check the FAQ if you haven't done so already. Feel free to close this issue if you found an answer in the FAQ. If your issue is a feature request, please read this first and update your request accordingly, if needed. If your issue is a bug report, please provide a minimum reproducible example as a link to a self-contained Google Colab notebook containing everthing needed to reproduce the bug:
Providing an MRE will increase your chance of getting an answer from the community (either maintainers or other power users). Companies relying on
|
Without providing details about the code you tried, it is kind of difficult to tell. AUDIO = "asr_res_240-243_1_audio.mp3"
from pyannote.audio import Audio
io = Audio(mono="downmix", sample_rate=16000)
waveform, sample_rate = io(AUDIO)
audio = {"waveform": waveform, "sample_rate": sample_rate}
from pyannote.audio import Inference
inference = Inference("pyannote/segmentation-3.0", window="whole")
prediction = inference(audio)
from matplotlib import pyplot as plt
plt.plot(prediction)
plt.legend(['speaker#1', 'speaker#2', 'speaker#3']) |
@hbredin hi, the audio acutially only have 2 people, the first period is person1, and rest is a man voice. the cliff of speaker3 seems detected the later man voice, but how can i tell, (i actually just need split 2 person), this cliff is exactly what I want? |
This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions. |
asr_res_240-243_1_audio.zip
the output time and label
but the speaker are clearly 2 speaker first and later, how to precisely get the splitter time in the middle?
The text was updated successfully, but these errors were encountered: