How to stream diarization and transcription (Whisper) results from an audio file? #211
Do you have any suggestions on how to support streaming results from an audio file in conjunction with transcription? I have read your article here, but it appears to be a bit outdated relative to where the library has progressed, and it only demonstrates use from a mic. I am already able to stream transcription results and would like to simply add speaker diarization.
Hi @ColtonBehannon, if you already have streaming transcriptions (with timestamps), adding diarization would be a matter of running both in parallel and then aligning the outputs according to both sets of timestamps. I recently implemented a now outdated SpeakerAwareTranscription pipeline (see #147), but it's a bit hacky and should be improved before integrating it into the library.
Concerning the blog post, the same principles apply to the latest version of diart; only some names need to be updated. Otherwise you can simply install a previous version like v0.6 or v0.7.
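To make the timestamp alignment idea concrete, here is a minimal sketch in plain Python. It does not use diart's or Whisper's API: the `Segment` and `Turn` classes and the `assign_speakers` function are hypothetical names made up for illustration, and it assumes both systems report times in seconds on the same clock. Each transcribed segment gets the speaker whose turns overlap it the most.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional, Tuple


@dataclass
class Segment:
    """A transcribed chunk with timestamps (hypothetical container)."""
    start: float
    end: float
    text: str


@dataclass
class Turn:
    """A diarization speaker turn (hypothetical container)."""
    start: float
    end: float
    speaker: str


def overlap(a_start: float, a_end: float, b_start: float, b_end: float) -> float:
    """Duration in seconds shared by two intervals (0.0 if disjoint)."""
    return max(0.0, min(a_end, b_end) - max(a_start, b_start))


def assign_speakers(
    segments: List[Segment], turns: List[Turn]
) -> List[Tuple[Optional[str], str]]:
    """Label each segment with the speaker that overlaps it the most.

    Segments that overlap no turn at all are labeled None.
    """
    labeled = []
    for seg in segments:
        # Accumulate overlap per speaker, since one speaker may have several turns.
        totals: Dict[str, float] = {}
        for turn in turns:
            shared = overlap(seg.start, seg.end, turn.start, turn.end)
            if shared > 0:
                totals[turn.speaker] = totals.get(turn.speaker, 0.0) + shared
        speaker = max(totals, key=totals.get) if totals else None
        labeled.append((speaker, seg.text))
    return labeled


if __name__ == "__main__":
    segments = [Segment(0.0, 2.0, "hello there"), Segment(2.0, 4.5, "hi, how are you")]
    turns = [Turn(0.0, 1.8, "speaker0"), Turn(1.8, 4.6, "speaker1")]
    for speaker, text in assign_speakers(segments, turns):
        print(f"[{speaker}] {text}")
```

In a streaming setup you would call something like this on each new batch of segments and turns as they arrive. Word-level timestamps, if your transcription provides them, should give a finer alignment than segment-level ones when speakers change mid-segment.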