How to stream diarization and transcription (Whisper) results from an audio file? #211

ColtonBehannon · 2023-11-13T20:33:39Z

Do you have any suggestions on how to support streaming results from an audio file together with transcription? I have read your article here, but it appears to be somewhat outdated relative to the current state of the library, and it only demonstrates use with a microphone. I can already stream transcription results and would like to add speaker diarization on top.

Answered by juanmc2005


juanmc2005 (Maintainer) · 2023-11-14T16:18:32Z

Hi @ColtonBehannon, if you already have streaming transcriptions with timestamps, adding diarization is a matter of running both in parallel and then aligning the two outputs by their timestamps. I recently implemented a SpeakerAwareTranscription pipeline (see #147), which is now outdated; it's a bit hacky and should be improved before being integrated into the library.

Concerning the blog post, the same principles apply to the latest version of diart; only some names need to be updated. Otherwise, you can simply install a previous version like v0.6 or v0.7.
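
For readers landing on this thread, the timestamp alignment described above can be sketched roughly as follows. This is only an illustration under stated assumptions, not diart's API and not the pipeline from #147: the `Segment` class and the `overlap` and `assign_speakers` functions are hypothetical names, and both streams are assumed to report times in seconds on the same clock. Each transcribed chunk is given the speaker whose turn overlaps it the most.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class Segment:
    """Hypothetical container: a time span in seconds plus a label.

    For transcription the label is the text; for diarization it is the speaker.
    """
    start: float
    end: float
    label: str


def overlap(a: Segment, b: Segment) -> float:
    """Duration in seconds shared by two segments (0.0 if they do not intersect)."""
    return max(0.0, min(a.end, b.end) - max(a.start, b.start))


def assign_speakers(
    chunks: List[Segment],
    turns: List[Segment],
    unknown: str = "unknown",
) -> List[Tuple[str, str]]:
    """Pair each transcribed chunk with the speaker turn it overlaps most.

    Chunks that overlap no turn at all are labeled with `unknown`.
    """
    result = []
    for chunk in chunks:
        best_speaker, best_overlap = unknown, 0.0
        for turn in turns:
            shared = overlap(chunk, turn)
            if shared > best_overlap:
                best_speaker, best_overlap = turn.label, shared
        result.append((best_speaker, chunk.label))
    return result
```

In a streaming setup, one would presumably call something like `assign_speakers` each time a new batch of transcription chunks and diarization turns arrives, keeping a short buffer of recent turns so that a chunk spanning a batch boundary can still be matched.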

1 reply

ColtonBehannon Nov 14, 2023
Author

Thanks for the response. After leaving the comment, I was actually able to get a basic implementation going just by changing some of the names from the blog post. I will check out #147 as well. Thanks!
