Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

✨ Install ffmpeg to enable speech-to-text functionality #6131

Closed
jtattersall09403 opened this issue Nov 18, 2024 · 1 comment
Closed

✨ Install ffmpeg to enable speech-to-text functionality #6131

jtattersall09403 opened this issue Nov 18, 2024 · 1 comment

Comments

@jtattersall09403
Copy link

jtattersall09403 commented Nov 18, 2024

Describe the feature request.

I would like to be able to use the openai whisper model in python for transcribing audio to text, in VS code on the AP (https://huggingface.co/openai/whisper-large-v3-turbo).
This requires ffmpeg (https://www.ffmpeg.org/), which requires root access for installation on linux.

Describe the context.

My team are about to start work on a project to transcribe and translate audio. The current state of the art in this space is the OpenAI whisper model. Unlike their other models (the GPT family), Whisper can be downloaded and run locally. It will be critical for our project to be able to test whisper for our use case. In order to be able to do this, we need to be able to run it on the AP. And to do that, we need ffmpeg :)

Value / Purpose

Enable use of OpenAI whisper as a critical bottleneck for a prisons data science project.

User Types

Data Scientists in Prisons Data Science.

@jacobwoffenden
Copy link
Member

Hi @jtattersall09403,

This shipped in release 2.7.0!

Cheers

@github-project-automation github-project-automation bot moved this from 👀 TODO to 🎉 Done in Analytical Platform Nov 26, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
Archived in project
Development

No branches or pull requests

2 participants