Skip to content

vlasakjiri/NLP-Correcting-automatic-speech-recognition-errors-using-pre-trained-language-models

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Correcting automatic speech recognition errors using pre-trained language models

This projects trains mBART and mT5 models to correct errors made in automatic speech recognition by an OpenAI Whisper model. It was done as an assignment in the Natural Language Processing course at Brno Univerisity of Technology

Results

Model WER
vtlustos/whisper-small (finetuned ASR) 23.6
mT5-small 26.5
mT5-large 22.4
mBART-large-50 21.4
Model WER
openai/whisper-small (baseline ASR) 50.1
mT5-small 49.2
mBART-large-50 25.1

Requirements

Project uses huggingface and pytorch. You can install all of the requirements using

pip install -r requirements.txt

Get ASR predictions

To get predictions from ASR system, use the src/predict.py script

Process the predictions for language model training

To save the predictions as huggingface dataset for LM training use the src/process_predictions.py and src/process_parliament_predictions.py scripts

Train the language model to correct errors

Use src/train_corrector.py

Evaluate the language model

Use src/eval.py

MetaCentrum

All the computation was done at the MetaCentrum.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published