This projects trains mBART and mT5 models to correct errors made in automatic speech recognition by an OpenAI Whisper model. It was done as an assignment in the Natural Language Processing course at Brno Univerisity of Technology
Model | WER |
---|---|
vtlustos/whisper-small (finetuned ASR) | 23.6 |
mT5-small | 26.5 |
mT5-large | 22.4 |
mBART-large-50 | 21.4 |
Model | WER |
---|---|
openai/whisper-small (baseline ASR) | 50.1 |
mT5-small | 49.2 |
mBART-large-50 | 25.1 |
Project uses huggingface and pytorch. You can install all of the requirements using
pip install -r requirements.txt
To get predictions from ASR system, use the src/predict.py script
To save the predictions as huggingface dataset for LM training use the src/process_predictions.py and src/process_parliament_predictions.py scripts
Use src/train_corrector.py
Use src/eval.py
All the computation was done at the MetaCentrum.