Due to the recent bugs in inference, it would be nice to have a unit test that runs translation on a test set end-to-end using a browser and checks translation quality.
For example, we could:
1. Take our regular flores-test test set for one or several language pairs.
2. Create a web page where each line in the dataset is a separate paragraph.
3. Run translation in the browser.
4. Take the translated page and split it back into lines.
5. Calculate the COMET or BLEU score.
6. Compare the score to the existing evaluation results. The two should match within a small margin of error.
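The page-building, splitting, and comparison steps could look roughly like the sketch below. It uses only the Python standard library and leaves out the browser run and the COMET/BLEU computation. All names here (`build_page`, `split_page`, `score_is_close`), the `data-line` attribute, and the default tolerance of 1.0 are hypothetical choices for illustration. It also assumes the in-browser translation keeps the `<p>` elements and their attributes intact, which would need to be verified.

```python
from html import escape
from html.parser import HTMLParser


def build_page(lines):
    """Wrap each test-set line in its own <p>, tagged with its line index."""
    paragraphs = "\n".join(
        f'<p data-line="{i}">{escape(line)}</p>' for i, line in enumerate(lines)
    )
    return f"<!DOCTYPE html>\n<html><body>\n{paragraphs}\n</body></html>"


class _ParagraphCollector(HTMLParser):
    """Collects the text of every <p data-line="..."> element, keyed by index."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.lines = {}
        self._current = None  # index of the paragraph being read, if any

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "p" and "data-line" in attrs:
            self._current = int(attrs["data-line"])
            self.lines[self._current] = ""

    def handle_endtag(self, tag):
        if tag == "p":
            self._current = None

    def handle_data(self, data):
        if self._current is not None:
            self.lines[self._current] += data


def split_page(html_text):
    """Recover one line per paragraph, ordered by the original line index."""
    parser = _ParagraphCollector()
    parser.feed(html_text)
    parser.close()
    return [parser.lines[i].strip() for i in sorted(parser.lines)]


def score_is_close(browser_score, reference_score, tolerance=1.0):
    """True if the two scores differ by at most `tolerance` (assumed value)."""
    return abs(browser_score - reference_score) <= tolerance
```

Tagging each paragraph with its line index means a dropped or reordered paragraph shows up as a missing or misplaced index rather than silently shifting every later line against its reference. The tolerance would need to be chosen per metric, since COMET and BLEU are on different scales.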