From d63e50fda790cccec9448bd264ea594917b8f789 Mon Sep 17 00:00:00 2001 From: Evgeny Pavlov Date: Fri, 3 Nov 2023 16:30:46 -0700 Subject: [PATCH] Fix note for bicleaner Co-authored-by: Marco Castelluccio --- docs/training-guide.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/docs/training-guide.md b/docs/training-guide.md index 09b9f04c5..8d1ddd611 100644 --- a/docs/training-guide.md +++ b/docs/training-guide.md @@ -137,7 +137,7 @@ and add filtering thresholds to the config. - `0.5` should be a good default value. - Noisier datasets like OpenSubtitles should have higher threshold. -- Set the threshold to `0` to skip cleaning entirely, for example for ParaCrawl dataset that comes already cleaned. +- Set the threshold to `0` to skip cleaning entirely, for example for ParaCrawl dataset that comes already cleaned by bicleaner. ``` bicleaner: