From e245168e59706c743005bfd09633d322caa56483 Mon Sep 17 00:00:00 2001
From: Shaopeng Liu
Date: Wed, 25 Oct 2023 10:51:24 -0400
Subject: [PATCH] fix typo

---
 README.md | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/README.md b/README.md
index 36eb37b..b4aa1ef 100644
--- a/README.md
+++ b/README.md
@@ -163,7 +163,7 @@ In the two preceding steps, you will obtain a k-mer sketch file in zip format (i
 In our benchmark with `GTDB representive genomes`, it takes `15 minutes` using `16 threads, 50GB of MEM` on a system equipped with a `3.5GHz AMD EPYC 7763 64-Core Processor`. The processing time can be significant when executed on GTDB all genomes OR with limited resources. If only part of genomes are needed, one may use `sourmash sig` command to extract signatures of interests only.
-####
+
The script `make_training_data_from_sketches.py` extracts the sketches from the Zipfile-format reference database, and then turns them into a form usable by YACHT. In particular, it removes one of any two organisms that have ANI greater than the user-specified threshold as these two organisms are too close to be "distinguishable".