Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Using Assembled Contig Data with Melon: Compatibility and Guidelines #3

Open
ystone1101 opened this issue Sep 19, 2024 · 4 comments
Open

Comments

@ystone1101
Copy link

Hi,

Does Melon accept assembled contig data produced from short-read sequencing data using assemblers (e.g., MEGAHIT, metaSPAdes...) ?

@xinehc
Copy link
Owner

xinehc commented Sep 19, 2024

Hi,

Melon is a read-based taxonomic profiler so it does not work with contigs.

@ystone1101
Copy link
Author

Hi, Thank you for the clarification.

@muhit-emon
Copy link

Hi,

Does Melon work with assembled contigs from long-read data using assemblers like Canu, MetaFlye?

@xinehc
Copy link
Owner

xinehc commented Oct 24, 2024

Hi,

no, the main usage of Melon is to estimate the genome copies of different species in a metagenomic sample. Since assembled contigs do not have the coverage information Melon cannot handle them.

However, it may be used for a rough quality assessment & quick taxonomic classification of MAGs, for example:

melon GCF_900232105.1.fa -d database -o .

INFO: Estimating genome copies ...
INFO: ... found 1.0 copies of genomes (bacteria: 1.0; archaea: 0).
INFO: Assigning taxonomy ...
INFO: Reassigning taxonomy ...                                                                                    
INFO: ... found 1 unique species (bacteria: 1; archaea: 0).
INFO: Done.

If all the eight marker genes are found, the estimated genome copy should be 1.0.

cat GCF_900232105.1_genomic.tsv

...species                       copy     abundance       identity
...Kuenenia stuttgartiensis_A    1.000    1.000000e+00    1.0000/1.0000

If the MAG is present in the reference database (e.g. GTDB), the estimated identity should be 1.0 or very close to it.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants