Microbial genomics pipeline to build and analyse genome graphs First version of best practice pipeline for use of graph genomes in microbial genomics, which include
- Use whole genomes as input for constructing pangenome graph, in which the variation graph data model describes the all versus all alignment of many sequences
- Manipulate the graph, such as annotation, visualization and extracting subregions
- Map short reads to graph, get Gam files
- Call variants using the Gam files either for known ones as genotying or for novel variants. The variants called from graph based gam files can be used for further analysis as variants called based on single linear reference.
Softwares used in this pipeline
- circlator version 1.5.5, https://github.com/sanger-pathogens/circlator, resets the start point of a circular genomes.
- Mash version 2.1, https://github.com/marbl/Mash/releases, calculates the distance for the genomes, which can be used as a guide for the similary -p for building the Pangenome graph.
- samtools version 1.9, https://github.com/samtools/samtools.
- pggb version 0.4.0, https://github.com/pangenome/pggb, for pangenome graph building based on all versus all alignment of sequences.
- pgge https://github.com/pangenome/pgge, measures the reconstruction accuracy of a pangenome graph (in the variation graph model). Its goal is to give guidance in finding the best pangenome graph construction tool for a given input data and task.
- odgi version 0.6.2, https://github.com/pangenome/odgi, provides a set of tools ranging from graph building, manipulation, layouting, over graph statistics to graph visualization and gene annotation lift overs https://github.com/pangenome/odgi
- vg from version 1.41.0, https://github.com/vgteam/vg, provides computational methods for creating and manipulating of genome variation graphs.
- gfaestus, https://github.com/chfi/gfaestus, to view the graphs in 2D
- vcftools, https://github.com/vcftools/vcftools
- bcftools, https://github.com/samtools/bcftools
- minigraph-cactus pipeline for comparision:https://github.com/ComparativeGenomicsToolkit/cactus, https://github.com/ComparativeGenomicsToolkit/cactus/blob/master/doc/pangenome.md
- iqtree, https://github.com/iqtree/iqtree2, for phylogeny reconstruction