Skip to content

Repository of utility scripts that I use frequently while working with genomic data

Notifications You must be signed in to change notification settings

SandersonHaley/Useful_Genomics_Scripts

Repository files navigation

Useful_Genomics_Scripts

Repository of utility scripts that I use frequently while working with genomic data

removeshortcontigs.py: removing contigs under 200 nucleotides for NCBI submission, written in python using biopython

chromosome_new_name_move.sh: rename contigs.fasta from output of SPAdes to include isolate name from a list in a text file and move to a new folder

quast.sh: running quast through a list of assemblies from a text file

NG_rename_copy_1.sh: rename fastq files from sequencing folder and all into new folder for assembly

extractgenesfromgenbackProtein.py: extract the DNA sequence of a gene annotated by prokka from a genBank file and translate into a protein sequence extractgenesfromgenbackDNA.py: extract the DNA sequence of a gene annotated by prokka from a genBank file

diamond.sh : running a blastx over all refseq sequences and all the plasmids from a set of genomes predicted by mobsuite using diamond to predict host range on Plato

diamond_setup.sh: setting up diamond.sh to run each plasmid fasta parralel and not one at a time on Plato

Neptune.sh: running Neptune to find biomarkers in three groups of genomes on Plato

fasttreegamma.sh: run fasttree on a core genome alignmewnt from panaroo with the ideal parameters for bacterial tree building

About

Repository of utility scripts that I use frequently while working with genomic data

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published