You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
NCBI FTP site: ftp.ncbi.nlm.nih.gov (username: anonymous, pw: your email)
location /gene/DATA has a file gene2refseq that lists all the known genes, their corresponding locations and the accessions for the genomes that they belong to.
Select a bunch of accessions for which we have gene information (from the gene2refseq)
Download their corresponding whole genomes
Create a mapping file that maps regions to genes (slight modification of gene2refseq)
Download the genes themselves for training of sourmash and Diamond
Create a file that takes a bbmap simulation and spits out the genes covered in the simulation, as well as the %bp of the gene covered and the mean/median/summary of the total amount of bp mapped to each gene
The text was updated successfully, but these errors were encountered:
The following will help with that task:
NCBI FTP site: ftp.ncbi.nlm.nih.gov (username: anonymous, pw: your email)
location
/gene/DATA
has a file gene2refseq that lists all the known genes, their corresponding locations and the accessions for the genomes that they belong to.Pull out specific regions of a genome:
Idea:
The text was updated successfully, but these errors were encountered: