Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

if locus name is too long, the extract step doesn't work #19

Open
Zachary-Muscavitch opened this issue Dec 25, 2024 · 2 comments
Open

Comments

@Zachary-Muscavitch
Copy link

I submitted two fasta target files, both are identical except for the name of the sequence. I had longer "gene" names to be easily readable and it wouldn't work. I shortened the length of the gene names, and it seems to now work. I assume this is because no one expected someone to use such a long name. Perhaps consider adding a maximum length to the documentation?

Taxon-Contig_15_Locus_3_gene_B
Taxon-C15_L3_gene_B

best,
Zach

@edgardomortiz
Copy link
Owner

Hi @Zachary-Muscavitch ,

I can't replicate the issue, and I haven't met this limitation before. Perhaps you can send me the actual sample names you are trying, and maybe the target file too?

Edgardo

image

@Zachary-Muscavitch
Copy link
Author

Sorry. The issue was the in the construction of the target file. It was late and I was changing many things, and thought it was the locus name.

Seems like the issue was an error when I extracted the CDS to make the target file. Some nucleotides were not extracted, in their place were ? characters. I ended up translating the nucleotides into aminoacid sequences. When in amino acid format, the ? in the target file did not throw any errors.

Sorry for the trouble.

Additionally, I have a question about why Ns are introduced into the sequences at the extraction step? Is there a flag I can add to prevent this? It seems to not be associated with the assembly process, as it happens when I use genomes with the FASTA option.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants