Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Barcoding and training #141

Open
tnn111 opened this issue Apr 2, 2021 · 2 comments
Open

Barcoding and training #141

tnn111 opened this issue Apr 2, 2021 · 2 comments

Comments

@tnn111
Copy link

tnn111 commented Apr 2, 2021

I would like to build some custom models using bonito. As input, I would like to use reads from samples that were barcoded. Is there a standard way of doing this?

Based on information that I could easily find, a way of doing this appears to do demultiplexing using guppy, collect the read IDs corresponding to each barcode and extracting the fast5 reads using the IDs and feeding those into bonito.

Is that how everyone does it or is there a better way? Is there anything written on this?

Thanks, Torben

@aistBMRG
Copy link

I guess you could use porechop or qcat for demultiplexing after basecalling. Also, guppy_barcoder is possible but needs the headers of the fastq/fasta from bonito to be updated with runid= information, as far as I understand as dummy name will work.

Any other thoughts from others, I am exploring the same issue. I tried the above approaches but all yield quite different number of reads after demultiplexing.

Regards,

Dieter

@CWYuan08
Copy link

Hi @aistBMRG, I am wondering how to update the headers of the bonito fastq with runID=information, could you please share your method? Thank you very much!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants