-
Notifications
You must be signed in to change notification settings - Fork 1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Piscem performance with bulk RNA datasets #20
Comments
Hi @JosephLalli, Thanks for reaching out. Indeed, that paper only discusses We do not have published benchmarking of that pipeline right now. However, we do have a manuscript in the works that describes piscem in more detail and which will include a description and some benchmarking of Best, |
I have a dataset of 235 ancestrally diverse samples with paired bulk RNA and DNA short-read sequencing. I previously encountered issues with highly variable pseudogene expression leading to clearly erroneous eQTL hits. To address this problem, and to increase QTL detection power, I am benchmarking the effect of different methods of alignment on eQTL results. I have pangenie SV calls in GRCh38 and T2T coordinates for all samples, as well as variant calls using the following combinations of tools:
I intend on phasing the best performing GRCh38 and T2T variant call set, then use a nextflow pipeline I have written to obtain RNA-seq counts by mapping to GRCh38, T2T, or a personalized transcriptome in both reference coordinates generated from the phased variant calls. This requires two separate Salmon index files for each individual, which quickly takes up many terabytes of HDD space. The smaller index size of piscem is thus very interesting! I don't know that I have the time to do proper benchmarking between piscem and salmon (hence my hoping that you had already done it :) but if you have any interest in this dataset I'd be happy to collaborate. My email address is lalli@wisc.edu. |
Hi there,
The alvinfry-piscem paper understandably focuses on single cell sequencing performance, and finds comparable results between piscem-alvinfry and salmon-alvenfry. Based on this result, it's reasonable to assume that piscem and salmon would also produce comparable results when aligning and quantifying bulk-rna reads.
However, has the performance of the two tools been tested in a bulk RNA context? To be frank, I skimmed your alvinfry-piscem paper, so I apologize if this comparison was done in the supplemental figures.
Best,
Joe
The text was updated successfully, but these errors were encountered: