A set of Python commands for preparing inputs to and processing outputs from the Toil RNA-Seq pipeline
Uses Python's subprocess
module and command line utilities to execute subcommands efficiently
Includes additional commands for dual alignment to human and mouse genome and subsequent mouse read filtering. Pipeline configurations including reference files and directory stuctures designed for shared workstations in the Graeber lab @ UCLA.
Image courtesy of our AI overlords
You can install toilkit using pip:
pip install -e git+https://github.com/nbay13/toilkit.git#egg=toilkit
toilkit provides a set of subcommands that can be used from the command line. Here's an overview of the available subcommands:
fastq-merge
: Merge fastq files based on sample namefastq-rename
: Rename fastqs based on key
bbseal
: Run bbduk (adapter trimming) and bbseal (dual reference alignment)bbcat
: Concatenate human and ambiguously-aligned reads into fastqsbbmetrics
: Gather bbseal alignment metrics
make-manifest
: Create a toil-rnaseq manifest filecut-manifest
: Split a manifest file into smaller partsmanifest-key
: Convert a manifest file to a sample key .tsv file
toil-fix
: (if bamQC enabled) Rename outputs with '_FAIL' filename suffixtoil-combine
: Extract info from UUID_XX.tar.gz results -- such as RSEM, QC and/or STAR junctions datatoil-rename
: Final rename of all toil-rnaseq output files
To use a specific subcommand, run toilkit <subcommand>
. For example:
toilkit fastq-merge --indir <input_directory> --outdir <output_directory>
For detailed usage instructions check the tutorial