Notes on demultiplexing 515Y reads with cutadapt (last updated 2021-09-17)

Preamble

This repository documents how I do second-round demultiplexing (i.e. for the inline barcodes contained in the 515Y reads) for the Fuhrman lab standard tag protocol after using something like bcl2fastq or illumina-utils to split out the reverse indices.

Steps

First, clone and enter the repo:

git clone https://github.com/jcmcnch/demux-notes.git
cd demux-notes

Now, create a cutadapt-env based on the env subfolder in demux-notes. For example:

conda env create --name cutadapt-env --file env/cutadapt-env.yaml
#could also use mamba
#mamba env create --name cutadapt-env --file env/cutadapt-env.yaml

Next, construct a tab-separated (tsv) spreadsheet (I use excel/libreoffice to edit more easily and then export to tsv) that contains in the first column your sample ID (i.e. what you want your output files to be named) and the # of the barcode in the second column. For the third column, put the name of your sample file (i.e. your partially demultiplexed fastq, but minus the .fastq suffix) wherever you switch reverse indices. An example can be found here.
Go into the subdirectory generate-fastas-for-cutadapt.

cd generate-fastas-for-cutadapt

Copy your tsv file into the input folder, overwriting the input file and erase the example output files.

cp myfile input/bioGEOTRACES-sample-input.tsv
rm output/*

Run the script (uses basic python functions such as csv and sys so should work by default on most pythons).

chmod u+x run.sh
./run.sh

Now copy the output folder to a directory below the location of your demultiplexed data, naming it 00-adapter-fastas. Then enter the directory below 00-adapter-fastas. For example let's say your partially demultiplexed data (i.e. a bunch of different fastqs, each corresponding to a different index) is in /home/inigo/mydemuxdata/00-raw. Then you would do the following:

cp -r output /home/inigo/mydemuxdata/00-adapter-fastas
cd /home/inigo/mydemuxdata/

Now you should be ready to run the script contained in the demux-notes repo, but you may have to edit the input/output folders to suit your needs. But assuming your raw data is in a folder called 00-raw and you don't mind your demultiplexed data going into a folder called demultiplexed you can just run it as follows:

cp <full path to your demux-notes dowloaded repo>/00-run-cutadapt-recursively.sh .
chmod u+x 00-run-cutadapt-recursively.sh
./00-run-cutadapt-recursively.sh

If it's giving you errors, try appending echo to the beginning of line 11. That will print out the commands cutadapt will execute and you can look for typos or missing filenames etc.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
env		env
generate-fastas-for-cutadapt		generate-fastas-for-cutadapt
00-run-cutadapt-recursively.sh		00-run-cutadapt-recursively.sh
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Notes on demultiplexing 515Y reads with cutadapt (last updated 2021-09-17)

Preamble

Steps

About

Releases

Packages

Languages

jcmcnch/demux-notes

Folders and files

Latest commit

History

Repository files navigation

Notes on demultiplexing 515Y reads with cutadapt (last updated 2021-09-17)

Preamble

Steps

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages