-
Notifications
You must be signed in to change notification settings - Fork 4
Full workflow for collaborators
Clara Qin edited this page Jul 1, 2020
·
26 revisions
As of June 29, 2020, the full ITS workflow should (pending any bugs) consists of the following scripts in the ./code/workflow
directory, run in order:
00_new_server_setup.R
01a_dada2_workflow_its.R
02a_dada2_to_phyloseq_its.R
Parameters should be configured to suit your file system structure by making direct edits to params.R
.
Ensure that you have installed all dependencies including R packages and additional software. Of particular note:
- You will need to install
cutadapt
to use01a_dada2_worfklow_its.R
. Installation instructions can be found here. (Note that if you have both Python2 and Python3 installed, you will need to replacepip
withpip3
in the Quick Installation command.) Once installed, change theCUTADAPT_PATH
variable inparams.R
to match the location of thecutadapt
script on your system. - Lastly, you will need to install the latest General FASTA Release of the UNITE database for taxonomy assignment of ITS sequences, available here. Once installed, change the
UNITE_REF_PATH
variable inparams.R
to match the location of the FASTA file on your system.
In addition, you should download the latest versions of R and RStudio before starting these scripts. You may also need to update Python to succesfully install cutadapt
.
Be sure to run these with your working directory set to the project directory, not to the /code
subdirectory.