tuberculosis-resistance-classification

Python based project for pipelining on classifying Mycobacterium Tuberculosis first-line drugs resistance from DNA genome sequence powered by ML model

Tools

Tools/library used in the pipeline :

Tabula : Extracting DST data from pdf files into csv
enaWebTools (FTP) : https://github.com/enasequence/enaBrowserTools
FASP Aspera client : https://download.asperasoft.com/download/sw/connect/3.9.9/ibm-aspera-connect-3.9.9.177872-linux-g2.12-64.tar.gz
ARIBA : https://github.com/sanger-pathogens/ariba
scikit-learn : https://github.com/scikit-learn/scikit-learn
From scratch RF and DT code : https://github.com/zhaoxingfeng/RandomForest
Other library needed in python3 is Numpy,Pandas,Bowtie,CD-HIT,sklearn,matplotlib, etc.

Flowchart

What to do next

Check threading process (it seems still have error)
Implementing into other multilabel cases
This MLRF could be integrated with other modified RF/DT algorithm, or pipelined into other Classifier process

Progress Report and Step by Step

Gdocs : https://docs.google.com/document/d/1HKc87iLV8qUzujZ_jzEfRqFR9IoTSv-x7UFBGVyLq54/edit?usp=sharing

Name		Name	Last commit message	Last commit date
Latest commit History 53 Commits
.idea		.idea
ariba_out		ariba_out
bash_script		bash_script
code		code
data_acquisition		data_acquisition
img		img
.gitattributes		.gitattributes
README.md		README.md
VcXsrv.txt		VcXsrv.txt
clean_ext4_vhd_instruction.txt		clean_ext4_vhd_instruction.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

tuberculosis-resistance-classification

Tools

Flowchart

What to do next

Progress Report and Step by Step

About

Releases

Packages

Languages

Fortissible/tuberculosis-resistance-classification

Folders and files

Latest commit

History

Repository files navigation

tuberculosis-resistance-classification

Tools

Flowchart

What to do next

Progress Report and Step by Step

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages