BED files
CMRG_5030.bed
- This file contains all 5,030 challenging medically relevant gene (CMRG) regions
FixItFelix.collapsed.sorted.bed
- This contains the regions that are impacted due to collapsed errors in the GRCh38 reference
FixItFelix.duplicated.sorted.bed
- This contains the regions that are impacted due to duplicated errors in the GRCh38 reference
FixItFelix_12_CMRG.sorted.bed
- This file contains the CMRG regions that are impacted due to either collapsed or duplicated errors
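To sanity-check how much sequence a BED file spans, the distinct basepairs it covers can be summed. The following is a minimal sketch, not part of this repository: the function name `total_covered_bp` is hypothetical, and it assumes standard BED coordinates (0-based start, end-exclusive) in the first three columns.

```python
from collections import defaultdict


def total_covered_bp(bed_lines):
    """Return the number of distinct basepairs covered by BED intervals.

    Hypothetical helper for illustration; overlapping intervals on the
    same chromosome are merged so no basepair is counted twice.
    """
    by_chrom = defaultdict(list)
    for line in bed_lines:
        line = line.strip()
        # Skip blank lines and the optional header lines BED permits.
        if not line or line.startswith(("#", "track", "browser")):
            continue
        chrom, start, end = line.split()[:3]
        by_chrom[chrom].append((int(start), int(end)))

    total = 0
    for intervals in by_chrom.values():
        intervals.sort()
        cur_start, cur_end = intervals[0]
        for start, end in intervals[1:]:
            if start <= cur_end:
                # Overlapping or adjacent: extend the current merged interval.
                cur_end = max(cur_end, end)
            else:
                total += cur_end - cur_start
                cur_start, cur_end = start, end
        total += cur_end - cur_start
    return total
```

The function accepts any iterable of lines, so an open file handle for one of the BED files listed above can be passed directly.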
Counting basepairs
count_basepair.py
- This script counts the total impacted basepairs of all variants at the sample level. It can be run with Truvari-4.1-dev (https://github.com/ACEnglish/truvari)
count_bp_SNV_INDEL_chr21.json
- Sample output file of the script
dragen_sv_merge.py
- Script for merging STR, SV and VCF files at the sample level (check the dragen_sv_merge_notes file for additional information)
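The idea behind counting impacted basepairs per sample can be illustrated in plain Python. This is only a sketch and does not reproduce count_basepair.py or use the Truvari API. The function names and the size rule are assumptions: a variant is counted for a sample when its genotype carries a non-reference allele, and its size is taken as |SVLEN| when that INFO key is present, the allele length for substitutions, and the REF/ALT length difference for indels.

```python
import re
from collections import Counter


def impacted_bp(ref, alt, info):
    """Size of one variant allele in basepairs (assumed rule, see lead-in)."""
    # Simplification: only the first SVLEN value is read for multi-allelic sites.
    match = re.search(r"(?:^|;)SVLEN=(-?\d+)", info)
    if match:
        return abs(int(match.group(1)))
    if len(ref) == len(alt):
        return len(ref)  # SNV (1 bp) or multi-nucleotide substitution
    # Note: symbolic ALTs such as <DEL> without SVLEN are not handled correctly.
    return abs(len(alt) - len(ref))


def count_bp_per_sample(vcf_lines):
    """Sum impacted basepairs per sample over uncompressed VCF text lines."""
    totals = Counter()
    samples = []
    for line in vcf_lines:
        line = line.rstrip("\n")
        if not line or line.startswith("##"):
            continue
        fields = line.split("\t")
        if line.startswith("#CHROM"):
            samples = fields[9:]  # sample names follow the FORMAT column
            continue
        ref, alts, info = fields[3], fields[4].split(","), fields[7]
        fmt = fields[8].split(":")
        if "GT" not in fmt:
            continue
        gt_index = fmt.index("GT")
        for sample, call in zip(samples, fields[9:]):
            gt = call.split(":")[gt_index]
            # Distinct non-reference allele indices carried by this sample.
            alleles = {int(a) for a in re.split(r"[/|]", gt)
                       if a.isdigit() and a != "0"}
            for allele in alleles:
                totals[sample] += impacted_bp(ref, alts[allele - 1], info)
    return dict(totals)
```

Each distinct non-reference allele is counted once per sample regardless of zygosity; a different convention (for example, counting homozygous calls twice) would need a small change to the inner loop.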
STR benchmarking
truvari_giabtr_testing.pdf
- This document contains a detailed description of the STR benchmarking analysis
Truvarizer.py
- Script used in the STR analysis
EHVCFConverter.py
- Script used to convert DRAGEN STR VCF files to a Truvari-recognizable format