grouped_gtc_to_bed fails when one of the GTC files are corrupted #293

shukwong · 2024-05-28T20:05:23Z

It seems that when the pipeline is run with only GTC files as input, and that one of the GTC files is corrupted, the grouped_gtc_to_bed task that the cluster_group the GTC file is in will fail without much information given.
It would be good that:

Option 1: report which GTC file is corrupted and fail gracefully
Option 2: report which GTC file is corrupted and continue.

rajwanir · 2025-01-16T15:09:16Z

I think I slightly misunderstood the issue. It's refering to "corrupted" such that the file exists but is not usable and with GTC entry point.

While writing IDAT->GTC->BCF->Plink/BED workflow, I encountered some samples would fail the conversion IDAT>GTC due to IDAT being corrupted i.e. file exists but not usable. I took the option 2 route in that case as described in issue #370 and fixed with 77b172c in PR #359 (yet to be merged into default). With this fix, any samples that have a corrupt IDAT (exists but fails GTC creation), it will flag them as is_missing_gtc=True in samplesheet, skip them from further analysis and continue.

This would be only applicable if user starts with IDATs. If GTC is entry point is used, similar check and skip function need to be implemented. Currently, not addressed. Sorry.

shukwong · 2025-01-16T15:32:04Z

No worries. Thanks for looking into this, Rahim!

shukwong added the enhancement New feature or request label Jan 16, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

grouped_gtc_to_bed fails when one of the GTC files are corrupted #293

grouped_gtc_to_bed fails when one of the GTC files are corrupted #293

shukwong commented May 28, 2024

rajwanir commented Jan 16, 2025

shukwong commented Jan 16, 2025

grouped_gtc_to_bed fails when one of the GTC files are corrupted #293

grouped_gtc_to_bed fails when one of the GTC files are corrupted #293

Comments

shukwong commented May 28, 2024

rajwanir commented Jan 16, 2025

shukwong commented Jan 16, 2025