331 iterate over corpus directory and check for corresponding results when benchmarking #332

yaseminbridges · 2024-05-14T13:55:49Z

Currently, we iterate over the PhEval TSV processed output --> find the corresponding phenopacket in the test data directory --> benchmark this way.

This change will instead, iterate over the phenopackets in the test data directory --> find the corresponding PhEval TSV output (if none found this is handled) --> benchmark this way

This will allow us to account for missing results, i.e., if a tool fails on certain results no output is written then this is accounted in the new way of benchmarking

…lace of the processed output

yaseminbridges · 2024-05-14T14:02:10Z

@julesjacobsen can you double-check that what I have done here makes sense? This is what I ended up doing for the AI-MARRVEL benchmarks as there were missing outputs it had different total counts and the comparisons of the ranks were a bit messed up

julesjacobsen

Seems like a better way around to do things in order to be able to track missed samples.

yaseminbridges added 3 commits May 14, 2024 14:50

remove obtain_phenopacket_path_from_pheval_result() method

ee27ee8

Add handling for missing PhEval processed output

4608c65

iterate through phenopacket test data directory for benchmarking in p…

6b7ca28

…lace of the processed output

yaseminbridges linked an issue May 14, 2024 that may be closed by this pull request

Iterate over corpus directory and check for corresponding results when benchmarking #331

Closed

yaseminbridges marked this pull request as ready for review May 14, 2024 13:59

yaseminbridges self-assigned this May 14, 2024

yaseminbridges requested review from julesjacobsen and souzadevinicius May 14, 2024 14:00

julesjacobsen approved these changes May 14, 2024

View reviewed changes

julesjacobsen merged commit 2845ee9 into main May 14, 2024
5 checks passed

yaseminbridges deleted the 331-iterate-over-corpus-directory-and-check-for-corresponding-results-when-benchmarking branch June 2, 2024 12:36

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

331 iterate over corpus directory and check for corresponding results when benchmarking #332

331 iterate over corpus directory and check for corresponding results when benchmarking #332

yaseminbridges commented May 14, 2024

yaseminbridges commented May 14, 2024

julesjacobsen left a comment

331 iterate over corpus directory and check for corresponding results when benchmarking #332

331 iterate over corpus directory and check for corresponding results when benchmarking #332

Conversation

yaseminbridges commented May 14, 2024

yaseminbridges commented May 14, 2024

julesjacobsen left a comment

Choose a reason for hiding this comment