Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

331 iterate over corpus directory and check for corresponding results when benchmarking #332

Conversation

yaseminbridges
Copy link
Contributor

Currently, we iterate over the PhEval TSV processed output --> find the corresponding phenopacket in the test data directory --> benchmark this way.

This change will instead, iterate over the phenopackets in the test data directory --> find the corresponding PhEval TSV output (if none found this is handled) --> benchmark this way

This will allow us to account for missing results, i.e., if a tool fails on certain results no output is written then this is accounted in the new way of benchmarking

@yaseminbridges yaseminbridges marked this pull request as ready for review May 14, 2024 13:59
@yaseminbridges yaseminbridges self-assigned this May 14, 2024
@yaseminbridges
Copy link
Contributor Author

@julesjacobsen can you double-check that what I have done here makes sense? This is what I ended up doing for the AI-MARRVEL benchmarks as there were missing outputs it had different total counts and the comparisons of the ranks were a bit messed up

Copy link
Contributor

@julesjacobsen julesjacobsen left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Seems like a better way around to do things in order to be able to track missed samples.

@julesjacobsen julesjacobsen merged commit 2845ee9 into main May 14, 2024
5 checks passed
@yaseminbridges yaseminbridges deleted the 331-iterate-over-corpus-directory-and-check-for-corresponding-results-when-benchmarking branch June 2, 2024 12:36
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Iterate over corpus directory and check for corresponding results when benchmarking
2 participants