
Reproducibility Assistance #24

Open
kanlions opened this issue Jan 24, 2025 · 3 comments

Comments

@kanlions

kanlions commented Jan 24, 2025

Thank you for the great research work and for making it available to the research community. I am particularly interested in zero-shot classification and have worked with the demo code in the zero-shot starter notebook. The datasets are CRC100K (colorectal cancer tissue classification), WSSS4LUAD (LUAD tissue classification), and SICAP (Gleason pattern classification). If I understand correctly, the reported numbers are in the ~0.75 range, and my results are far off. Can you please confirm whether the results were obtained with the same prompts that are available in the prompts folder? It would also be very kind if you could tell me whether I am using the correct data: CRC-VAL-HE-7K (with 8 classes), WSSS4LUAD (training 1.14 GB, validation 150 MB from Grand Challenge), and SICAPv2 (18,793 images). If there is any discrepancy, please point me to the correct prompt files and data sources for these three datasets. It is important for my understanding and future use of this model that the correct baseline is established and that these three datasets work consistently with the associated prompts.

Thanks in advance

@fedshyvana
Collaborator

Did you use the starter code or the ensemble example? We ensemble multiple prompts/templates, similar to CLIP (except we also ensemble multiple classnames per class). An example reproducing the CRC-100K results is provided here: https://github.com/mahmoodlab/CONCH/blob/main/notebooks/zeroshot_classification_example_ensemble.ipynb.

In the paper, we report both non-ensembled (i.e. single prompt) and ensembled results.
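For readers trying to reproduce this, here is a minimal sketch of that ensembling scheme: average the text embeddings over all classname × template combinations for each class, then classify images by cosine similarity. It is only an illustration under assumptions, not the repository's code: `model`, `tokenizer`, and the `encode_text` / `encode_image` methods stand in for any CLIP-style model, and the notebook linked above remains the authoritative reference.

```python
import torch
import torch.nn.functional as F

def build_class_embeddings(model, tokenizer, classnames_per_class, templates, device="cuda"):
    """One embedding per class, averaged over all (classname, template) prompts."""
    class_embeddings = []
    with torch.inference_mode():
        for classnames in classnames_per_class:           # e.g. ["tumor", "adenocarcinoma epithelium"]
            prompts = [t.format(c) for c in classnames for t in templates]
            tokens = tokenizer(prompts).to(device)         # assumed: tokenizer returns a token-id tensor
            emb = model.encode_text(tokens)                # assumed: (n_prompts, d) text embeddings
            emb = F.normalize(emb, dim=-1).mean(dim=0)     # ensemble = mean of unit vectors
            class_embeddings.append(F.normalize(emb, dim=-1))
    return torch.stack(class_embeddings)                   # (n_classes, d)

def zeroshot_predict(model, images, class_embeddings):
    """Nearest class embedding by cosine similarity for a batch of preprocessed images."""
    with torch.inference_mode():
        img_emb = F.normalize(model.encode_image(images), dim=-1)
        return (img_emb @ class_embeddings.T).argmax(dim=-1)
```

The single-prompt (non-ensembled) setting corresponds to passing exactly one classname and one template per class, in which case the averaging step reduces to a single normalized embedding.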

@kanlions
Author

Thank you very much for your reply. If I understand correctly, there are two sets of results: one in the main paper, Figure 2(c), which is what I am referring to, and one in Extended Data Figure 2, where results for the three datasets are shown. Since I am struggling with reproducibility, could you please tell me which subsets of images were used for each of the three datasets, and which prompts? For SICAP, only the primary and secondary Gleason scores are available. It would also be very helpful if the relevant prompts from the paper were uploaded to GitHub, since this is a VLM and the text cues may have a significant impact. My first goal is to reproduce your method and establish a baseline, so that in the future, when I run it on new sets of images and cite your paper, I can be sure the correct results are generated and communicated. Again, thanks for your help.

@fedshyvana
Collaborator

Prompts are listed in Supplementary Tables 38–44. The splits are described in the Methods section under "Downstream evaluation datasets" (CRC-100K = validation set, WSSS4LUAD = subset of the training set, SICAP = test set).
