mask2former_swin-s-p4-w7-224_lsj_8x2_50e_coco.pth #34

wwwjiahuan · 2025-01-08T07:22:15Z

Where can I download mask2former_swin-s-p4-w7-224_lsj_8x2_50e_coco.pth?

JeyesHan · 2025-01-08T10:08:39Z

@wwwjiahuan
Thanks for your attention to Infinity.
You could refer to this for downloading mask2former.
Hope you enjoy playing with Infinity!

wwwjiahuan · 2025-01-09T03:11:45Z

Thanks for your reply. When I use this weight to run detection on the generated images (which look normal), the output always looks like this, and I don't understand why.
Summary

Total images: 1420
Total prompts: 355
% correct images: 1.13%
% correct prompts: 1.13%

Task breakdown

color_attr = 0.00% (0 / 400)
counting = 10.00% (12 / 120)
colors = 0.00% (0 / 268)
position = 0.00% (0 / 344)
two_object = 0.00% (0 / 248)
single_object = 10.00% (4 / 40)

Overall score (avg. over tasks): 0.03333
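For reference, the summary numbers are internally consistent. The snippet below is a minimal sketch that recomputes them from the per-task counts in the breakdown; the function names are illustrative and are not taken from the GenEval code.

```python
# Per-task (correct, total) counts copied from the task breakdown above.
task_counts = {
    "color_attr": (0, 400),
    "counting": (12, 120),
    "colors": (0, 268),
    "position": (0, 344),
    "two_object": (0, 248),
    "single_object": (4, 40),
}


def overall_score(counts):
    """Unweighted mean of the per-task accuracies."""
    accuracies = [correct / total for correct, total in counts.values()]
    return sum(accuracies) / len(accuracies)


def image_accuracy(counts):
    """Fraction of all images judged correct, pooled across tasks."""
    correct = sum(c for c, _ in counts.values())
    total = sum(t for _, t in counts.values())
    return correct / total


print(f"Overall score (avg. over tasks): {overall_score(task_counts):.5f}")
print(f"% correct images: {image_accuracy(task_counts):.2%}")
```

Because the overall score is an unweighted average over tasks, the two tasks at 10% contribute (0.10 + 0.10) / 6, which matches the reported 0.03333.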

JeyesHan · 2025-01-09T03:26:28Z

Can you check if the generated images are normal?

wwwjiahuan · 2025-01-09T04:47:44Z

They are normal.

wwwjiahuan · 2025-01-09T04:52:36Z

For example, the generated image is normal, but the detections in det.jsonl are abnormal.
{"filename":"output/infinity_2b_evaluation/gen_eval_cfg4_tau1_cfg_insertion_layer0_rewrite_prompt0_round2_real_rewrite/images/00294/samples/00002.jpg","tag":"color_attr","prompt":"a photo of a purple train and a yellow bicycle","correct":false,"reason":"expected train>=1, found 0\nexpected yellow bicycle>=1, found 0 yellow; and 1 purple","metadata":"{\"tag\": \"color_attr\", \"include\": [{\"class\": \"train\", \"count\": 1, \"color\": \"purple\"}, {\"class\": \"bicycle\", \"count\": 1, \"color\": \"yellow\"}], \"prompt\": \"a photo of a purple train and a yellow bicycle\"}","details":"{\"cat\": [[421.0, 634.0, 870.0, 964.0, 0.9570325016975403]], \"bicycle\": [[0.0, 218.0, 1024.0, 809.0, 0.9742059707641602]]}"}
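To make the mismatch easier to see, here is a minimal sketch that compares the classes a record expects with the classes the detector reported. It assumes, based on the record above, that `metadata` and `details` are JSON-encoded strings inside each line of det.jsonl; `summarize_record` is a hypothetical helper, and the sample record is rebuilt with `json.dumps` (filename and reason omitted) so the snippet runs on its own.

```python
import json


def summarize_record(line):
    """Compare expected classes with detected classes for one det.jsonl line."""
    record = json.loads(line)
    # Assumption: both fields are JSON strings nested inside the record.
    metadata = json.loads(record["metadata"])
    details = json.loads(record["details"])

    expected = {item["class"]: item["count"] for item in metadata["include"]}
    detected = {cls: len(boxes) for cls, boxes in details.items()}

    return {
        "expected": expected,
        "detected": detected,
        # Classes found fewer times than required (colors are not checked here).
        "missing": sorted(c for c, n in expected.items() if detected.get(c, 0) < n),
        # Classes the detector reported that the prompt never asked for.
        "unexpected": sorted(c for c in detected if c not in expected),
    }


# Sample record rebuilt from the values quoted in the comment above.
metadata = {
    "tag": "color_attr",
    "include": [
        {"class": "train", "count": 1, "color": "purple"},
        {"class": "bicycle", "count": 1, "color": "yellow"},
    ],
    "prompt": "a photo of a purple train and a yellow bicycle",
}
details = {
    "cat": [[421.0, 634.0, 870.0, 964.0, 0.9570325016975403]],
    "bicycle": [[0.0, 218.0, 1024.0, 809.0, 0.9742059707641602]],
}
line = json.dumps({
    "tag": "color_attr",
    "prompt": metadata["prompt"],
    "correct": False,
    "metadata": json.dumps(metadata),
    "details": json.dumps(details),
})

summary = summarize_record(line)
print("missing:", summary["missing"])
print("unexpected:", summary["unexpected"])
```

For this record the detector reports a "cat" and a "bicycle" but no "train", even though the image is said to look normal.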

JeyesHan · 2025-01-09T05:21:04Z

Can you run the benchmark in GenEval and get normal results? I suspect that you did not load the correct weights for mask2former.
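One way to test that suspicion is to compare the parameter names in the checkpoint with the names the detector expects. The sketch below shows only the comparison step on toy key names, which are hypothetical. In practice the two key lists would likely come from something like `torch.load(path, map_location="cpu")` (MMDetection checkpoints usually keep the weights under a `"state_dict"` entry, which is an assumption here) and from `model.state_dict().keys()`.

```python
def compare_state_dict_keys(checkpoint_keys, model_keys):
    """Report which parameter names differ between a checkpoint and a model."""
    checkpoint_keys = set(checkpoint_keys)
    model_keys = set(model_keys)
    return {
        # Parameters the model needs but the checkpoint does not provide.
        "missing_in_checkpoint": sorted(model_keys - checkpoint_keys),
        # Parameters in the checkpoint that the model has no slot for.
        "unexpected_in_checkpoint": sorted(checkpoint_keys - model_keys),
        "matched": len(checkpoint_keys & model_keys),
    }


# Toy key names for illustration only (hypothetical, not real Mask2Former keys).
model_keys = ["backbone.patch_embed.weight", "panoptic_head.cls_embed.weight"]
checkpoint_keys = ["backbone.patch_embed.weight", "bbox_head.fc_cls.weight"]

report = compare_state_dict_keys(checkpoint_keys, model_keys)
print(report)
```

If many keys are missing or unexpected, the weights were probably not applied to the model, and the detector may be running with partly random parameters.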

l-dawei · 2025-01-09T06:10:42Z

@wwwjiahuan
Hello, I would like to ask if you have retrained or fine-tuned Infinity?

wwwjiahuan closed this as completed Jan 8, 2025

wwwjiahuan reopened this Jan 8, 2025
