-
Notifications
You must be signed in to change notification settings - Fork 27
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Fine-tune the size of the dataset and the value of h_div_w #33
Comments
It seems that there exist bugs in your fine-tunning. Using 10 images to fine-tune Infinity will soon overfit and generate the same images to the training set. Could you please post your training logs and testing logs here? |
@l-dawei Could you please post your inference code here? |
@xuanyuzhang21 Could you please post your training logs (b1_stdout.txt)and inference code here? |
This is my training log and inference code. Thanks~ |
@l-dawei Your training '--pn' is 0.06M but your inference '--pn' is 1M |
But after I changed |
@xuanyuzhang21 @l-dawei |
@l-dawei iter100 may be too less for fine-tuing a 1024 model to 256. What about more iterations? What is your current training acc now? |
@JeyesHan |
When I use the following inference code, it will report the above imcompatible errors.
The size of the finetuning checkpoint “local_output/debug/ar-ckpt-giter004K-ep3-iter756-last.pth” seems not to be compatible with the transformer of Infinity. Is any operation required to convert this weight during testing? |
@xuanyuzhang21 You need to convert [ar-ckpt-giter004K-ep3-iter756-last.pth] to [slim-ckpt-giter004K-ep3-iter756-last.pth] first. Please refer to https://github.com/FoundationVision/Infinity/blob/main/tools/run_infinity.py#L137. Besides, enable_model_cache=True will do this automatically. |
Thanks for your help! It works~ |
@xuanyuzhang21 |
@l-dawei Yes. My training result is normal now. |
Dear authors,
Thanks for publishing the codebase and checkpoints, and for the great work! I am interested in the training data used to train Infinity. I used the 10 images you provided for fine-tuning, but got a bad image result. I would like to know the size of the dataset required for fine-tuning. And how did you get the possible values of h_div_w?
Best wishes,
Antonsen
The text was updated successfully, but these errors were encountered: