Bad over-fitting #44

Open
SajjadAemmi opened this issue Dec 3, 2024 · 2 comments

Comments

SajjadAemmi commented Dec 3, 2024

I used vsr_trlrs3_23h_base.pth as the pre-trained checkpoint and trained the VSR model on a very small Persian dataset (10 videos, each 10 seconds long).

My goal was to over-fit the model on this dataset so that it gives correct predictions on the same videos.

But the over-fitted model learned only one text and produced exactly that text for every input video at inference time.

What is the problem?
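
A quick way to quantify the symptom described above is to count how many clips decode to the same string. The following is only a sketch: `collapse_report` is a hypothetical helper, not part of this repository, and the sample strings are placeholders rather than real model output.

```python
from collections import Counter


def collapse_report(predictions):
    """Return (most common prediction, fraction of clips that produced it)."""
    if not predictions:
        raise ValueError("predictions must not be empty")
    # Strip whitespace so trivially different strings are counted together.
    counts = Counter(p.strip() for p in predictions)
    text, n = counts.most_common(1)[0]
    return text, n / len(predictions)


# Placeholder hypotheses for three clips (not real output).
text, share = collapse_report(["salam", "salam", "salam"])
print(f"{share:.0%} of clips decoded to {text!r}")
```

A share close to 1.0 on clips with different transcripts suggests the output no longer depends on the video input.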

Owner

mpc001 commented Jan 6, 2025

I assume that the 23-hour pre-trained checkpoint is not a good one and it cannot generalize well to other datasets. Have you tried using the checkpoint pre-trained on the entire dataset for fine-tuning?

Author

SajjadAemmi commented Jan 11, 2025

> I assume that the 23-hour pre-trained checkpoint is not a good one and it cannot generalize well to other datasets. Have you tried using the checkpoint pre-trained on the entire dataset for fine-tuning?

Thank you for your awesome project and your reply. I was using the vsr_trlrs2lrs3vox2avsp_base.pth checkpoint.
I found the solution: I have to freeze the encoder to get slightly better results.
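
For readers who land here, a minimal sketch of what freezing the encoder can look like with a PyTorch-style model. It relies only on `named_parameters()` and the `requires_grad` flag. The `"encoder."` name prefix and both helper names are assumptions for illustration, not this repository's API; print the names from `model.named_parameters()` to find the right prefix for the actual model.

```python
def freeze_encoder(model, prefix="encoder."):
    """Disable gradients for every parameter whose name starts with `prefix`.

    `prefix` is an assumed naming convention; adjust it to the real model.
    Returns the names of the parameters that were frozen.
    """
    frozen = []
    for name, param in model.named_parameters():
        if name.startswith(prefix):
            param.requires_grad = False  # autograd skips this parameter
            frozen.append(name)
    return frozen


def trainable_parameters(model):
    """Collect the parameters that still receive gradients."""
    return [p for _, p in model.named_parameters() if p.requires_grad]
```

Call `freeze_encoder(model)` before building the optimizer, then pass only `trainable_parameters(model)` to it, for example `torch.optim.AdamW(trainable_parameters(model), lr=...)`. With so few clips, this leaves far fewer weights free to drift away from the pre-trained solution.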
