I used vsr_trlrs3_23h_base.pth as the pre-trained checkpoint and trained the VSR model on a very small Persian dataset (10 videos, each 10 seconds long).
My goal was to overfit the dataset so that the model would give correct predictions on that same dataset.
But the overfitted model learned only one text and produced exactly the same text for every input video at inference time.
What is the problem?
I assume that the 23-hour pre-trained checkpoint is not a good one and it cannot generalize well to other datasets. Have you tried using the checkpoint pre-trained on the entire dataset for fine-tuning?
Thank you for your awesome project and your reply. I was using the vsr_trlrs2lrs3vox2avsp_base.pth checkpoint.
I found the solution: I have to freeze the encoder to get slightly better results.
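For anyone who lands here with the same problem, freezing the encoder in PyTorch could look roughly like the sketch below. This is not the project's actual code: `TinyVSR`, its `encoder` and `decoder` attributes, the layer sizes, and the learning rate are all hypothetical stand-ins, so the attribute names will need adapting to the real model.

```python
import torch
from torch import nn


class TinyVSR(nn.Module):
    """Hypothetical stand-in for the real model: an encoder plus a decoder head."""

    def __init__(self, feat_dim: int = 16, vocab_size: int = 8):
        super().__init__()
        self.encoder = nn.Sequential(nn.Linear(feat_dim, 32), nn.ReLU())
        self.decoder = nn.Linear(32, vocab_size)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.decoder(self.encoder(x))


def freeze_encoder(model: nn.Module) -> nn.Module:
    """Stop gradient updates for every parameter under `model.encoder`."""
    for param in model.encoder.parameters():
        param.requires_grad = False
    # Put the encoder in eval mode so dropout / batch-norm layers (if any)
    # stop behaving as in training. A later call to model.train() would
    # switch it back, so this may need repeating after that call.
    model.encoder.eval()
    return model


model = freeze_encoder(TinyVSR())

# Hand only the still-trainable (decoder) parameters to the optimizer.
trainable = [p for p in model.parameters() if p.requires_grad]
optimizer = torch.optim.AdamW(trainable, lr=1e-4)  # lr is an arbitrary example value
```

With only 10 short clips, updating the full encoder can easily destroy the pre-trained visual features, which may be why the model collapsed to a single output; keeping the encoder fixed leaves far fewer parameters to fit.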