Want to train VSR model for digit recognition using grid dataset. #28

Open
Chhaya-Trootech opened this issue Mar 13, 2024 · 1 comment

Comments

@Chhaya-Trootech

Thank you for sharing the code.

I am interested in training a VSR (lip-reading) model for digit recognition on the GRID dataset, because the pre-trained weights do not work well for digit recognition. This repo explains how to train on LRS2 and LRS3. How can I use it to train a model on GRID?

@mpc001
Owner

mpc001 commented Jan 6, 2025

Hi @Chhaya-Trootech, I recommend building your own vocabulary list (either subword-based or character-based) from the GRID transcripts. Example instructions can be found at https://github.com/mpc001/auto_avsr/blob/main/INSTRUCTION.md. Then fine-tune on GRID with the customized vocabulary list, starting from the provided pre-trained checkpoint. It is also important to use the same customized dictionary at the inference stage in place of the default one; see https://github.com/mpc001/auto_avsr/blob/main/datamodule/transforms.py#L20-L32
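
For the character-based option, a minimal sketch of collecting the units from GRID-style transcripts could look like the following. Everything here is an assumption for illustration: the helper names `build_char_vocab` and `format_units`, the `|` word-boundary symbol, the `<unk>` entry, and the one `token index` pair per line layout are hypothetical and are not taken from the repo. Check INSTRUCTION.md and the linked lines in `datamodule/transforms.py` for the format the code actually expects.

```python
from collections import Counter


def build_char_vocab(transcripts, space_symbol="|"):
    """Collect the distinct characters used in the transcripts.

    Spaces are mapped to `space_symbol` (a hypothetical choice) so that
    word boundaries survive as an explicit unit.
    """
    counts = Counter()
    for text in transcripts:
        for ch in text.strip().lower():
            counts[space_symbol if ch == " " else ch] += 1
    # Sort so the unit order is reproducible across runs.
    return sorted(counts)


def format_units(units, unk="<unk>"):
    """Render the units as 'token index' lines (assumed layout)."""
    lines = [f"{unk} 1"]
    lines += [f"{unit} {idx}" for idx, unit in enumerate(units, start=2)]
    return "\n".join(lines) + "\n"


# Two illustrative GRID-style sentences; in practice, pass every
# transcript from the training split.
transcripts = [
    "bin blue at f two now",
    "set red by q nine soon",
]

units = build_char_vocab(transcripts)
units_text = format_units(units)
print(units_text)
```

The resulting text can be written to a file and used for both fine-tuning and inference, so that the two stages share the same token-to-index mapping.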
