Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

jittery results #1

Open
jnnyii opened this issue May 13, 2024 · 2 comments
Open

jittery results #1

jnnyii opened this issue May 13, 2024 · 2 comments

Comments

@jnnyii
Copy link

jnnyii commented May 13, 2024

I plugged in a new dataset, after 1800 epochs, I see that semantically, the generation appears to follow the text conditioning, but the poses are too jittery (please see attachment). Could you maybe point out what's wrong?

text: straightening up
https://github.com/dongzhuoyao/motionfm/assets/169649811/b4fa0242-97d5-4080-827f-3578c3cd0d84

thanks!

@dongzhuoyao
Copy link
Owner

Hi, thanks for your interest to our work, could you elaborate how large your dataset is, waht your text encoder is, and how large the network is? what's your sampler and sampling steps?

@jnnyii
Copy link
Author

jnnyii commented May 14, 2024

Thank you for your response. I have a dataset that contains 600 sequences with a total of 34000 frames. My step size is 1 and if the sequence length is larger than the maximum number of frames, I randomly select a start index. I am using the default text encoder in the framework, i.e. CLIP. Do you think the have too little data? I don't observe this problem when I train using the diffusion model.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants