If unicode and text together are sent for translation, the model outputs junk which contains variants of 'Narendra Modi' #44

Open
Sri20021 opened this issue Aug 4, 2022 · 1 comment

Comments

@Sri20021

Sri20021 commented Aug 4, 2022

The input list is "chk_list = ['hello', '\ue806x it tomorrow it doesn’t matter how well you have scheduled']". When it is translated to Kannada, the returned list is ['ನಮಸ್ಕಾರ.', 'narendramodi narendramodi syndramodi narendramodi syndramodi narendramodi syndramodi narendramodi syndramodi narendramodi narendramodi syndramodi syndramodi syndramodi syndramodi syndramodi syndramodi narendramodi narendramodi narendramodi narendramodi narendramodi narendramodi narendramodi']. The first item translates correctly, but the second item, which starts with the Unicode character \ue806, comes back as junk. Why is the output junk, and why does it contain variants of "Narendra Modi"?
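For anyone reproducing this: \ue806 (U+E806) falls in the Unicode Private Use Area, so it has no standard meaning. Whether that character is what triggers the junk output has not been established here. The sketch below only shows one way to strip such code points from the input before translation, using Python's standard `unicodedata` module. The helper name `strip_private_use` is hypothetical and is not part of the translation library.

```python
import unicodedata


def strip_private_use(text: str) -> str:
    """Drop code points whose Unicode general category is 'Co' (private use)."""
    return "".join(ch for ch in text if unicodedata.category(ch) != "Co")


# Same input as in the report above.
chk_list = ['hello', '\ue806x it tomorrow it doesn’t matter how well you have scheduled']

# Hypothetical pre-processing step: clean each string before sending it to the model.
cleaned = [strip_private_use(s) for s in chk_list]
```

Comparing the model output for `chk_list` and `cleaned` would show whether the private-use character is the trigger.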

@sumanthd17
Member

We are equally surprised. We'll look into this and get back to you. I'll keep the issue open for now; please post any more examples where the model fails. It would be good to test the limits of the model.
