You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
When running phonetisaurus-apply, I sometimes get the warning "No pronunciation for word: ...". These warnings are very rare (in my case, about one word in 3,000) and usually indicate some spelling error in the word.
What baffles me, however, is that the output does include these words, along with (mostly) plausible pronunciations.
So I wonder:
Are there scenarios where Phonetisaurus actually won't generate a pronunciation for a given word? (This is important to me because my pipeline requires a pronunciation for each word.)
Might it make sense to change the wording of the warning to something more nuanced?
The text was updated successfully, but these errors were encountered:
Just to make sure you're on the right track: phonetisaurus-apply is a python script which runs phonetisaurus-g2pfst in the background. So the functioning can be affected by both of these programs.
I recommend you try phonetisaurus-g2pfst alone as well to see what are the differences in the output.
The phonetisaurus-apply adds a bit of more functionality in that it allows to use a predefined lexicon in parallel to the FST model. This means that the program first looks in your lexicon and uses the model only if the word is not in the lexicon already. This has two benefits:
it speeds up processing as it serves as a kind of "cache" for already processed words
it allows you to define manual exceptions to how the model works
Apart from that, not sure what the problem could be. In my experience, if there is a word that cannot be processed, it won't. Maybe you have some duplicate words with minor differences? Maybe its a matter of hidden characters (unicode can be quite a minefield)? You'd have to make a minimal example to be sure. Next time you get this output, try and extract one of these words to a separate file and see how it will work alone.
When running
phonetisaurus-apply
, I sometimes get the warning "No pronunciation for word: ...". These warnings are very rare (in my case, about one word in 3,000) and usually indicate some spelling error in the word.What baffles me, however, is that the output does include these words, along with (mostly) plausible pronunciations.
So I wonder:
The text was updated successfully, but these errors were encountered: