Warning: No pronunciation for word #71

DanielSWolf · 2022-01-05T08:36:58Z

When running phonetisaurus-apply, I sometimes get the warning "No pronunciation for word: ...". These warnings are very rare (in my case, about one word in 3,000) and usually indicate some spelling error in the word.

What baffles me, however, is that the output does include these words, along with (mostly) plausible pronunciations.

So I wonder:

Are there scenarios where Phonetisaurus actually won't generate a pronunciation for a given word? (This is important to me because my pipeline requires a pronunciation for each word.)
Might it make sense to change the wording of the warning to something more nuanced?

The text was updated successfully, but these errors were encountered:

DanielSWolf · 2022-01-05T08:38:31Z

BTW, I'm using the Docker version of Phonetisaurus.

danijel3 · 2022-10-04T12:26:05Z

Just to make sure you're on the right track: phonetisaurus-apply is a python script which runs phonetisaurus-g2pfst in the background. So the functioning can be affected by both of these programs.

I recommend you try phonetisaurus-g2pfst alone as well to see what are the differences in the output.

The phonetisaurus-apply adds a bit of more functionality in that it allows to use a predefined lexicon in parallel to the FST model. This means that the program first looks in your lexicon and uses the model only if the word is not in the lexicon already. This has two benefits:

it speeds up processing as it serves as a kind of "cache" for already processed words
it allows you to define manual exceptions to how the model works

Apart from that, not sure what the problem could be. In my experience, if there is a word that cannot be processed, it won't. Maybe you have some duplicate words with minor differences? Maybe its a matter of hidden characters (unicode can be quite a minefield)? You'd have to make a minimal example to be sure. Next time you get this output, try and extract one of these words to a separate file and see how it will work alone.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Warning: No pronunciation for word #71

Warning: No pronunciation for word #71

DanielSWolf commented Jan 5, 2022

DanielSWolf commented Jan 5, 2022

danijel3 commented Oct 4, 2022

Warning: No pronunciation for word #71

Warning: No pronunciation for word #71

Comments

DanielSWolf commented Jan 5, 2022

DanielSWolf commented Jan 5, 2022

danijel3 commented Oct 4, 2022