Pre-processing Russian text for the AwesomeTTS add-on in Anki

March 21, 2021

Anki

The Anki add-on AwesomeTTS has been a vital tool for language learners using the Anki application on the desktop. It allows you to have elements of the card read aloud using text-to-speech capabilities. The new developer of the add-on has added a number of voice options, including the Microsoft Azure voices. The neural voices for Russian are quite good. But they have one major issue, syllabic stress marks that are sometimes seen in text intended for language learners cause the Microsoft Azure voices to grossly mispronounce the word.

For example, the sentence Цензу́ра потака́ет извращённому уму́ больше, чем сло́во из трёх букв само́ по себе. is incorrectly pronounced because it has stress marks and because the letter ë is correctly displayed. Apparently Microsoft Azure doesn’t like it when ë is correctly rendered.

Fortunately, there’s a text pre-processor built into the plugin. Here is how to use this pre-processor to change the text that’s fed to the TTS provider:

In the add-on configuration view, navigate to Text → Advanced and add two rules:

The first rule basically just strips the Unicode accent grave character (U+0301) from any vowels.
The second rule transliterates ë to e . Yes, it offends me to do this because they are two different letters in the Russian alphabet, but it appears that the models were trained on text written in the way that’s often encountered, unfortunately.

With those two rules in place, the pronounciations going to Microsoft Azure should be correct.