ForwardTacotron NVDA
1.0.0
Note: This add-on as well as the documentation is still under construction. Your contributions are welcome!
Remember that ForwardTacotron is a speech synthesis model in pytorch that uses a duration predictor to align text and generated mel spectrograms. The model has advantages, such as robustness, speed, pitch and energy manipulation, and efficiency.
So, this plugin is an attempt to implement support for ForwardTacotron in NVDA's open source screen reader via client/server, because the libraries used as torch are not possible to include in NVDA directly.
This is a work in progress and therefore there is still a lot to do.
In the meantime, you can listen to the progress that has been made so far.
| Language | Voice | Sample |
|---|---|---|
| English | LJSpeech (with griffinLim vocoder) | |
| English | LJSpeech (with HiFi-GAN vocoder) | |
| Spanish | Ald Dataset (with HiFi-GAN vocoder) | |
| Spanish | Odal (with HiFi-GAN vocoder, universal model) |