ForwardTacotron NVDA Download - ForwardTacotron NVDA Source code download

ForwardTacotron NVDA

AI Source Code

1.0.0

Download

ForwardTacotron and HiFi-GAN support for NVDA Screen reader

Note: This add-on as well as the documentation is still under construction. Your contributions are welcome!

introduction

Remember that ForwardTacotron is a speech synthesis model in pytorch that uses a duration predictor to align text and generated mel spectrograms. The model has advantages, such as robustness, speed, pitch and energy manipulation, and efficiency.

So, this plugin is an attempt to implement support for ForwardTacotron in NVDA's open source screen reader via client/server, because the libraries used as torch are not possible to include in NVDA directly.

This is a work in progress and therefore there is still a lot to do.

In the meantime, you can listen to the progress that has been made so far.

audio samples

Language	Voice	Sample
English	LJSpeech (with griffinLim vocoder)
English	LJSpeech (with HiFi-GAN vocoder)
Spanish	Ald Dataset (with HiFi-GAN vocoder)
Spanish	Odal (with HiFi-GAN vocoder, universal model)

to do:

A way to compile and integrate the server to the add-on.
- When this happens, allow the server to open when the synth is loaded. Once the server loads, we can call check to make the speech synthesizer ready for use.
- Two versions could be made for the add-on, with CPU support and one with GPU support, since apparently the synthesis is generated in real time on a GPU. In the meantime, we may notice slowdowns in the CPU.
Voice and energy change support in synth ring options.
At the moment the add-on uses httplib2 to communicate with the server, but I could look for other methods and if necessary rewrite a part of the server.
Add support for loading different voices that could be detected within a "voice_models" folder.
- With this, a support for downloading trained models could be added. We have a ljspeech model in English, another in German and two in Spanish.
For newer multi-speaker models, it can read the settings to check, and if so, it can choose the voice from the synth ring options with first consult the speaker names on the model.

Expand

Additional Information

Version 1.0.0
Type AI Source Code
Update Time 2025-08-23
size 2.43MB
From Github

Related Applications

GitHub sgrebnov/cordova plugin background download

2024-11-05
Wa ch ull navra maza navsacha 2 2024 ull ovie Fr e Online On Strea ings

2024-11-03
Wa ch navra maza navsacha 2 2024 ull ovie Online For Fr e Strea ings At Home

2024-11-03
Wa ch the greatest of all time 2024 ull ovie Online For Fr e Strea ings At Home

2024-11-02
wolfs 2024 f llmo ie f lmyz lla dow load ree 7 0p 4 0p a d 10 0p

2024-11-01
GitHub the via/releases

2024-11-01

Recommended for You

chat.petals.dev

Other source code

1.0.0
GPT Prompt Templates

Other source code

1.0.0
GPTyped

Other source code

GPTyped 1.0.5
ML stack

AI Source Code

1.0.0
awesome free chatgpt

AI Source Code

1.0.0
pywin_contextmenu

AI Source Code

Version update
Google Dorks

Other source code

1.0
shepherd

Other source code

v6.1.6-react-shepherd: Prepare Release (#3063)
mongo express

Other source code

v1.1.0-rc-3

Related Information All