pytorch dc tts
1.0.0
PyTorch implementation of Efficiently Trainable Text-to-Speech System Based on Deep Convolutional Networks with Guided Attention based partially on the following projects:
The following notebooks are executable on https://colab.research.google.com :
For audio samples and pretrained models, visit the above notebook links.
The English TTS uses the LJ-Speech dataset.
python dl_and_preprop_dataset.py --dataset=ljspeechpython train-text2mel.py --dataset=ljspeechpython train-ssrn.py --dataset=ljspeechpython synthesize.py --dataset=ljspeech
samples folder.The Mongolian text-to-speech uses 5 hours audio from the Mongolian Bible.
python dl_and_preprop_dataset.py --dataset=mbspeechpython train-text2mel.py --dataset=mbspeechpython train-ssrn.py --dataset=mbspeechpython synthesize.py --dataset=mbspeech
samples folder.