German Tacotron 2 and Multi-band MelGAN in TensorFlow with TF Lite inference support
Google supported this work by providing Google Cloud credit. Thank you Google for supporting the open source! ?
I am releasing pretrained German neural text-to-speech (TTS) models Tacotron 2 and Multi-band MelGAN. It supports inference with saved_model and TF Lite formats, and all the models can be found on TensorFlow Hub.
Say hello in Discussions if you find it useful for anything.
inference.py to infer with saved_model.inference_tflite.py to infer with TF Lite.e2e-notebook.ipynb to check how I exported to these model formats.I trained these models on Thorsten dataset by Thorsten Müller. It is licensed under the terms of Creative Commons Zero V1 Universal (CC0), which is used to opt out of copyright entirely and ensure that the work has the widest reach. Thanks @thorstenMueller for such a great contribution to the community.
Some good guys are doing a great job at tensorspeech/TensorFlowTTS, which was already supporting TTS in English, Chinese and Korean. I wanted to contribute with support for German and trained these models. Now it supports both training and inference with proper processors. A detailed blog post will follow up, but some quick notes for now:
Tacotron 2 produces some noise at the end, and you need to cut it off. Again, inference examples show how to do this.Multi-band MelGAN to TF Lite without optimizations because it produced some background noise when I exported with the default ones. I used default optimizations in Tacotron 2.saved_model formats that I am releasing here are not suitable for finetuning. Architecture implementation uses Subclassing API in TensorFlow 2.x and gets multiple inputs in call method for teacher forcing during training. This caused some problems when exporting to saved_model and I had to remove this logic before exporting. If you want to finetune models, please see my fork of TensorFlowTTS.You can use these pretrained model artifacts and code examples under the terms of Apache 2.0 license. On the other hand, you may want to contact me for paid consultancies and/or collaborations in speech and/or NLP projects at the email address shown on my profile.