This project is implemented in Korean Multi-Speaker TTS by combining HiFi-Gan Vocoder with FastSpeech2.
This project aims to develop the TTS of the 'visible personalized AI speaker' project . It is replaced by the voices of the people around you want, rather than the voices of 'Siri', 'Bixby' and 'Ari'. (EX. Spouse, son, daughter, parents, etc.)
In order to cope with the immediate production of AI speakers, instead of excellent performance of Tacotron2 and WaveGlow, Non-AUTOREGRESSITIVE MOSTSPEECH2 and GAN-based Vocoder Model HIFI-GAN adopted both quality and production speed Consider it.
Based on the FastSpeech2 source code that corresponds to the Korean dataset KSS implemented in DLLAB.
The added content in the code used is as follows.
Speaker Embedding Implementation (Korean Multi-Speaker FastSpeech2)
Data_Preprocessing.py-End-to-end data pretreatment implementation containing all of the items below

Response to unstable synthesis of long sentences

Importing G2PK source code and applying only numbers and English
Save the WAV directory and json or transcript file in the DataSet/Data Name Directory as shown.
Learn Montral Forced Alinger in KALDI to learn textgrid by learning audio data.
# lab 생성, mfa 학습, lab 분리
python data_preprocessing.py
Save the generator learned by HiFi-Gan for evaluation during learning in the Vocoder/Pretained_models directory.
Set up the path of hparam.py's batch size, hiFi-gan generator and start learning.
python train.py
If you are studying, you can learn by adding RESTORE_STEP.
python train.py --restore_step [step]
If you perform pre-train for multi-speaker, storage speaker_info.json automatically generated during pre-train learning
Put Speaker_info.json at the top of the directory
Run Python in the same way as performing a study in Train
python train.py --restore_step [pre-train의 step]
python synthesize.py --step [step수]
This pipeline is a Flow pipeline for the TTS learning and creation that corresponds to the service.

The containers are largely classified as four.
In the actual service situation, only three containers will work.