Indonesian TTS
v1.2 - 80+ TTS Speakers
Models are available in the [Releases] tab.
Do not use for commercial purposes!
Finetuned from the v1.1 model on:
Finetuned from the ljspeech model:
Trained from scratch:
Ardi (Azure) :
Gadis (Azure) :
Wibowo (Audiobook) :
You need g2p-id to convert graphemes to phonemes.
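The conversion step can be sketched in Python as below. This is a minimal sketch, not the project's own code: it assumes the g2p-id package exposes a `G2P` class whose instances can be called on a string and return the phonemized text. The helper names `load_g2p` and `to_phonemes` are hypothetical and exist only for illustration.

```python
from typing import Callable, Optional


def load_g2p() -> Callable[[str], str]:
    """Create a grapheme-to-phoneme converter.

    Assumption: g2p-id is installed and provides `G2P`, whose instances
    are callable on a plain string. The import is done lazily so this
    module can be loaded even when g2p-id is missing.
    """
    from g2p_id import G2P  # assumed import path for the g2p-id package

    return G2P()


def to_phonemes(text: str, g2p: Optional[Callable[[str], str]] = None) -> str:
    """Convert Indonesian text to phonemes with the given converter.

    If no converter is passed, one is created with load_g2p().
    """
    converter = g2p if g2p is not None else load_g2p()
    return converter(text)


# Example (requires g2p-id to be installed):
# print(to_phonemes("Saya sedang berada di Jakarta."))
```

The resulting phoneme string is what gets passed to `--text` when synthesizing.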
Use the tts command from Coqui TTS to synthesize speech.
tts --text "saja səˈdanʔ ˈbərada di dʒaˈkarta." \
    --model_path checkpoint.pth \
    --config_path config.json \
    --speaker_idx wibowo \
    --out_path output.wav
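If you want to drive the same command from a script, the following sketch may help. It only wraps the flags shown above (`--text`, `--model_path`, `--config_path`, `--speaker_idx`, `--out_path`) and assumes the `tts` executable from Coqui TTS is on your `PATH`. The function names `build_tts_command` and `synthesize` are hypothetical.

```python
import subprocess
from typing import List


def build_tts_command(
    text: str,
    model_path: str,
    config_path: str,
    speaker_idx: str,
    out_path: str,
) -> List[str]:
    """Build the argument list for the `tts` command shown above.

    Passing a list (not a single shell string) avoids quoting problems
    with the IPA characters in the phonemized text.
    """
    return [
        "tts",
        "--text", text,
        "--model_path", model_path,
        "--config_path", config_path,
        "--speaker_idx", speaker_idx,
        "--out_path", out_path,
    ]


def synthesize(text: str, speaker_idx: str, out_path: str) -> None:
    """Run `tts` for one utterance; raises CalledProcessError on failure.

    The checkpoint and config file names match the example command and
    are assumed to be in the current directory.
    """
    cmd = build_tts_command(
        text, "checkpoint.pth", "config.json", speaker_idx, out_path
    )
    subprocess.run(cmd, check=True)


# Example (requires Coqui TTS and the downloaded model files):
# synthesize("saja səˈdanʔ ˈbərada di dʒaˈkarta.", "wibowo", "output.wav")
```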
You can list all available speaker idxs with --list_speaker_idxs:
tts --model_path checkpoint.pth \
    --config_path config.json \
    --list_speaker_idxs
@misc { https://doi.org/10.48550/arxiv.2106.06103 ,
doi = { 10.48550/ARXIV.2106.06103 } ,
url = { https://arxiv.org/abs/2106.06103 } ,
author = { Kim, Jaehyeon and Kong, Jungil and Son, Juhee } ,
keywords = { Sound (cs.SD), Audio and Speech Processing (eess.AS), FOS: Computer and information sciences, FOS: Electrical engineering, electronic engineering, information engineering } ,
title = { Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech } ,
publisher = { arXiv } ,
year = { 2021 } ,
copyright = { arXiv.org perpetual, non-exclusive license }
}
@inproceedings { kjartansson-etal-tts-sltu2018 ,
title = { {A Step-by-Step Process for Building TTS Voices Using Open Source Data and Framework for Bangla, Javanese, Khmer, Nepali, Sinhala, and Sundanese} } ,
author = { Keshan Sodimana and Knot Pipatsrisawat and Linne Ha and Martin Jansche and Oddur Kjartansson and Pasindu De Silva and Supheakmungkol Sarin } ,
booktitle = { Proc. The 6th Intl. Workshop on Spoken Language Technologies for Under-Resourced Languages (SLTU) } ,
year = { 2018 } ,
address = { Gurugram, India } ,
month = aug,
pages = { 66--70 } ,
URL = { http://dx.doi.org/10.21437/SLTU.2018-14 }
}