indonesian ttsダウンロード - indonesian ttsソースコードのダウンロード

indonesian tts

AI ソースコード

v1.2 - 80+ TTS Speakers

ダウンロード

コキTTを使用したインドネシアのTT

モデルは[リリース]タブで使用できます。

商業目的で使用しないでください！

モデルChangelog

v1.2（2022年8月12日）

v1.1モデルからのFinetuned on：

4時間のオーディオブックデータセット
Azure TTSの2000サンプル
Javanese＆Sundaneseの高品質のTTSデータ

v1.1（2022年8月6日）

ljspeechモデルからのFinetuned：

4時間のオーディオブックデータセット
Azure TTSの2000サンプル

v1.0（2022年6月23日）

ゼロから訓練されています：

4時間のオーディオブックデータセット。

例

Ardi (Azure) ：

ardi-zure.mp4

Gadis (Azure) ：

gadis-zure.mp4

Wibowo (Audiobook) ：

wibowo-audiobook.mp4

使い方

グラフェムを音素に変換するには、 g2p-id必要です。

コキTTSのttsコマンドを使用して、音声を合成します。

 tts --text "saja səˈdanʔ ˈbərada di dʒaˈkarta." 
    --model_path checkpoint.pth 
    --config_path config.json 
    --speaker_idx wibowo 
    --out_path output.wav

--list_speaker_idxs ：を使用して、すべてのスピーカーIDXを取得できます。

 tts --model_path checkpoint.pth 
    --config_path config.json 
    --list_speaker_idxs

データ

インドネシアの紺ure TTS

引用

 @misc { https://doi.org/10.48550/arxiv.2106.06103 ,
  doi = { 10.48550/ARXIV.2106.06103 } , 
  url = { https://arxiv.org/abs/2106.06103 } ,
  author = { Kim, Jaehyeon and Kong, Jungil and Son, Juhee } ,
  keywords = { Sound (cs.SD), Audio and Speech Processing (eess.AS), FOS: Computer and information sciences, FOS: Computer and information sciences, FOS: Electrical engineering, electronic engineering, information engineering, FOS: Electrical engineering, electronic engineering, information engineering } ,
  title = { Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech } ,
  publisher = { arXiv } ,
  year = { 2021 } ,
  copyright = { arXiv.org perpetual, non-exclusive license }
}

 @inproceedings { kjartansson-etal-tts-sltu2018 ,
    title = { {A Step-by-Step Process for Building TTS Voices Using Open Source Data and Framework for Bangla, Javanese, Khmer, Nepali, Sinhala, and Sundanese} } ,
    author = { Keshan Sodimana and Knot Pipatsrisawat and Linne Ha and Martin Jansche and Oddur Kjartansson and Pasindu De Silva and Supheakmungkol Sarin } ,
    booktitle = { Proc. The 6th Intl. Workshop on Spoken Language Technologies for Under-Resourced Languages (SLTU) } ,
    year  = { 2018 } ,
    address = { Gurugram, India } ,
    month = aug,
    pages = { 66--70 } ,
    URL   = { http://dx.doi.org/10.21437/SLTU.2018-14 }
}