etos tts下載etos tts源代碼下載

etos tts

Ai源碼

1.0.0

下載

ETOS TTS

ETOS TTS旨在建立一個神經文本到語音（TTS），該語音能夠在野外採樣的聲音中轉換為語音。這是Tacotron的Pytorch實現：完全端到端的文本到語音合成模型。

用法

要求

Python 3.6或更高版本
Pytorch 0.4測試
對於Ubuntu， sudo apt install libsndfile1

您可以使用PIP安裝其他要求。

pip3 install -r requirements.txt

測試

您可以在models/may22下使用預據的模型並運行TTS Web服務器：

python server.py -c server_conf.json

然後訪問http://127.0.0.1:8000 ，然後享受。

數據

當前TTS為數據加載程序提供

LJ演講

培訓網絡

要運行自己的培訓，您需要定義config.json文件（下面的簡單模板），然後使用命令致電。

train.py --config_path config.json

如果您想使用特定的GPU。

CUDA_VISIBLE_DEVICES="0,1,4" train.py --config_path config.json

每次運行都會在您在config.json中設置的文件夾下創建一個具有相關日期和時間的實驗文件夾。而且，如果該文件夾尚無檢查站，則按CTRL+C時將其刪除。

如果您指出的話--logdir實驗文件夾，您也可以享受帶有幾個良好訓練日誌的良好訓練日誌的張板。

示例config.json ：

 {
  "num_mels": 80,
  "num_freq": 1025,
  "sample_rate": 22050,
  "frame_length_ms": 50,
  "frame_shift_ms": 12.5,
  "preemphasis": 0.97,
  "min_level_db": -100,
  "ref_level_db": 20,
  "embedding_size": 256,
  "text_cleaner": "english_cleaners",

  "epochs": 200,
  "lr": 0.002,
  "warmup_steps": 4000,
  "batch_size": 32,
  "eval_batch_size":32,
  "r": 5,
  "mk": 0.0,  // guidede attention loss weight. if 0 no use
  "priority_freq": true,  // freq range emphasis

  "griffin_lim_iters": 60,
  "power": 1.2,

  "dataset": "TWEB",
  "meta_file_train": "transcript_train.txt",
  "meta_file_val": "transcript_val.txt",
  "data_path": "/data/shared/BibleSpeech/",
  "min_seq_len": 0,
  "num_loader_workers": 8,

  "checkpoint": true,  // if save checkpoint per save_step
  "save_step": 200,
  "output_path": "/path/to/my_experiment",
}