NATSpeech
v0.1

Natspeech:非自動回報的文本到語音框架
| | 中文文檔
該回購包含官方的Pytorch實施:
我們在此框架中實現以下功能:
# # We tested on Linux/Ubuntu 18.04.
# # Install Python 3.6+ first (Anaconda recommended).
export PYTHONPATH=.
# build a virtual env (recommended).
python -m venv venv
source venv/bin/activate
# install requirements.
pip install -U pip
pip install Cython numpy==1.19.1
pip install torch==1.9.0 # torch >= 1.9.0 recommended
pip install -r requirements.txt
sudo apt install -y sox libsox-fmt-mp3
bash mfa_usr/install_mfa.sh # install forced alignment tool 如果您發現這對您的研究有用,請引用以下論文:
@article { ren2021portaspeech ,
title = { PortaSpeech: Portable and High-Quality Generative Text-to-Speech } ,
author = { Ren, Yi and Liu, Jinglin and Zhao, Zhou } ,
journal = { Advances in Neural Information Processing Systems } ,
volume = { 34 } ,
year = { 2021 }
} @article { liu2021diffsinger ,
title = { Diffsinger: Singing voice synthesis via shallow diffusion mechanism } ,
author = { Liu, Jinglin and Li, Chengxi and Ren, Yi and Chen, Feiyang and Liu, Peng and Zhao, Zhou } ,
journal = { arXiv preprint arXiv:2105.02446 } ,
volume = { 2 } ,
year = { 2021 }
}我們的代碼受到以下存儲庫的影響:
禁止任何組織或個人使用本文中提到的任何技術未經他/她的同意,包括但不限於政府領導人,政治人物和名人。如果您不遵守此項目,則可能違反版權法。