NATSpeech 다운로드 - NATSpeech 소스 코드 다운로드

NATSpeech

AI 소스 코드

v0.1

다운로드

natspeech : 비유로가 아닌 텍스트 음성 연설 프레임 워크

| | 中文文档

이 repo에는 공식적인 Pytorch 구현이 포함됩니다.

portaspeech : 휴대용 및 고품질 생성 텍스트 음성 (Neurips 2021)
데모 페이지 | 포옹? 데모
Diffsinger : 얕은 확산 메커니즘을 통한 노래 음성 합성 (diffspeech) (AAAI 2022)
데모 페이지 | 프로젝트 페이지 | 포옹? 데모

주요 기능

이 프레임 워크에서 다음과 같은 기능을 구현합니다.

몬트리올 강제 조정기를 사용한 비 유적지가없는 텍스트 음성 연설에 대한 데이터 처리.
교육 및 추론을위한 편리하고 확장 가능한 프레임 워크.
간단하지만 효율적인 랜덤 액세스 데이터 세트 구현.

종속성을 설치하십시오

 # # We tested on Linux/Ubuntu 18.04. 
# # Install Python 3.6+ first (Anaconda recommended).

export PYTHONPATH=.
# build a virtual env (recommended).
python -m venv venv
source venv/bin/activate
# install requirements.
pip install -U pip
pip install Cython numpy==1.19.1
pip install torch==1.9.0 # torch >= 1.9.0 recommended
pip install -r requirements.txt
sudo apt install -y sox libsox-fmt-mp3
bash mfa_usr/install_mfa.sh # install forced alignment tool

서류

프레임 워크에 대해
portaspeech를 실행하십시오
diffspeech를 실행하십시오

소환

이것이 귀하의 연구에 유용하다면 다음과 같은 논문을 인용하십시오.

portaspeech

 @article { ren2021portaspeech ,
  title = { PortaSpeech: Portable and High-Quality Generative Text-to-Speech } ,
  author = { Ren, Yi and Liu, Jinglin and Zhao, Zhou } ,
  journal = { Advances in Neural Information Processing Systems } ,
  volume = { 34 } ,
  year = { 2021 }
}

diffspeech

 @article { liu2021diffsinger ,
  title = { Diffsinger: Singing voice synthesis via shallow diffusion mechanism } ,
  author = { Liu, Jinglin and Li, Chengxi and Ren, Yi and Chen, Feiyang and Liu, Peng and Zhao, Zhou } ,
  journal = { arXiv preprint arXiv:2105.02446 } ,
  volume = { 2 } ,
  year = { 2021 }
 }