voicefixer_main 다운로드 - voicefixer_main 소스 코드 다운로드

voicefixer_main

AI 소스 코드

1.0.0

다운로드

2021-11-06 : 이해하기 쉽도록 코드 구조를 업데이트했습니다. 지금은 잠재적 인 버그가있을 수 있습니다. 나중에 시험 훈련을하겠습니다.

~~2021-11-01 : 코드를 업데이트하고 나중에 더 쉽게 사용할 수 있도록하겠습니다.~~

VoiceFixer

VoiceFixer는 일반적인 음성 복원을위한 프레임 워크입니다. 우리는 심하게 타락한 연설과 역사적 연설의 회복을 목표로합니다.

VoiceFixer
- 재료
- 용법
  - 환경 (처음에는이 일)
  - 일반적인 음성 복원을위한 Voicefixer
  - 일반적인 음성 복원을위한 Resunet
  - 단일 작업 음성 복원을위한 Resunet
- 소환

재료

Arxiv preprint : https://arxiv.org/abs/2109.13731
데모 페이지에는 단일 작업 음성 복원, 일반적인 음성 복원 및 VoiceFixer의 비교가 포함되어 있습니다.
우리는 VoiceFixer를위한 PIP 패키지를 썼습니다.
이 repo에서 사용하는 데이터 세트 : 교육 및 테스트 데이터 세트

용법

환경 (처음에는이 일)

 # Download dataset and prepare running environment
git clone https://github.com/haoheliu/voicefixer_main.git
cd voicefixer_main
source init.sh

일반적인 음성 복원을위한 Voicefixer

여기서 우리는 VF_UNET (분석 모듈로 UNET를 가진 VoiceFixer)를 예로 들어 본다.

훈련

 # pass in a configuration file to the training script
python3 train_gsr_voicefixer.py -c config/vctk_base_voicefixer_unet.json # you can modify the configuration file to personalize your training

체크 포인트, 로깅 및 유효성 검사 결과에 대해 Logs 디렉토리를 확인할 수 있습니다.

평가

모든 테스트 세트에서 자동 평가 및 .CSV 파일 생성.

예를 들어, 모든 테스트 세트 (기본값)를 평가하려는 경우

python3 eval_gsr_voicefixer.py  
                    --config  < path-to-the-config-file > 
                    --ckpt  < path-to-the-checkpoint >

예를 들어, GSR 테스트 세트를 평가하려는 경우

python3 eval_gsr_voicefixer.py  
                    --config  < path-to-the-config-file > 
                    --ckpt  < path-to-the-checkpoint > 
                    --testset  general_speech_restoration  
                    --description  general_speech_restoration_eval

일반적으로 -테스트 세트 로 전달할 수있는 7 개의 테스트 세트가 있습니다.

기본 : 모든 테스트 세트
클립 : 클리핑 임계 값이 0.1, 0.25 및 0.5 인 음성으로 테스트 세트
리버브 : 반향 연설로 테스트 세트
general_speech_Restoration : 모든 종류의 임의 왜곡이 포함 된 음성 테스트 세트
향상 : 시끄러운 말을 가진 테스트 세트
speech_super_resolution : 샘플링 속도가 2kHz, 4kHz, 8kHz, 16kHz 및 24kHz 인 저해상도가 낮은 테스트 세트.

그리고 소수의 데이터에 대해 평가하고 싶다면 10 발언. 숫자를 ---limit_numbers 인수로 전달할 수 있습니다.

python3 eval_gsr_voicefixer.py  
                    --config  < path-to-the-config-file > 
                    --ckpt  < path-to-the-checkpoint > 
                    --limit_numbers 10

평가 결과는 Exp_Results 폴더에 제시됩니다.

일반적인 음성 복원을위한 Resunet

훈련

 # pass in a configuration file to the training script
python3 train_gsr_voicefixer.py -c config/vctk_base_voicefixer_unet.json

체크 포인트, 로깅 및 유효성 검사 결과에 대해 Logs 디렉토리를 확인할 수 있습니다.

평가 (VoiceFixer 평가와 유사)

python3 eval_ssr_unet.py  
                    --config  < path-to-the-config-file > 
                    --ckpt  < path-to-the-checkpoint > 
                    --limit_numbers < int-test-only-on-a-few-utterance > 
                    --testset  < the-testset-you-want-to-use >  
                    --description  < describe-this-test >

단일 작업 음성 복원을위한 Resunet

훈련

비난

 # pass in a configuration file to the training script
python3 train_ssr_unet.py -c config/vctk_base_ssr_unet_denoising.json

Dereverberation

 # pass in a configuration file to the training script
python3 train_ssr_unet.py -c config/vctk_base_ssr_unet_dereverberation.json

슈퍼 해상도

 # pass in a configuration file to the training script
python3 train_ssr_unet.py -c config/vctk_base_ssr_unet_super_resolution.json

거절

 # pass in a configuration file to the training script
python3 train_ssr_unet.py -c config/vctk_base_ssr_unet_declipping.json

체크 포인트, 로깅 및 유효성 검사 결과에 대해 Logs 디렉토리를 확인할 수 있습니다.

평가 (VoiceFixer 평가와 유사)

python3 eval_ssr_unet.py  
                    --config  < path-to-the-config-file > 
                    --ckpt  < path-to-the-checkpoint > 
                    --limit_numbers < int-test-only-on-a-few-utterance > 
                    --testset  < the-testset-you-want-to-use >  
                    --description  < describe-this-test >

소환

 @misc { liu2021voicefixer ,   
     title = { VoiceFixer: Toward General Speech Restoration With Neural Vocoder } ,   
     author = { Haohe Liu and Qiuqiang Kong and Qiao Tian and Yan Zhao and DeLiang Wang and Chuanzeng Huang and Yuxuan Wang } ,  
     year = { 2021 } ,  
     eprint = { 2109.13731 } ,  
     archivePrefix = { arXiv } ,  
     primaryClass = { cs.SD }  
 }