parrots

Version: 1.0.1

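Parrots: ASR and TTS toolkit

Introduction

Parrots is an automatic speech recognition (ASR) and text-to-speech (TTS) toolkit that supports Chinese, English, Japanese, and other languages.

Parrots lets you call speech recognition and speech synthesis models with a single call. It works out of the box and supports Chinese and English.

Features

ASR: a Chinese speech recognition (ASR) model based on distilwhisper, with support for multiple languages such as Chinese and English.
TTS: a speech synthesis (TTS) model trained with GPT-SoVITS, with support for Chinese, English, Japanese, and other languages.

Installation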

pip install torch # or conda install pytorch
pip install -r requirements.txt
pip install parrots

or

pip install torch # or conda install pytorch
git clone https://github.com/shibing624/parrots.git
cd parrots
python setup.py install
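
After either install route, a quick import check can confirm that the packages are visible to the interpreter. The following is a minimal sketch using only the standard library; the helper name `is_installed` is hypothetical and not part of parrots.

```python
import importlib.util


def is_installed(module_name: str) -> bool:
    """Return True if `module_name` can be located on the current import path."""
    # find_spec returns None for a top-level module that cannot be found.
    return importlib.util.find_spec(module_name) is not None


if __name__ == "__main__":
    # torch and parrots are the two packages installed by the commands above.
    for name in ("torch", "parrots"):
        status = "found" if is_installed(name) else "missing"
        print(f"{name}: {status}")
```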

Demo

Official demo: https://www.mulanai.com/product/tts/
Hugging Face demo: https://huggingface.co/spaces/shibing624/parrots

Run examples/tts_gradio_demo.py to see the demo:

python examples/tts_gradio_demo.py

Usage

ASR (speech recognition)

Example: examples/demo_asr.py

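import os
import sys

# Allow importing parrots from a source checkout when run inside examples/
sys.path.append('..')
from parrots import SpeechRecognition

# Directory that contains this script and the sample audio file
pwd_path = os.path.abspath(os.path.dirname(__file__))

if __name__ == '__main__':
    m = SpeechRecognition()
    r = m.recognize_speech_from_file(os.path.join(pwd_path, 'tushuguan.wav'))
    # Prints "[Hint] Speech recognition result:" followed by the result
    print('[提示] 语音识别结果：', r)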

Output:

{'text': '北京图书馆'}
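
Since the result is a dict with a 'text' key, transcribing several files reduces to a loop over `recognize_speech_from_file`. The sketch below assumes only what the example above shows; the helper names `list_wav_files`, `transcribe_files`, and `main` are hypothetical and not part of parrots, and the recognizer is passed in so the helpers can be exercised without loading a model.

```python
import os
from typing import Dict, List


def list_wav_files(folder: str) -> List[str]:
    """Return the sorted paths of all .wav files directly inside `folder`."""
    return sorted(
        os.path.join(folder, name)
        for name in os.listdir(folder)
        if name.lower().endswith(".wav")
    )


def transcribe_files(recognizer, paths: List[str]) -> Dict[str, str]:
    """Map each audio path to its recognized text.

    `recognizer` is any object exposing recognize_speech_from_file(path)
    that returns a dict with a 'text' key, as in the output shown above.
    """
    results = {}
    for path in paths:
        result = recognizer.recognize_speech_from_file(path)
        # Assumption: fall back to an empty string if 'text' is absent.
        results[path] = result.get("text", "")
    return results


def main() -> None:
    # Imported here so the helpers above stay usable without parrots installed.
    from parrots import SpeechRecognition

    recognizer = SpeechRecognition()
    for path, text in transcribe_files(recognizer, list_wav_files(".")).items():
        print(path, text)
```

Call `main()` from a directory that contains .wav files once parrots is installed.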

TTS (speech synthesis)

Example: examples/demo_tts.py

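import sys

# Allow importing parrots from a source checkout when run inside examples/
sys.path.append('..')
import parrots
from parrots.tts import TextToSpeech

# Add the installed parrots package directory to the import path
parrots_path = parrots.__path__[0]
sys.path.append(parrots_path)

# Load the speaker model and select the speaker by name
m = TextToSpeech(
    speaker_model_path="shibing624/parrots-gpt-sovits-speaker-maimai",
    speaker_name="MaiMai",
)
# Synthesize the text and write the audio to output_path
m.predict(
    text="你好，欢迎来北京。welcome to the city.",
    text_language="auto",
    output_path="output_audio.wav"
)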

Output:

Save audio to output_audio.wav
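
To synthesize several sentences into separate files, the same `predict` call can be wrapped in a loop. This is a sketch that uses only the keyword arguments shown in the example above (`text`, `text_language`, `output_path`); the helper `synthesize_lines` and its file-naming scheme are hypothetical and not part of parrots.

```python
import os
from typing import List


def synthesize_lines(tts, lines: List[str], out_dir: str, prefix: str = "line") -> List[str]:
    """Synthesize each non-empty line to its own .wav file and return the paths.

    `tts` is any object exposing predict(text=..., text_language=..., output_path=...),
    such as the TextToSpeech instance created in the example above.
    """
    os.makedirs(out_dir, exist_ok=True)
    written = []
    for index, text in enumerate(lines, start=1):
        text = text.strip()
        if not text:
            continue  # skip blank lines but keep the numbering aligned with the input
        output_path = os.path.join(out_dir, f"{prefix}_{index:03d}.wav")
        tts.predict(text=text, text_language="auto", output_path=output_path)
        written.append(output_path)
    return written
```

With the `m` object from the example, `synthesize_lines(m, ["你好", "welcome to the city."], "out")` would write out/line_001.wav and out/line_002.wav.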

Command-line mode (CLI)

ASR and TTS tasks can also be run from the command line. Code: cli.py

> parrots -h

NAME
    parrots

SYNOPSIS
    parrots COMMAND

COMMANDS
    COMMAND is one of the following:

     asr
       Entry point of asr, recognize speech from file

     tts
       Entry point of tts, generate speech audio from text

Run:

pip install parrots -U
# asr example
parrots asr -h
parrots asr examples/tushuguan.wav

# tts example
parrots tts -h
parrots tts "你好，欢迎来北京。welcome to the city. " output_audio.wav

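asr and tts are subcommands: asr performs speech recognition and tts performs speech synthesis. The default models are Chinese models.
For the usage of each subcommand, see parrots asr -h.
In the example above, examples/tushuguan.wav is the audio_file_path parameter of the asr method: the input audio file (required).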
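
The two subcommands can also be driven from a script through `subprocess`. The sketch below builds exactly the argument lists shown in the examples above (positional arguments only); the helper names are hypothetical, and it assumes the `parrots` executable is on PATH after `pip install parrots`.

```python
import subprocess
from typing import List


def build_asr_command(audio_file_path: str) -> List[str]:
    """Argument list equivalent to: parrots asr <audio_file_path>"""
    return ["parrots", "asr", audio_file_path]


def build_tts_command(text: str, output_path: str) -> List[str]:
    """Argument list equivalent to: parrots tts "<text>" <output_path>"""
    return ["parrots", "tts", text, output_path]


def run(command: List[str]) -> str:
    """Run the command and return its standard output.

    Raises subprocess.CalledProcessError if the command exits with a non-zero status.
    """
    completed = subprocess.run(command, check=True, capture_output=True, text=True)
    return completed.stdout
```

Passing the arguments as a list avoids shell quoting problems with text that contains spaces or punctuation, for example `run(build_tts_command("你好，欢迎来北京。", "output_audio.wav"))`.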

Released models

ASR

Belle-2/Belle-distilwhisper-large-v2-zh

TTS

shibing624/parrots-gpt-sovits-speaker

Speaker name	Character	Voice traits	Language
KuileBlanc	Lady	Standard American female voice	en (English)
LongShouRen	Gentleman	Standard American male voice	en (English)
MaiMai	Singing female anchor	Singing female anchor voice	zh (Chinese)
XingTong	Singing AI girl	Lively female voice	zh (Chinese)
XuanShen	Male game anchor	Male game anchor voice	zh (Chinese)
KusanagiNene	Loli	Loli schoolgirl voice	ja (Japanese)

shibing624/parrots-gpt-sovits-speaker-maimai

Speaker name	Character	Voice traits	Language
MaiMai	Singing female anchor	Singing female anchor voice	zh (Chinese)
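
The speaker tables can be turned into a small lookup that produces the two arguments `TextToSpeech` takes in the TTS example (`speaker_model_path` and `speaker_name`). This is only a sketch: the helper names are hypothetical, and the exact capitalization of the speaker names is an assumption (only "MaiMai" is confirmed by the TTS example), so check the model card before relying on it.

```python
from typing import Dict, List

# Model repository listed above that bundles all six speakers.
MULTI_SPEAKER_MODEL = "shibing624/parrots-gpt-sovits-speaker"

# Speaker name -> language code, taken from the table above.
# Assumption: the capitalization of the names other than "MaiMai" is a best guess.
SPEAKER_LANGUAGES: Dict[str, str] = {
    "KuileBlanc": "en",
    "LongShouRen": "en",
    "MaiMai": "zh",
    "XingTong": "zh",
    "XuanShen": "zh",
    "KusanagiNene": "ja",
}


def speakers_for_language(language: str) -> List[str]:
    """Return the speaker names listed for a language code such as 'zh'."""
    return [name for name, lang in SPEAKER_LANGUAGES.items() if lang == language]


def tts_kwargs(speaker_name: str) -> Dict[str, str]:
    """Build the keyword arguments used to construct TextToSpeech for a listed speaker."""
    if speaker_name not in SPEAKER_LANGUAGES:
        raise KeyError(f"unknown speaker: {speaker_name}")
    return {"speaker_model_path": MULTI_SPEAKER_MODEL, "speaker_name": speaker_name}
```

For example, `TextToSpeech(**tts_kwargs("XingTong"))` would select that speaker from the multi-speaker model, assuming the name matches the one stored in the model repository.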

Contact

Issues (suggestions):
Email me: xuming: [email protected]
WeChat: add my WeChat ID xuming624 to join the Python-NLP discussion group.

Citation

If you use parrots in your research, please cite it in the following format:

@misc{parrots,
  title={parrots: ASR and TTS Tool},
  author={Ming Xu},
  year={2024},
  howpublished={\url{https://github.com/shibing624/parrots}},
}