parrots 1.0.1 - AI source code download

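Parrots: ASR and TTS toolkit

Introduction

Parrots is an automatic speech recognition (ASR) and text-to-speech (TTS) toolkit that supports Chinese, English, Japanese, and other languages.

Parrots gives one-call access to speech recognition and speech synthesis models, with Chinese and English supported out of the box.

Features

ASR: a Chinese speech recognition (ASR) model based on distilwhisper that supports multiple languages, including Chinese and English.
TTS: a speech synthesis (TTS) model trained with GPT-SoVITS that supports Chinese, English, Japanese, and other languages.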

Installation

pip install torch # or conda install pytorch
pip install -r requirements.txt
pip install parrots

or

pip install torch # or conda install pytorch
git clone https://github.com/shibing624/parrots.git
cd parrots
python setup.py install

Demo

Official demo: https://www.mulanai.com/product/tts/
Hugging Face demo: https://huggingface.co/spaces/shibing624/parrots

Run examples/tts_gradio_demo.py to see the demo locally:

python examples/tts_gradio_demo.py

Usage

ASR (speech recognition)

Example: examples/demo_asr.py

import os
import sys

# Allow running from the examples/ directory of a source checkout.
sys.path.append('..')
from parrots import SpeechRecognition

# Directory containing this script; the sample audio file sits next to it.
pwd_path = os.path.abspath(os.path.dirname(__file__))

if __name__ == '__main__':
    m = SpeechRecognition()
    r = m.recognize_speech_from_file(os.path.join(pwd_path, 'tushuguan.wav'))
    print('[Info] Speech recognition result:', r)

Output:

{'text': '北京图书馆'}
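The single-file call above extends naturally to a folder of recordings. The following is a minimal sketch, not part of parrots itself: `transcribe_directory` is a hypothetical helper, and it assumes only what the example shows, namely that the recognizer takes a file path and returns a dict with a 'text' key. The recognizer is passed in as a callable so the helper can be tried without loading a model.

```python
from pathlib import Path
from typing import Callable, Dict, Iterable


def transcribe_directory(
    recognize: Callable[[str], dict],
    directory: str,
    suffixes: Iterable[str] = (".wav",),
) -> Dict[str, str]:
    """Run `recognize` on every matching audio file and collect the text.

    `recognize` is any callable that maps a file path to a dict with a
    'text' key, as in the README output {'text': ...}.
    """
    wanted = {s.lower() for s in suffixes}
    results: Dict[str, str] = {}
    for path in sorted(Path(directory).iterdir()):
        # Skip sub-directories and files with other extensions.
        if path.is_file() and path.suffix.lower() in wanted:
            results[path.name] = recognize(str(path)).get("text", "")
    return results


# Usage with parrots (requires the package and its model weights):
#   from parrots import SpeechRecognition
#   m = SpeechRecognition()
#   texts = transcribe_directory(m.recognize_speech_from_file, "examples")
```

Creating the model once and reusing its bound method avoids reloading the weights for every file.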

TTS (speech synthesis)

Example: examples/demo_tts.py

import sys

# Allow running from the examples/ directory of a source checkout.
sys.path.append('..')
import parrots
from parrots.tts import TextToSpeech

# Make the modules bundled inside the parrots package importable.
parrots_path = parrots.__path__[0]
sys.path.append(parrots_path)

m = TextToSpeech(
    speaker_model_path="shibing624/parrots-gpt-sovits-speaker-maimai",
    speaker_name="MaiMai",
)
m.predict(
    text="你好，欢迎来北京。welcome to the city.",
    text_language="auto",
    output_path="output_audio.wav",
)

Output:

Save audio to output_audio.wav
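To synthesize several sentences in one run, the `predict` call can be wrapped in a loop. This is a sketch under stated assumptions: `synthesize_batch` and the `utterance_NNN.wav` naming scheme are hypothetical, and the only API relied on is the keyword signature used in the example (`text`, `text_language`, `output_path`).

```python
from pathlib import Path
from typing import Callable, List, Sequence


def synthesize_batch(
    predict: Callable[..., object],
    texts: Sequence[str],
    output_dir: str,
    text_language: str = "auto",
) -> List[str]:
    """Call `predict` once per non-empty text and return the output paths.

    `predict` must accept the keyword arguments text, text_language and
    output_path, as TextToSpeech.predict does in the README example.
    """
    out = Path(output_dir)
    out.mkdir(parents=True, exist_ok=True)
    paths: List[str] = []
    for index, text in enumerate(texts, start=1):
        if not text.strip():
            continue  # nothing to synthesize for blank entries
        # File numbers follow the position in `texts`, so skipped blank
        # entries leave gaps in the numbering.
        path = out / f"utterance_{index:03d}.wav"
        predict(text=text, text_language=text_language, output_path=str(path))
        paths.append(str(path))
    return paths


# Usage with parrots (requires the package and the speaker model):
#   from parrots.tts import TextToSpeech
#   m = TextToSpeech(
#       speaker_model_path="shibing624/parrots-gpt-sovits-speaker-maimai",
#       speaker_name="MaiMai",
#   )
#   synthesize_batch(m.predict, ["你好。", "welcome to the city."], "out")
```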

Command-line interface (CLI)

ASR and TTS tasks can also be run from the command line; the code is in cli.py.

> parrots -h

NAME
    parrots

SYNOPSIS
    parrots COMMAND

COMMANDS
    COMMAND is one of the following:

     asr
       Entry point of asr, recognize speech from file

     tts
       Entry point of tts, generate speech audio from text

Run:

pip install parrots -U
# asr example
parrots asr -h
parrots asr examples/tushuguan.wav

# tts example
parrots tts -h
parrots tts "你好，欢迎来北京。welcome to the city. " output_audio.wav

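asr and tts are subcommands: asr performs speech recognition and tts performs speech synthesis. The default models are Chinese models.
For the usage of each subcommand, see its help output, for example parrots asr -h.
In the example above, examples/tushuguan.wav is the audio_file_path argument of the asr command, that is, the input audio file (required).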
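When the CLI is driven from a script, passing the arguments as a list avoids shell quoting problems with spaces and non-ASCII text. The sketch below assumes only the two subcommands and positional arguments shown above; `build_parrots_command` and `run_parrots` are hypothetical helper names, and no other flags are assumed.

```python
import subprocess
from typing import List

SUBCOMMANDS = ("asr", "tts")


def build_parrots_command(task: str, *args: str) -> List[str]:
    """Return the argv list for `parrots <task> <args...>`."""
    if task not in SUBCOMMANDS:
        raise ValueError(f"unknown subcommand: {task!r}")
    return ["parrots", task, *args]


def run_parrots(task: str, *args: str) -> str:
    """Run the CLI and return its standard output.

    Raises subprocess.CalledProcessError if the command exits non-zero.
    """
    result = subprocess.run(
        build_parrots_command(task, *args),
        check=True,
        capture_output=True,
        text=True,
    )
    return result.stdout


# Usage (requires `pip install parrots`):
#   print(run_parrots("asr", "examples/tushuguan.wav"))
#   run_parrots("tts", "你好，欢迎来北京。welcome to the city.", "output_audio.wav")
```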

Released models

ASR

belle-2/belle-distilwhisper-large-v2-zh

TTS

shibing624/parrots-gpt-sovits-speaker

Speaker name	Character	Voice characteristics	Language
KuileBlanc	Lady	Standard American female voice	en (English)
LongShouRen	Gentleman	Standard American male voice	en (English)
MaiMai	Singing female streamer	Singing female streamer voice	zh (Chinese)
XingTong	Singing AI girl	Lively female voice	zh (Chinese)
XuanShen	Male game streamer	Male game streamer voice	zh (Chinese)
KusanagiNene	Loli	Young schoolgirl voice	ja (Japanese)

shibing624/parrots-gpt-sovits-speaker-maimai

Speaker name	Character	Voice characteristics	Language
MaiMai	Singing female streamer	Singing female streamer voice	zh (Chinese)

Contact

Issues (suggestions): open an issue on GitHub (https://github.com/shibing624/parrots)
Email me: xuming: [email protected]
WeChat me: add my WeChat ID xuming624 to join the Python-NLP discussion group, with the note: name-company-NLP

Citation

If you use parrots in your research, please cite it in the following format:

@misc{parrots,
  title={parrots: ASR and TTS Tool},
  author={Ming Xu},
  year={2024},
  howpublished={\url{https://github.com/shibing624/parrots}},
}