parrots Download - parrots Source code download

parrots

AI Source Code

1.0.1

Download

??Chinese | English | Documents/Docs | ?Models/Models

Online Demo

Parrots: ASR and TTS toolkit

Introduction

Parrots, Automatic Speech Recognition( ASR ), Text-To-Speech( TTS ) toolkit, support Chinese, English, Japanese, etc.

parrots implements one-click call to speech recognition and speech synthesis models, which are out of the box and support Chinese and English.

Features

ASR: Chinese speech recognition (ASR) model based on distilwhisper , supports multiple languages such as Chinese and English.
TTS: A voice synthesis (TTS) model based on GPT-SoVITS training, supports Chinese, English, Japanese and other languages

Install

pip install torch # or conda install pytorch
pip install -r requirements.txt
pip install parrots

or

pip install torch # or conda install pytorch
git clone https://github.com/shibing624/parrots.git
cd parrots
python setup.py install

Demo

Official Demo: https://www.mulanai.com/product/tts/
HuggingFace Demo: https://huggingface.co/spaces/shibing624/parrots

run example: examples/tts_gradio_demo.py to see the demo:

python examples/tts_gradio_demo.py

Usage

ASR (Speech Recognition)

example: examples/demo_asr.py

 import os
import sys

sys . path . append ( '..' )
from parrots import SpeechRecognition

pwd_path = os . path . abspath ( os . path . dirname ( __file__ ))

if __name__ == '__main__' :
    m = SpeechRecognition ()
    r = m . recognize_speech_from_file ( os . path . join ( pwd_path , 'tushuguan.wav' ))
    print ( '[提示] 语音识别结果：' , r )

output:

 {'text': '北京图书馆'}

TTS (Speech Synthesis)

example: examples/demo_tts.py

 import sys
sys . path . append ( '..' )
import parrots
from parrots . tts import TextToSpeech
parrots_path = parrots . __path__ [ 0 ]
sys . path . append ( parrots_path )

m = TextToSpeech (
    speaker_model_path = "shibing624/parrots-gpt-sovits-speaker-maimai" ,
    speaker_name = "MaiMai" ,
)
m . predict (
    text = "你好，欢迎来北京。welcome to the city." ,
    text_language = "auto" ,
    output_path = "output_audio.wav"
)

output:

 Save audio to output_audio.wav

Command Line Mode (CLI)

Support execution of ARS and TTS tasks through command line, code: cli.py

 > parrots -h                                    

NAME
    parrots

SYNOPSIS
    parrots COMMAND

COMMANDS
    COMMAND is one of the following:

     asr
       Entry point of asr, recognize speech from file

     tts
       Entry point of tts, generate speech audio from text

run:

pip install parrots -U
# asr example
parrots asr -h
parrots asr examples/tushuguan.wav

# tts example
parrots tts -h
parrots tts "你好，欢迎来北京。welcome to the city. " output_audio.wav

asr and tts are secondary commands, asr is speech recognition, tts is speech synthesis, and the default model is Chinese model
See parrots asr -h for the usage of each secondary command
In the above examples/tushuguan.wav is the audio_file_path parameter of asr method, and the input audio file (required)

Release Models

ASR

BELLE-2/Belle-distilwhisper-large-v2-zh

TTS

shibing624/parrots-gpt-sovits-speaker

speaker name	Name of the speaker	character	Characteristics	language	language
KuileBlanc	Kwai LeBron	lady	Standard American female voice	en	Britain
LongShouRen	Long Shouren	gentleman	Standard American male voice	en	Britain
MaiMai	Sell and sell	sing female anchor	Singing female anchor voice	zh	middle
XingTong	Star Eye	sing air girl	Lively female voice	zh	middle
XuanShen	Show off God	game male anchor	The voice of the male anchor of the game	zh	middle
KusanagiNene	Kusanagi Ning	loli	Loli female student voice	ja	day

shibing624/parrots-gpt-sovits-speaker-maimai

speaker name	Name of the speaker	character	Characteristics	language	language
MaiMai	Sell and sell	sing female anchor	Singing female anchor voice	zh	middle

Contact

Issue (suggestions):
Email me: xuming: [email protected]
WeChat Me: Add me WeChat ID: xuming624 , enter the Python-NLP communication group, note: Name-Company Name-NLP

Citation

If you use parrots in your research, please quote it in the following format:

@misc{parrots,
  title={parrots: ASR and TTS Tool},
  author={Ming Xu},
  year={2024},
  howpublished={ url {https://github.com/shibing624/parrots}},
}

License

The license agreement is The Apache License 2.0, which can be used for commercial purposes for free. Please attach the parrots link and authorization agreement to the product description.

Contribute

The project code is still very rough. If you have improved the code, you are welcome to submit it back to this project. Before submitting, pay attention to the following two points: