MiniMania is a web application that provides text-to-speech (TTS) and voice cloning capabilities. The application utilizes deep learning models to generate high-quality speech output in multiple languages, with options to customize the pitch, speed, and volume of the generated audio. MiniMania's TTS engine is built on top of the Tacotron 2 and WaveGlow models, while its voice cloning capabilities are based on the FastSpeech and MelGAN models.
Click the Mega Link To see the Demo: Link For The Video
Before you can install and use MimicMania, you will need to download and install the following:
apt-get install ffmpeg in your terminal.sudo apt-get install espeak-ng in your terminal.sudo apt-get install espeak in your terminal.Additionally, you will also need to download the required module and Python dependencies as described in the next section.
To install and run MimicMania, follow these steps:
git clone https://github.com/everydaycodings/MimicMania.gitcd MimicManiapython setup.py to download the required modules. This process may take some time, as the required module is around 6 GB.pip install -r requirements.txt to download all the Python dependencies.streamlit run app.py to start the web application.If you'd like to contribute to MimicMania, please fork the repository and create a pull request. We welcome contributions of all kinds, including bug fixes, new features, and documentation improvements.
MimicMania was developed by Kumar Saksham(everydaycodings) with the help of various open source resources.
We would like to extend a special thank you to coqui-ai/TTS for providing their text-to-speech model as a resource for our project.
MimicMania is licensed under the MIT License.
If you have any questions or issues with MimicMania, please contact us at [email protected] or reach out to us on Twitter @everydaycodings or Medium @everydaycodings.
We're always happy to help!