Clone a voice and output speech in another language with the original voice.
Python 3.7 is REQUIRED, due to the version of TensorFlow used in this project.
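To confirm the interpreter you are about to use matches this requirement, a small check can be run first (a sketch, not part of the project):

import sys

# CogNative requires Python 3.7 because of its TensorFlow dependency.
major, minor = sys.version_info[:2]
assert (major, minor) == (3, 7), f"Python 3.7 required, found {sys.version.split()[0]}"
print("Python version OK")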
Create the virtual environment:
python3 -m venv pyvenv
Activate virtual environment:
Windows: ./pyvenv/Scripts/activate
MacOS/Linux: source pyvenv/bin/activate
Deactivating the virtual environment:
deactivate
Note: Your Python virtual environment may cause issues when running the UI.
Once ffmpeg is downloaded, extract the folder and add <ffmpeg folder path>/bin to your PATH.
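To confirm ffmpeg is actually reachable after updating PATH, a quick check like the following can help (a sketch, not part of the project):

import shutil

# shutil.which returns None when "ffmpeg" cannot be found on the current PATH.
ffmpeg_path = shutil.which("ffmpeg")
if ffmpeg_path is None:
    raise RuntimeError("ffmpeg not found on PATH; add <ffmpeg folder path>/bin to PATH")
print(f"ffmpeg found at {ffmpeg_path}")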
pip3 install -r requirements.txt
Once downloaded, add the models (*.pt) to CogNative/CogNative/models/RTVC/saved_models/default
The taco_pretrained folder (including the folder itself) needs to be downloaded and added to CogNative/CogNative/models/RTVCSwedish/synthesizer/saved_models/swedish
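A quick way to verify that the model files ended up in the expected locations is to list them from the repository root; a sketch using the paths given above:

from pathlib import Path

# Paths as described above, relative to the repository root.
rtvc_dir = Path("CogNative/CogNative/models/RTVC/saved_models/default")
swedish_dir = Path("CogNative/CogNative/models/RTVCSwedish/synthesizer/saved_models/swedish/taco_pretrained")

# The RTVC models are *.pt files placed directly in saved_models/default.
print("RTVC *.pt files:", sorted(p.name for p in rtvc_dir.glob("*.pt")))

# The Swedish synthesizer is the taco_pretrained folder inside saved_models/swedish.
print("taco_pretrained present:", swedish_dir.is_dir())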
Place a credentials.json in the top-level directory. There is currently a file named credentials.json.template; your credentials.json should match the key/value pairs shown there.
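To sanity-check that your credentials.json defines the same keys as the template (this assumes the template itself is valid JSON with placeholder values), a small sketch:

import json

# Compare top-level key names between the template and your credentials file.
with open("credentials.json.template") as f:
    template_keys = set(json.load(f))
with open("credentials.json") as f:
    credential_keys = set(json.load(f))

missing = template_keys - credential_keys
if missing:
    raise KeyError(f"credentials.json is missing keys: {sorted(missing)}")
print("credentials.json matches the template's keys")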
Start from the CogNative root directory.

To launch the GUI, run python -m CogNative.testUI.UI
Any required flag that is not specified on the command line will trigger a prompt that must be answered before the program continues. Examples follow.
python -m CogNative.main -help

CogNative CLI Flags:
-sampleAudio <PATH>: audio file of voice to clone
-synType <text, audio>: synthesis mode either given input text or by transcribing audio file
[-dialogueAudio] <PATH>: for audio synType, audio file of dialogue to speak
[-dialogueText] <PATH>: for text synType, text string of dialogue to speak
-out <PATH>: output audio file path
-useExistingEmbed <y/yes/n/no>: Uses saved embedding of previously used voice samples if enabled and present.
python -m CogNative.main -sampleAudio CogNative/examples/MatthewM66.wav -synType text -dialogueText "The turbo-encabulator has now reached a high level of development, and it's being successfully used in the operation of novertrunnions." -out cmdExampleText.wav -useExistingEmbed y

Loaded encoder "english_encoder.pt" trained to step 1564501
Synthesizer using device: cuda
Building Wave-RNN
Trainable Parameters: 4.481M
Loading model weights at CogNative\models\RTVC\saved_models\default\vocoder.pt
Synthesizing...
Clone output to cmdExampleText.wav
python -m CogNative.main -sampleAudio CogNative\examples\MatthewM66.wav -synType audio -dialogueAudio CogNative\examples\BillMaher22.wav -out cmdExampleAudio.wav -useExistingEmbed n

Loaded encoder "english_encoder.pt" trained to step 1564501
Synthesizer using device: cuda
Building Wave-RNN
Trainable Parameters: 4.481M
Loading model weights at CogNative\models\RTVC\saved_models\default\vocoder.pt
Loading requested file...
Synthesizing...
Clone output to cmdExampleAudio.wav
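The same flags can also be driven from another Python script, for example to batch several samples. A minimal sketch using subprocess and the flags documented above (the paths and dialogue text are placeholders):

import subprocess
import sys

# Invoke the CogNative CLI with the same flags as the examples above.
# sys.executable ensures the CLI runs under the current interpreter/virtualenv.
subprocess.run(
    [
        sys.executable, "-m", "CogNative.main",
        "-sampleAudio", "CogNative/examples/MatthewM66.wav",
        "-synType", "text",
        "-dialogueText", "Hello from a batch script.",
        "-out", "batchExampleText.wav",
        "-useExistingEmbed", "y",
    ],
    check=True,  # raise if the CLI exits with a non-zero status
)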
This script translates audio from a supported language to English. To use the AutoTranslate script on Windows, drag and drop an audio file onto the script, or place a SHORTCUT to the script in %AppData%\Microsoft\Windows\SendTo and use the "Send To" context menu function on the audio file to be translated. In both cases a new .wav file, named with the original filename followed by "_<destination language>", will be placed in the same folder. On other platforms the same CLI flags apply, but the details of context menu integration will vary with the packages installed.
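For reference, both the drag-and-drop and "Send To" flows deliver the audio file's path as the script's first command-line argument, and the output name is built from it. A sketch of just that naming logic (destination language hard-coded to English here to match the description above; this is not the project's actual script):

import sys
from pathlib import Path

# Windows passes the dropped/sent file's path as the first argument.
source = Path(sys.argv[1])
destination_language = "english"

# The translated audio is written next to the original as
# <original filename>_<destination language>.wav
output = source.with_name(f"{source.stem}_{destination_language}.wav")
print(f"Translated audio would be written to: {output}")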
git branch yourname-feature-name

This style guide keeps styling consistent throughout the project. To style your code, please use the Black Python formatter.
Single file: black <python-file-name>
All files: black .
This GitHub repository serves as the foundation of our voice cloning module.
Real-Time-Voice-Cloning
See license here.
The Swedish synthesizer was trained using this GitHub repository.
Real-Time-Voice-Cloning Swedish