CogNative下載 - CogNative源代碼下載

CogNative

Ai源碼

1.0.0

下載

認知

翻譯的語音綜合

用另一種語言克隆語音和輸出語音。

安裝

1。安裝python：

建議使用Python 3.7。由於該項目中使用了TensorFlow的版本，因此需要Python 3.7。

2。創建虛擬環境（可選）：

python3 -m venv pyvenv

激活虛擬環境：Windows： ./pyvenv/Scripts/activate scripts/activate macos/linux： source pyvenv/bin/activate

停用虛擬環境： deactivate

注意：運行UI時，您的Python虛擬環境可能會引起問題。

3。安裝FFMPEG。

安裝後，提取文件夾，然後將<ffmpeg folder path>/bin添加到路徑。

4。安裝pytorch：

Pytorch構建：穩定（1.11.0）。
您的操作系統：選擇OS您的環境正在運行認知（Windows或Linux推薦）。
軟件包：選擇您使用的包裝安裝程序（建議使用PIP）。
語言：Python。
計算平台：CUDA 11.3推薦。如果您沒有GPU選擇CPU。

5。安裝所需的Python軟件包：

pip3 install -r requirements.txt

6。安裝模型。

下載後，將模型（*.pt）添加到CogNative/CogNative/models/RTVC/saved_models/default

需要下載taco_pretrented文件夾（包括文件夾本身），並添加到CogNative/CogNative/models/RTVCSwedish/synthesizer/saved_models/swedish

7。創建Google Cloud憑據：

按照步驟設置Google Cloud憑據。
將Google憑據添加到頂級目錄中的credentials.json 。當前有一個名為credentials.json.template的文件，您的credentials.json應該匹配那裡顯示的鍵/值對。

用法

從認知根目錄開始。

GUI

要啟動GUI，請運行python -m CogNative.testUI.UI

CLI

未指定的任何必要標誌將導致生成提示，這些提示必須在繼續之前回答。如下。

顯示幫助消息： python -m CogNative.main -help

 CogNative CLI FLags:
    -sampleAudio <PATH>: audio file of voice to clone
    -synType <text, audio>: synthesis mode either given input text or by transcribing audio file
    [-dialogueAudio] <PATH>: for audio synType, audio file of dialogue to speak
    [-dialogueText] <PATH>: for text synType, text string of dialogue to speak
    -out <PATH>: output audio file path
    -useExistingEmbed <y/yes/n/no>: Uses saved embedding of previously used voice samples if enabled and present.

從示例語音和文本輸入中生成克隆的語音： python -m CogNative.main -sampleAudio CogNative/examples/MatthewM66.wav -synType text -dialogueText "The turbo-encabulator has now reached a high level of development, and it's being successfully used in the operation of novertrunnions." -out cmdExampleText.wav -useExistingEmbed y

 Loaded encoder "english_encoder.pt" trained to step 1564501
Synthesizer using device: cuda
Building Wave-RNN
Trainable Parameters: 4.481M
Loading model weights at CogNativemodelsRTVCsaved_modelsdefaultvocoder.pt
Synthesizing...
Clone output to cmdExampleText.wav

從示例語音和音頻輸入文件中生成克隆的語音： python -m CogNative.main -sampleAudio CogNativeexamplesMatthewM66.wav -synType audio -dialogueAudio CogNativeexamplesBillMaher22.wav -out cmdExampleAudio.wav -useExistingEmbed n

 Loaded encoder "english_encoder.pt" trained to step 1564501
Synthesizer using device: cuda
Building Wave-RNN
Trainable Parameters: 4.481M
Loading model weights at CogNativemodelsRTVCsaved_modelsdefaultvocoder.pt
Loading requested file...
Synthesizing...
Clone output to cmdExampleAudio.wav

自動轉換腳本

該腳本將將音頻從受支持的語言轉換為英語。要使用Windows上的自動轉換腳本，請將音頻文件拖放到腳本上，或將快捷方式放在%AppData%MicrosoftWindowsSendTo中，並使用“發送到“發送到”上下文菜單函數在音頻文件上。在這兩種情況下，一個帶有原始文件名的新的.WAV文件，然後將“ _ +目標語言”放置在同一文件夾中。對於其他平台，應使用相同的CLI標誌，但上下文菜單集成上的詳細信息將因安裝哪些軟件包而有所不同。