detail_tts Download - detail_tts Source code download

detail_tts

AI Source Code

1.0.0

Download

Detail TTS

The model newly proposed three significant important methods to become the best practice of AR TTS.

Although RVQ is used, the actual training employs continuous features, I call it fake discretization.
All in one model. The model contains gpt, diffusion, vqvae, gan and flowvae all in one. One train one inference.
Both prefixed spk emb and prompt are used to get benefit from both Valle type inference and Tortoise type training.

Here is the result obtained after the model was trained on 10000 hours of very dirty data. The model can be easily scaled up with many low quality data.

prompt 0

prompt00.mov

generated 0

prompt01.mov

prompt 1

prompt10.mov

generated 1

prompt12.mov

prompt 2

prompt20.mov

generated 2

prompt21.mov

Inference

check api.py

Dataset prepare

Change the path contains audios in script and run

python prepare/0_vad_asr_save_to_jsonl.py

Train and Fine Tune

accelerate launch train.py

For fine tuning, change the pretrain model load path.

Acknowledgements

VQ and VITS from GSV

Diffusion and GPT from tortoise

Expand

Additional Information

Version 1.0.0
Type AI Source Code
Update Time 2025-08-24
size 2.2MB
From Github

Related Applications

OpenCore_NO_ACPI_Build

2024-11-13
nspanel_pro_tools_apk

2024-11-12
zkwork_aleo_gpu_worker

2024-11-11
F5 TTS ComfyUI

2024-11-02
nextcloud_share_url_downloader

2024-11-01
Lihua data analysis engine free version 3.0_search_navigation_collection_public opinion_ranking_api

2022-06-28

Recommended for You

chat.petals.dev

Other source code

1.0.0
GPT Prompt Templates

Other source code

1.0.0
GPTyped

Other source code

GPTyped 1.0.5
ML stack

AI Source Code

1.0.0
awesome free chatgpt

AI Source Code

1.0.0
pywin_contextmenu

AI Source Code

Version update
Google Dorks

Other source code

1.0
shepherd

Other source code

v6.1.6-react-shepherd: Prepare Release (#3063)
mongo express

Other source code

v1.1.0-rc-3

Related Information All