vits japros webui Download - vits japros webui Source Code Download


I am now focusing on developing Style-Bert-VITS2, so this repository will no longer be updated: https://github.com/litagin02/Style-Bert-VITS2

Bert-VITS2 reference article: https://zenn.dev/litagin/articles/b1ddc1da5ea2b3

VITS-JaPros-WebUI

This is a WebUI for Windows that lets you train Japanese VITS models and synthesize speech with accent control. If you only need speech synthesis, it works even without a graphics card.

Speech synthesis demo

Speech synthesis	Training

JaPros?

ESPnet is a machine learning framework that handles a variety of speech processing tasks in a unified way.
VITS is one of the TTS models that can be trained with ESPnet.
When training TTS, ESPnet lets you specify the method (g2p) used to convert the training text (Japanese sentences) into phoneme sequences. One of these methods is pyopenjtalk_prosody, which adds accent symbols to the phonemes.

Here, a VITS model trained with pyopenjtalk_prosody as the Japanese g2p is called "JaPros" for convenience (a name suggested by Bing-chan).

Because pyopenjtalk_prosody also handles accent symbols, you can use those symbols to control the accent (for example, ハ➚シ versus ハ➘シ).

Accent symbol details

Symbol	Role	Example
`[`	The pitch rises from here (think of ➚)	Hello → `コ[ンニチワ`
`]`	The pitch falls from here (think of ➘)	Kyoto → `キョ]オト`
(Half-width space)	Accent phrase boundary (roughly, one chunk of speech)	`ソ[レワ ム[ズカシ]イ`
`、`	Pause (taking a breath). Use it when you want to insert a short pause.	`ハ]イ、ソ[オオ[モイマ]ス`
`?`	Add it at the end of a question.	`キ[ミワダ]レ?`
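To make the notation concrete, here is a minimal sketch of how a string in this notation could be taken apart. It is not code from this repository: the function names `split_accent_phrases`, `strip_accent_marks`, and `is_question` are hypothetical, and the sketch assumes only the symbols listed in the table.

```python
# Hypothetical helpers for the katakana + symbol notation described above.
# They are illustrative only and are not part of VITS-JaPros-WebUI.

def split_accent_phrases(text: str) -> list[list[str]]:
    """Split on '、' into pause groups, then on half-width spaces
    into accent phrases. Empty pieces are dropped."""
    groups = []
    for chunk in text.split("、"):
        phrases = [p for p in chunk.split(" ") if p]
        if phrases:
            groups.append(phrases)
    return groups


def strip_accent_marks(text: str) -> str:
    """Remove '[' and ']' so only the reading and punctuation remain."""
    return text.replace("[", "").replace("]", "")


def is_question(text: str) -> bool:
    """True if the text ends with a question mark (half- or full-width)."""
    return text.rstrip().endswith(("?", "？"))


if __name__ == "__main__":
    print(split_accent_phrases("ハ]イ、ソ[レワ ム[ズカシ]イ"))
    print(strip_accent_marks("コ[ンニチワ"))
    print(is_question("キ[ミワダ]レ?"))
```

Splitting first on `、` and then on spaces mirrors the two levels in the table: pauses separate larger groups, and half-width spaces separate accent phrases inside a group.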

What is this?

This tool lets you train VITS JaPros models, load them, and synthesize speech with them in a local Windows environment.

Training

Automatic transcription of audio files using faster-whisper
The training code itself has been modified to run on Windows, so a VITS JaPros model can be trained with minimal steps.

About speech synthesis

Accent control with katakana and symbols that is (probably) fairly intuitive
Simple adjustment of speech speed, pitch, and intonation (using pyworld)
Also runs on the CPU (so it can be started separately while training is in progress to check results)
Models not created with this tool should also work, as long as they are VITS models trained in ESPnet with pyopenjtalk_prosody and are placed together with their config.yaml
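The feature list only says that the pitch and intonation adjustments come from pyworld, which analyzes speech into an F0 (fundamental frequency) contour among other features. As a rough illustration of what adjusting such a contour can look like, the sketch below shifts pitch by semitones and scales intonation around the mean in the log-frequency domain. This is an assumption for illustration, written in plain Python without pyworld; the function name `adjust_f0` and its parameters are hypothetical and do not describe how this WebUI actually implements the feature.

```python
import math

# Hypothetical illustration of F0 contour adjustment; not the WebUI's code.
# Convention assumed here: f0 values <= 0 mark unvoiced frames.

def adjust_f0(f0, pitch_semitones=0.0, intonation_scale=1.0):
    """Return a new F0 contour.

    pitch_semitones  shifts every voiced frame by that many semitones.
    intonation_scale stretches (>1) or flattens (<1) the contour around
                     the mean log-F0 of the voiced frames.
    """
    voiced = [v for v in f0 if v > 0]
    if not voiced:
        return [0.0 for _ in f0]

    mean_log = sum(math.log(v) for v in voiced) / len(voiced)
    ratio = 2.0 ** (pitch_semitones / 12.0)  # one octave = 12 semitones

    adjusted = []
    for v in f0:
        if v <= 0:
            adjusted.append(0.0)  # keep unvoiced frames unvoiced
        else:
            log_v = mean_log + (math.log(v) - mean_log) * intonation_scale
            adjusted.append(math.exp(log_v) * ratio)
    return adjusted


if __name__ == "__main__":
    contour = [0.0, 100.0, 200.0]
    print(adjust_f0(contour, pitch_semitones=12))    # one octave up
    print(adjust_f0(contour, intonation_scale=0.0))  # fully flattened
```

Working in the log domain keeps the adjustment proportional to perceived pitch, which is one common design choice; a linear-domain version would be equally valid as a sketch.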

How to use

Install

Confirmed to work on Windows 11 with Python 3.10 and an RTX 4070.

First, clone this repository.

git clone https://github.com/litagin02/vits-japros-webui.git

Double-click setup.bat inside the cloned folder and wait a moment. When `Setup complete.` appears, installation is finished.

How to use

Training: Double-click webui_train.bat
Speech synthesis: Place the pth file as described below, then double-click webui_infer.bat
Update: Double-click update.bat

For more details, or if you do not need the WebUI, see here.

Place a model for speech synthesis

For each model, create a subdirectory in the weights directory and place the {number}epoch.pth file inside it. If you are using an external model (only models trained in ESPnet as VITS with pyopenjtalk_prosody are compatible), also include the config.yaml that was used for training.

weights
├── model1
│    └── 100epoch.pth
├── model2
│    ├── 50epoch.pth
│    └── config.yaml
...
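To check that a weights directory follows this layout before launching the WebUI, a small script along the following lines could be used. It is a sketch, not part of this repository: the function name `find_models` and the returned dictionary shape are hypothetical, and it assumes only the `{number}epoch.pth` naming and the optional `config.yaml` described above.

```python
import re
from pathlib import Path

# Hypothetical checker for the weights layout described above.
# Matches file names such as "100epoch.pth" and captures the number.
EPOCH_PATTERN = re.compile(r"^(\d+)epoch\.pth$")


def find_models(weights_dir):
    """Return {model_name: {"checkpoints": [...], "has_config": bool}}.

    Subdirectories without any {number}epoch.pth file are skipped.
    Checkpoints are sorted by epoch number, not alphabetically.
    """
    models = {}
    root = Path(weights_dir)
    for sub in sorted(p for p in root.iterdir() if p.is_dir()):
        checkpoints = []
        for f in sub.iterdir():
            match = EPOCH_PATTERN.match(f.name)
            if match and f.is_file():
                checkpoints.append((int(match.group(1)), f.name))
        if checkpoints:
            checkpoints.sort()
            models[sub.name] = {
                "checkpoints": [name for _, name in checkpoints],
                "has_config": (sub / "config.yaml").is_file(),
            }
    return models


if __name__ == "__main__":
    if Path("weights").is_dir():
        for name, info in find_models("weights").items():
            print(name, info["checkpoints"], "config.yaml:", info["has_config"])
```

Run from the repository root, this prints one line per model directory so a missing `config.yaml` for an external model is easy to spot.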

Credits

ESPnet: This repository includes the original ESPnet Python module, modified so that it runs on Windows (the only changes are to the places that use os.uname and that create symbolic links).
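The actual patches are not shown here, but the two spots mentioned are typical Windows portability problems: `os.uname` is only available on Unix, and creating symbolic links on Windows can fail without the right privileges. The sketch below shows one common way to handle both. It is an assumption about the general technique, not the modification made in this repository; `machine_name` and `link_or_copy` are hypothetical names.

```python
import os
import platform
import shutil

# Hypothetical portability helpers; not the patches used in this repository.


def machine_name():
    """Host name via platform.uname(), which also works on Windows,
    where os.uname() does not exist."""
    return platform.uname().node


def link_or_copy(src, dst):
    """Try to create a symbolic link dst -> src; if the OS refuses
    (for example, missing privileges on Windows), copy the file instead.
    Returns "symlink" or "copy" to report what happened."""
    if os.path.lexists(dst):
        os.remove(dst)  # replace an existing file or stale link
    try:
        os.symlink(src, dst)
        return "symlink"
    except (OSError, NotImplementedError):
        shutil.copy2(src, dst)
        return "copy"
```

Falling back to a copy costs disk space but keeps the rest of the pipeline unchanged, since later steps can read `dst` either way.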


Additional Information

Version 1.0.0
Type AI Source Code
Update Time 2025-08-22
size 1.97MB
From Github
