Irene Voice Assistant Download - Irene Voice Assistant Source code download

Irene Voice Assistant

Voice assistant Irina

Irina is a Russian-language voice assistant that works offline. It requires Python 3.5+ (an older version may also work, but it must be Python 3).

Supports plugins (skills).

Article on Habr | Second article on Habr | Third article on Habr | Telegram group

Through the vsegpt.ru service (another project by Irina's author), Irina:

Supports chatting with ChatGPT, GPT-4, and Claude 3.
Supports retrieving reference information from the Internet (the "reference" command) using Perplexity's online models.
Supports OpenAI TTS, in case setting up a local engine is difficult (see the plugin setup instructions). Any OpenAI-compatible endpoint can also be used.

Fastest installation on Windows

Go to https://github.com/janvarev/irene-va-win-installer, download the code (Code/Download ZIP), and follow the instructions.

After installation, the following commands will be available (spoken in Russian): "Irina, hello", "Irina, flip a coin", "Irina, roll the dice", "Irina, play higher or lower", "Irina, timer three minutes".

To configure the assistant or troubleshoot problems, run start-settings-manager.bat to open the settings manager, where you can fine-tune the plugins and discover additional commands.

More documentation on this option: docs/install_win_compact.md

Fastest installation on Windows, option 2 (outdated)

Go to the releases page: https://github.com/janvarev/irene-voice-assistant/releases
Download the release and follow the instructions. Python and Git are bundled with the release, so nothing else needs to be installed.

After installation, the offline commands will be available (this is the default configuration). Examples (spoken in Russian): "Irina, hello", "Irina, flip a coin", "Irina, roll the dice", "Irina, play higher or lower", "Irina, timer three minutes".

How to set up this option: docs/install_win_compact.md

Installation / quick start

You will need Python installed (approximately versions 3.7-3.11).

To quickly install all required dependencies, run: pip install -r requirements.txt (on Linux and macOS, first install the packages required by audioplayer).
To start the assistant, run runva_vosk.py from the root folder. By default it launches the offline VOSK recognizer for speech from the microphone and the pyttsx engine for voicing the assistant's replies (more about pyttsx here).
After startup, check it with a simple command: say "Irina, hello!" into the microphone.

The options folder with the settings appears after Irina's first launch; you can adjust the settings there.

More step-by-step information about installation on Windows (especially Windows 7): docs/install_win.md

Solutions to some installation problems on Linux: docs/install_linux.md

Solutions to some installation problems on macOS: docs/install_mac.md

Principles of debugging installation problems: docs/install_debug.md

Bugs can be reported in Issues and discussed in the Telegram group.

Settings manager

Since version 9.0, a web-based settings manager built on Gradio is available.

To launch it, run runva_settings_manager.py from the root folder.

Installation through Docker

If you want to run everything through Docker: docs/install_docker.md (there are also Docker images for ARM (Raspberry Pi, etc.) from Ivan-Firefly)

If you want to run only the hard-to-install key components through Docker: docs/install_docker_comp.md

General logic

Every command begins with the assistant's name (configured in options/core.json; Irina by default). This prevents false triggers while the microphone is constantly listening. From here on, commands are described without the "Irina" prefix.
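The name-prefix check described above can be sketched as follows. This is a minimal illustration, not the project's own code: the function name extract_command is hypothetical, and it assumes the names arrive as a "|"-separated string, as in the voiceAssNames setting.

```python
from typing import Optional


def extract_command(recognized: str, names: str = "ирина|ирины|ирину") -> Optional[str]:
    """Return the text that follows the first assistant name, or None if no name was heard.

    `names` is a "|"-separated list of accepted names (an assumed format).
    """
    name_set = set(names.split("|"))
    # Lowercase and drop commas so "Ирина, привет" and "ирина привет" behave the same.
    words = recognized.lower().replace(",", " ").split()
    for i, word in enumerate(words):
        if word in name_set:
            return " ".join(words[i + 1:])
    return None


print(extract_command("Ирина, привет"))
print(extract_command("просто разговор"))
```

Phrases that never mention the assistant's name yield None and can simply be ignored, which is what keeps background speech from triggering commands.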

Support for local control of the MPC-HC player through its web interface is built into the core, so using this player is recommended. It can be configured in options/core.json.

Plugins

Plugin support is built on jaa.py, a minimalistic single-file engine for plugins and their settings.

Plugins are located in the plugins folder, and their file names must begin with the plugin_ prefix.

Plugin settings, if any, are stored in the options folder (created after the first launch).
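As an illustration of the naming convention above, here is a minimal sketch of prefix-based plugin discovery. It is not the jaa.py implementation; the function name find_plugin_modules and the default folder and prefix values are assumptions made for the example.

```python
from pathlib import Path
from typing import List


def find_plugin_modules(folder: str = "plugins", prefix: str = "plugin_") -> List[str]:
    """List module names of Python files in `folder` whose names start with `prefix`.

    Both defaults are assumptions for this sketch; pass your own values if they differ.
    """
    root = Path(folder)
    if not root.is_dir():
        return []
    # Only *.py files with the prefix count as plugins; everything else is ignored.
    return sorted(path.stem for path in root.glob(f"{prefix}*.py"))


print(find_plugin_modules())
```

With this kind of discovery, removing or renaming a file is enough to keep it from being picked up.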

Ready-made plugins/skills (already in the plugins folder)

For each plugin it is noted whether it requires online access. To disable a plugin, remove it from the plugins folder.

Full information: docs/plugins.md

Third-party plugins

If you want to:

find out what plugins from other developers exist
post a link to a plugin you have made

Visit: #1

Plugin manager

(Since version 10.0.0) To launch it, run runva_plugin_installer.py

Attention: the listed plugins are maintained by third-party developers, who may extend and change them! The author of Irina is not responsible for their maintenance!

For developers: if you want to add your plugin to this list for simplified installation, do the following:

Publish the plugin on GitHub.
Files named plugin_x.py must be in the repository root. There may be several.
If additional modules need to be installed, a requirements.txt file must be included.
Test the installation by running runva_plugin_installer, selecting item 0 (enter the address of the GitHub project with the plugin manually), and installing your plugin.
Finally, post your link in an issue or make a pull request that changes plugins_catalog.json, which contains links to known add-on plugins.

Example of a plugin layout: https://github.com/janvarev/irene_plugin_boltalka2_openai

Integration with Home Assistant

There is a good third-party plugin that allows you to launch Home Assistant scripts through Irina: https://github.com/timhok/ireneva-script-trigger-plugugin

Core settings (core.json)

Settings for specific plugins are best looked up in the plugins themselves.

{
    "contextDefaultDuration": 10, # Time in seconds that Irina stays in context (context is used in continuous chat, games, etc.; in context you do not need to say the word Irina)
    "contextRemoteWaitForCall": false, # whether Irina should wait for the client signal "Reply playback finished, start the context timer"
    # official clients support contextRemoteWaitForCall; true is recommended
    "fuzzyThreshold": 0.5, # (ADVANCED) Confidence threshold when using fuzzy command-recognition plugins
    "isOnline": true, # when set to false, commands of plugins that require online return a stub reply. Recommended if you only need offline.
    "linguaFrancaLang": "ru", # language used by lingua-franca for number conversion. Change it if you work with another language
    "logPolicy": "cmd", # all|cmd|none. When speech is recognized from the microphone, print it to the console: always | only if it is a command | never
    "mpcHcPath": "C:\\Program Files (x86)\\K-Lite Codec Pack\\MPC-HC64\\mpc-hc64_nvo.exe", # path to MPC-HC, if you use it
    "mpcIsUse": true, # is MPC-HC used?
    "mpcIsUseHttpRemote": true, # MPC-HC: is control through the web interface enabled?
    "playWavEngineId": "audioplayer", # plugin for playing WAV files. Some WAV files require sounddevice.
    "replyNoCommandFound": "Извини, я не поняла", # reply when the command is not understood
    "replyNoCommandFoundInContext": "Не поняла...", # reply when the command is not understood while in context
    "replyOnlineRequired": "Нужен онлайн", # reply when a function of an online-only plugin is called offline
    "tempDir": "temp", # folder for temporary files
    "ttsEngineId": "pyttsx", # TTS engine in use
    "ttsEngineId2": "", # second TTS engine. Works only for local voicing, for example of the clipboard. Invoked with the say2 command
    "useTTSCache": false, # when set to true, .wav files with replies generated by the TTS engine are cached in the tts_cache folder
    "v": "1.7", # version of the core plugin. Updated automatically, do not touch
    "voiceAssNames": "ирина|ирины|ирину", # If one of these appears in the audio stream, a command follows. (Various names for the assistant; several are recommended)
    "voiceAssNameRunCmd": { # if you address the assistant by this name, the corresponding word is prepended to your command
        "альбина": "чатгпт"
    }
}
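The voiceAssNameRunCmd mapping can be read as a simple lookup: if the assistant was addressed by a mapped name, the mapped word is placed in front of the command. A minimal sketch under that reading (the function name apply_name_prefix is hypothetical and this is not the core's actual code):

```python
from typing import Dict


def apply_name_prefix(name: str, command: str, run_cmd: Dict[str, str]) -> str:
    """Prepend the word mapped to `name`, if any, to the recognized command."""
    prefix = run_cmd.get(name.lower())
    return f"{prefix} {command}" if prefix else command


run_cmd = {"альбина": "чатгпт"}  # same shape as the voiceAssNameRunCmd setting
print(apply_name_prefix("альбина", "расскажи анекдот", run_cmd))
print(apply_name_prefix("ирина", "привет", run_cmd))
```

A name without an entry in the mapping leaves the command unchanged, so the ordinary names keep their usual behavior.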

Debugging and development (for developers)

For debugging, you can launch the system through runva_cmdline.py.

It drives the core (VACore in vacore.py) through a command-line interface, which is more convenient than dictating by voice.

You can connect your own skill by creating a plugin in the plugins folder. See the examples.
You can connect your own TTS engine with a plugin. For examples, see plugin_tts_console.py and plugin_tts_pyttsx.py.
Also, by creating your own runva_ file you can, if desired, connect a different speech-to-text engine.

Plugin development

Development documentation

Remote work (client-server, multi-microphone/multi-machine installations)

A multi-machine installation in client-server mode is somewhat more complicated, but lets you control Irina:

from several microphones
from different machines
from Telegram (using a Telegram bot)

Read more about client-server setup

Web API documentation

Speech-to-Text via Vosk Remote

If you have problems installing VOSK (for example, on macOS), you can work through the VOSK automatic speech recognition server, which runs in Docker.

Start the server: docker run -d -p 2700:2700 alphacep/kaldi-ru:latest (details: https://alphacephei.com/vosk/server)
- or, alternatively, run vosk_asr_server.py, adjusting the parameters inside:

    args.interface = os.environ.get('VOSK_SERVER_INTERFACE', "0.0.0.0")
    args.port = int(os.environ.get('VOSK_SERVER_PORT', 2700))

Run runva_voskrem.py. It reads data from the microphone and sends it to the server for recognition.

If recognition runs on another machine, use the -u (--uri) parameter to specify the server address: runva_voskrem.py -u=ws://100.100.100.100:2700
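For readers writing their own client, a -u/--uri option of this kind can be sketched with argparse. This is not the actual runva_voskrem.py code; the default address ws://localhost:2700 is an assumption chosen to match the port in the docker command above.

```python
import argparse
from typing import List, Optional


def parse_args(argv: Optional[List[str]] = None) -> argparse.Namespace:
    """Parse the server address for a remote speech-recognition client (illustrative only)."""
    parser = argparse.ArgumentParser(description="Remote recognition client (sketch)")
    parser.add_argument(
        "-u", "--uri",
        default="ws://localhost:2700",  # assumed default, not taken from the project
        help="WebSocket address of the recognition server",
    )
    return parser.parse_args(argv)


args = parse_args(["-u=ws://100.100.100.100:2700"])
print(args.uri)
```

argparse accepts both the -u=ADDRESS form used above and the space-separated -u ADDRESS form.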

Speech-to-Text via SpeechRecognition

SpeechRecognition is a classic library for running recognition through Google and a number of other services. To use it, start the system through runva_speechrecognition.py.

You will need:

pip install PyAudio

pip install SpeechRecognition

If there are problems installing PyAudio, read the details from EnjiRouz.

Feature: recognition of numerals. The same phrase is recognized differently:

VOSK: Timer ten seconds
SpeechRecognition (Google): Timer 10 seconds
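Because of this difference, code that reads a number from a command has to accept both spelled-out words and digits. A minimal sketch of that idea follows; the word table is a tiny English stand-in for illustration, and the helper name to_number is hypothetical. A real plugin would need the numerals of its own language.

```python
from typing import Optional

# Deliberately tiny, illustrative word table (an assumption for this example).
_NUMBER_WORDS = {
    "one": 1, "two": 2, "three": 3, "four": 4, "five": 5,
    "six": 6, "seven": 7, "eight": 8, "nine": 9, "ten": 10,
}


def to_number(token: str) -> Optional[int]:
    """Convert either a digit string ("10") or a known number word ("ten") to an int."""
    token = token.strip().lower()
    if token.isdecimal():
        return int(token)
    return _NUMBER_WORDS.get(token)


for phrase in ("timer ten seconds", "timer 10 seconds"):
    print(phrase, "->", to_number(phrase.split()[1]))
```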

Multilingual support

The project as a whole does not aim to support multiple languages, because the plugins use custom word parsing. Nevertheless, the core (vacore.py) is not tied to any language, and you can build your own installation for another language simply by rewriting the plugins for it.

The few language-specific phrases that define the core behavior of the voice assistant (its name, and phrases like "I did not understand") are configured in the core plugin's configuration file.

Fuzzy processing of phrases

Since version 7.5, fuzzy processing of user input is supported.

The recognition threshold is set by the global fuzzyThreshold parameter in core.json; it accepts values from 0 to 1 (1 means full confidence in the phrase).

Known plugins that work with this:

https://github.com/janvarev/irene_plugin_fuzzzy_thefuzz - fuzzy string comparison via thefuzz
https://github.com/modos189/irene_plugin_fuzzy_sklearn - via scikit-learn
https://github.com/janvarev/irene_plugin_fuzzy_ai_sentence - semantic string comparison with neural networks (sentence_transformers)
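To show how a threshold between 0 and 1 can gate fuzzy matches, here is a minimal sketch that uses difflib from the standard library as a stand-in for the libraries the plugins above rely on. The function name best_match is hypothetical, and the scores it produces will differ from those of the real plugins.

```python
from difflib import SequenceMatcher
from typing import Iterable, Optional, Tuple


def best_match(phrase: str, commands: Iterable[str],
               threshold: float = 0.5) -> Optional[Tuple[str, float]]:
    """Return the most similar command and its score, or None if nothing reaches the threshold."""
    best: Optional[Tuple[str, float]] = None
    for command in commands:
        # ratio() is a similarity score in [0, 1]; 1.0 means the strings are identical.
        score = SequenceMatcher(None, phrase.lower(), command.lower()).ratio()
        if score >= threshold and (best is None or score > best[1]):
            best = (command, score)
    return best


print(best_match("приветт", ["привет", "таймер"]))
print(best_match("приветт", ["привет", "таймер"], threshold=1.0))
```

Raising the threshold makes matching stricter: at 1.0 only an exact phrase is accepted, while lower values tolerate recognition errors at the cost of more false matches.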

Plugins from the Vasisualy voice assistant

Since version 8.1, test-mode support for core plugins from the Vasisualy voice assistant has been available: https://github.com/oknolaz/vasisualy

To add them:

Place the plugins in plugins_vasi/skills (take them from https://github.com/oknolaz/vasisualy/tree/master/vasisualy/skills)
Each plugin is expected to define triggers in its module, from which the list of commands is built. If it does not, the plugin must be modified.

It works in the simplest cases; it has been tested on the coin and Crystall_ball plugins.

If it doesn't work, read the code. Support is implemented in the plugin_vasi.py plugin.

Contributing

If you want to add something to the project, please familiarize yourself with the Contributing.md policy first.

In short:

For plugins, it is best to create separate GitHub projects (or host them elsewhere) that you are prepared to maintain. Links can be posted in #1 so that others can find your plugin. Please do not add extra plugins to this project: I have neither the time nor the energy to maintain what I do not understand.
Make targeted changes that improve functionality or fix bugs (for example, failure to work under certain conditions). Such pull requests will very likely be accepted.
Mass code changes (unifying the code style, reorganizing imports) will not be considered and will be rejected. Please do not make them.

Acknowledgements

@Enjirouz for the voice assistant project https://github.com/enjirouz/voice-SSistant-App, which became the basis for this one (although it has been heavily reworked)

AlphaCephei for the excellent VOSK recognition library (https://alphacephei.com/vosk/index.ru)

Project support

The main difficulty in open source is not writing code. Writing code is interesting.

The difficulty in open source is maintaining the code and supporting users over a long time.

Answering questions. Fixing bugs. Writing articles and documentation.

If you want to support my interest and help make Irina a voice assistant independent of large companies, you can:

Write a new plugin (that always makes me happy!)
Donate through a subscription at https://boosty.to/irene-Voice. The more subscribers there are, the better I understand that the project is needed.
Tell someone about Irina, or help them set her up.
Just say "thank you" in this thread: #12

Additional Information

Version v8.1
Type AI Source Code
Update Time 2025-08-23
size 90.68MB
From Github
