Speech technologies for Russian
Communication
- https://t.me/speech_recognition_ru - group "Speech recognition"
- https://t.me/speech_recognition - group in English
- https://t.me/speechtech - news channel
- https://t.me/betterdatacommunity/15 - Speech community in Datacommunity
- https://t.me/voice_stuff https://t.me/voice_stuff_chat - frappucino's space
- https://t.me/teraspace https://t.me/teraspace_chat - tera's space
Courses
- https://github.com/markovka17/dla
- https://github.com/yandexdataschool/speech_course
- https://github.com/severilov/dl-audio-course
- https://huggingface.co/learn/audio-course/en/chapter0/introduction - hands-on with Hugging Face audio models
- https://www.youtube.com/playlist?list=plyg3whdp5cwvrxljxzblqiqtwy_qjkmz - Deep Learning for Audio
Data for training
- https://github.com/salute-developers/golos
- https://github.com/snakers4/open_stt
- https://github.com/georgefedoseev/deepspeech
- https://github.com/sovaai/sova-dataset
- https://www.openslr.org/96/ - Russian LibriSpeech
- https://commonvoice.mozilla.org/ru/datasets - Mozilla Common Voice (MCV)
Speech synthesis
- https://www.caito.de/2019/01/03/the-m-ailabs-speech-dataset/ - M-AILABS dataset (from LibriVox)
- https://ruslan-corpus.github.io/
- https://github.com/sovaai/sova-tts
- https://huggingface.co/bene-ges/tts_ru_hifigan_ruslan
- https://github.com/alphacep/vosk-tts
- https://github.com/rhvoice
- https://github.com/snakers4/silero-models#text-to-speech
- https://github.com/tera2space/teratts
- https://huggingface.co/mogr/xts-ru-ipa
Voice transformations
- https://www.weights.gg/ru - a large collection of RVC models
- https://2ch-ai.gitgud.site/wiki/speech/ - school and dual
- https://lunaiproject.uwu.ai/ - Russian DiffSinger
- there are also many Telegram channels, mostly of a shady nature
General text processing for synthesis
- https://github.com/sovaai/sova-tts-tps
- https://github.com/snakers4/silero-models#text-enhancement
- https://github.com/snakers4/russian_stt_text_normalization
- https://www.kaggle.com/competitions/text-normalization-challenge-russian-language/overview - old Kaggle competition
- https://github.com/ppleskov/text-normalization-challenge-russian-language - one of the winners
- https://github.com/shigabeev/russian_tts_normalization
- https://github.com/saarus72/text_normalization/tree/dev - based on FRED-T5
- https://github.com/den4ika/runorm - numbers in text, handling of English words, expansion of abbreviations
- https://github.com/just-a/multilingual-text-parser
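
To illustrate what the tools above do, here is a minimal sketch of rule-based text normalization before synthesis. It is not taken from any of the listed projects: the `ABBREVIATIONS` and `DIGITS` tables and the `normalize` function are hypothetical, and the sketch ignores case and gender agreement, which real normalizers must handle.

```python
# Hypothetical toy lexicons; real normalizers rely on much larger
# dictionaries or on neural models.
ABBREVIATIONS = {"г.": "город", "ул.": "улица", "д.": "дом"}
DIGITS = {
    "0": "ноль", "1": "один", "2": "два", "3": "три", "4": "четыре",
    "5": "пять", "6": "шесть", "7": "семь", "8": "восемь", "9": "девять",
}


def normalize(text: str) -> str:
    """Expand known abbreviations and spell out standalone single digits.

    Tokens are split on whitespace; anything not found in the toy
    lexicons is passed through unchanged.
    """
    out = []
    for token in text.split():
        if token in ABBREVIATIONS:
            out.append(ABBREVIATIONS[token])
        elif token in DIGITS:
            out.append(DIGITS[token])
        else:
            out.append(token)
    return " ".join(out)


print(normalize("г. Москва, ул. Ленина, д. 5"))
```

Multi-digit numbers, dates, currencies, and inflected forms are exactly the cases where such a lookup breaks down, which is why the listed projects use morphology-aware rules or sequence-to-sequence models.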
Stress placement, phonetic dictionaries, and G2P
- https://github.com/reynoldsnlp/udar
- https://github.com/einhornus/russian_accentuation
- https://github.com/wilpert/rusphonetizer
- https://huggingface.co/bene-ges/ru_g2p_ipa_bert_large
- https://github.com/desklop/stressrnn
- https://github.com/nsu-ai/russian_g2p
- https://github.com/nsu-ai-team/russian_g2p_neuro
- https://github.com/suralmasha/rutranscript
- https://github.com/mashapo/russtress
- https://huggingface.co/ilyagusev/ru-word-stress-transformer
- https://github.com/aishutin/rustress
- https://github.com/koziev/stressmodel
- https://github.com/mogr/omogre
- https://github.com/den4ika/ruaccent - yo-fication (restoring ё), stress placement, and homograph resolution
Dictionaries
- https://github.com/reynoldsnlp/udar/blob/src/src/resources/src/tixonov.txt - Tikhonov's Morphemic-Orthographic Dictionary
- http://aot.ru - the source of Zaliznyak's dictionary in machine-readable format
- https://github.com/gramdict/gramdict - a modern version
- http://odict.ru/ - another development
- http://opencorpora.org/ - annotated morphological dictionary
- https://ru.wiktionary.org - Wiktionary
- https://kaikki.org/dictionary/russian/ - Wiktionary dump in a convenient format
Yo-fication (restoring ё)
- https://github.com/sovaai/sova-tts-tps
- https://github.com/e2yo/eyo-kernel
- https://github.com/kalashnikovisme/karamzin
- https://github.com/text-extenD-tools/python-yoficator
- https://github.com/emacsmirror/yoficator
- https://github.com/unabashed/yoficator
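
The idea behind the tools above can be sketched with a dictionary lookup. This is only an illustration, not the algorithm of any listed project: the `SAFE_YO` table and the `yoficate` function are hypothetical, and the sketch deliberately skips ambiguous words such as "все"/"всё", which need context to resolve.

```python
import re

# Hypothetical mini-dictionary of words in which "ё" is unambiguous.
SAFE_YO = {
    "еж": "ёж",
    "елка": "ёлка",
    "ее": "её",
    "еще": "ещё",
    "зеленый": "зелёный",
}

# Runs of Cyrillic letters; IGNORECASE also covers the uppercase forms.
WORD_RE = re.compile(r"[а-яё]+", re.IGNORECASE)


def yoficate(text: str) -> str:
    """Restore "ё" in words listed in SAFE_YO, leaving other words as-is.

    Only lowercase and capitalized words keep their casing; an all-caps
    word would come back capitalized, which a real tool should handle.
    """
    def replace(match: re.Match) -> str:
        word = match.group(0)
        fixed = SAFE_YO.get(word.lower())
        if fixed is None:
            return word
        if word[0].isupper():
            return fixed[0].upper() + fixed[1:]
        return fixed

    return WORD_RE.sub(replace, text)


print(yoficate("Еще одна елка"))
```

Keeping ambiguous words out of the dictionary trades recall for safety: the output never introduces a wrong "ё", but some words stay unrestored.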
Recognition of emotions
- https://github.com/aniemore/aniemore
- https://huggingface.co/xbgoose/hubert-large-speech-emotion-recognition-russian-dusha-finetuned
- https://github.com/salute-developers/golos/tree/master/dusha
Speech recognition models
Comparison of models here.
- Vosk Small https://alphacephei.com/vosk/models/vosk-model-small-ru-0.22.zip
- Vosk Big 0.22 https://alphacephei.com/vosk/models/vosk-model-ru-0.22.zip
- Vosk Big 0.42 https://alphacephei.com/vosk/models/vosk-model-ru-0.42.zip
- NVIDIA RNN-T Large https://huggingface.co/nvidia/stt_ru_conformer_transducer_large
- Whisper Medium https://github.com/openai/whisper
- Whisper Adapted Medium https://huggingface.co/mitcheldehaven/whisper-medium-ru
- Whisper Adapted Large https://huggingface.co/mitcheldehaven/whisper-large-v2-u2-
- Wav2Vec2 LM https://huggingface.co/jonatasgrosman/wav2vec2-xls-r-1b-russian
- Wav2Vec2 LM bond005 https://huggingface.co/bond005/wav2vec2-large-ru-golos (version 03.2023)
- Salute Citrinet https://github.com/salute-developers/golos
- FunASR Russian https://modelscope.cn/models/damo/speech_uniasr_asr_2pass-6k-common-vocab1664-tensorflow1-offline/summary
Not tested (lower quality)
- https://github.com/sovse/base_rus_whisper_stt
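
Speech recognition models such as those listed above are commonly compared by word error rate (WER). Whether the comparison mentioned in this section uses exactly this metric is an assumption. The sketch below computes WER as the word-level edit distance divided by the reference length; the function name `word_error_rate` is hypothetical, and no text normalization (lowercasing, punctuation removal) is applied.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Return (substitutions + deletions + insertions) / reference length.

    Both strings are split on whitespace and compared word by word using
    the standard dynamic-programming edit distance.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the distance between the processed reference prefix
    # and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,         # deletion of a reference word
                curr[j - 1] + 1,     # insertion of a hypothesis word
                prev[j - 1] + cost,  # match or substitution
            ))
        prev = curr
    return prev[-1] / len(ref)


print(word_error_rate("мама мыла раму", "мама мыла"))
```

In practice both the reference and the recognizer output are usually lowercased and stripped of punctuation first, since otherwise models that emit punctuation are penalized unfairly.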
Linguistics (word lists, morphology)
- http://aot.ru
- https://natasha.github.io
Punctuation and capitalization
- https://alphacephei.com/vosk/models/vosk-recasepunc-en-0.22.zip
- https://huggingface.co/kontur-a/sbert_punc_case_ru
- https://github.com/kotikkontantin/ru-upopunction
- https://github.com/vlomme/bert-russian-punctuation
- https://github.com/lesha17/punctuation
- https://github.com/gleb-skobinsky/ru_punct
- https://github.com/sviperm/neuro-comma
- https://github.com/snakers4/silero-models
- https://github.com/marlon-br/neuro-comma
- https://github.com/averkij/multipunct
- https://github.com/denis-berezutskiy-lad/transcription-bert-punctuator-scripts (Hugging Face)
- https://huggingface.co/ai-forever/sage-fredt5-distilled-95m - the SAGE family of models
History
- A. A. Zaliznyak, Grammatical Dictionary
- OTiPL - the department and division of theoretical and applied linguistics of the Faculty of Philology, Moscow State University
- Lobanov Boris Mefodievich
- IPPI - Sorokin Viktor Nikolaevich
- IPA RAS History
- Digital processing and recognition of speech signals at the Computing Center of the RAS
Alphacep
- 2005: work began on the Festival synthesizer
- Festlang Clunits
- Russian language on VoxForge
- CMU Sphinx