AutoVocoder

Version 1.0.0

Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing

Unofficial PyTorch implementation of Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing. This repository is based on the iSTFTNet GitHub repository (paper).

Disclaimer: This repo is built for testing purposes.

Training:

 python train.py --config config.json

In train.py, change --input_wavs_dir to the LJSpeech-1.1/wavs directory.
In config.json, change latent_dim to select AV128, AV192, or AV256 (default).
Following Section 3.3 of the paper, you can set dec_istft_input to cartesian (default), polar, or both.
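The two config changes above can be scripted when you want to train several variants without editing config.json by hand. The following is a minimal sketch, not part of the repository: `make_config_variant` is a hypothetical helper, and it assumes that `latent_dim` and `dec_istft_input` are both top-level keys in config.json and that AV128, AV192, and AV256 correspond to `latent_dim` values of 128, 192, and 256.

```python
import json
from pathlib import Path

# Assumption: the model names map directly to these latent_dim values.
LATENT_DIMS = {"AV128": 128, "AV192": 192, "AV256": 256}
# Option names as listed in this README.
ISTFT_INPUTS = ("cartesian", "polar", "both")


def make_config_variant(base_path, out_path, model="AV256", istft_input="cartesian"):
    """Copy a JSON config, overriding latent_dim and dec_istft_input.

    Hypothetical helper: assumes both keys live at the top level of the file.
    All other keys are copied unchanged.
    """
    if model not in LATENT_DIMS:
        raise ValueError(f"unknown model {model!r}; expected one of {sorted(LATENT_DIMS)}")
    if istft_input not in ISTFT_INPUTS:
        raise ValueError(f"unknown dec_istft_input {istft_input!r}; expected one of {ISTFT_INPUTS}")

    config = json.loads(Path(base_path).read_text(encoding="utf-8"))
    config["latent_dim"] = LATENT_DIMS[model]
    config["dec_istft_input"] = istft_input

    # Write the variant to a separate file so the base config stays untouched.
    Path(out_path).write_text(json.dumps(config, indent=4) + "\n", encoding="utf-8")
    return config
```

For example, `make_config_variant("config.json", "config_av128_polar.json", model="AV128", istft_input="polar")` writes a new file that can then be passed to training with `python train.py --config config_av128_polar.json`.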

Note:

Validation of AV256 during training.
In our test, it converges almost 3 times faster than HiFi-V1 (referring to the official repo).

Citation:

 @article{Webber2022AutovocoderFW,
  title={Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing},
  author={Jacob J. Webber and Cassia Valentini-Botinhao and Evelyn Williams and Gustav Eje Henter and Simon King},
  journal={ArXiv},
  year={2022},
  volume={abs/2211.06989}
}