iSTFT Avocodo pytorch
1.0.0
Le générateur d'ISTFT et les discriminateurs d'Avocodo. Utilisez une branche faster pour une formation et une inférence ultra rapides par rapport à hifi-gan d'origine. 

python train.py --config config_v1.json
faster de ce dépôt est de 50% de vitesse (0,33 s / b) en formation et une amélioration de 60% de vitesse d'inférence. @inproceedings{kaneko2022istftnet,
title={{iSTFTNet}: Fast and Lightweight Mel-Spectrogram Vocoder Incorporating Inverse Short-Time Fourier Transform},
author={Takuhiro Kaneko and Kou Tanaka and Hirokazu Kameoka and Shogo Seki},
booktitle={ICASSP},
year={2022},
}
@misc{https://doi.org/10.48550/arxiv.2206.13404,
doi = {10.48550/ARXIV.2206.13404},
url = {https://arxiv.org/abs/2206.13404},
author = {Bak, Taejun and Lee, Junmo and Bae, Hanbin and Yang, Jinhyeok and Bae, Jae-Sung and Joo, Young-Sun},
keywords = {Audio and Speech Processing (eess.AS), Artificial Intelligence (cs.AI), Sound (cs.SD), FOS: Electrical engineering, electronic engineering, information engineering, FOS: Electrical engineering, electronic engineering, information engineering, FOS: Computer and information sciences, FOS: Computer and information sciences},
title = {Avocodo: Generative Adversarial Network for Artifact-free Vocoder},
publisher = {arXiv},
year = {2022},
copyright = {arXiv.org perpetual, non-exclusive license}
}