iSTFT Avocodo pytorch
1.0.0
ISTFT的發電機和鱷梨的歧視器。與原始的hifi-gan相比,使用faster分支進行超快速訓練和推理。 

python train.py --config config_v1.json
faster分支在訓練中的速度為50%(0.33 s/b),推理速度提高了60%。 @inproceedings{kaneko2022istftnet,
title={{iSTFTNet}: Fast and Lightweight Mel-Spectrogram Vocoder Incorporating Inverse Short-Time Fourier Transform},
author={Takuhiro Kaneko and Kou Tanaka and Hirokazu Kameoka and Shogo Seki},
booktitle={ICASSP},
year={2022},
}
@misc{https://doi.org/10.48550/arxiv.2206.13404,
doi = {10.48550/ARXIV.2206.13404},
url = {https://arxiv.org/abs/2206.13404},
author = {Bak, Taejun and Lee, Junmo and Bae, Hanbin and Yang, Jinhyeok and Bae, Jae-Sung and Joo, Young-Sun},
keywords = {Audio and Speech Processing (eess.AS), Artificial Intelligence (cs.AI), Sound (cs.SD), FOS: Electrical engineering, electronic engineering, information engineering, FOS: Electrical engineering, electronic engineering, information engineering, FOS: Computer and information sciences, FOS: Computer and information sciences},
title = {Avocodo: Generative Adversarial Network for Artifact-free Vocoder},
publisher = {arXiv},
year = {2022},
copyright = {arXiv.org perpetual, non-exclusive license}
}