共 30 条
- [1] Bak T., 2023, P AAAI C ART INT AAA, p12 562
- [2] LightVoc: An Upsampling-Free GAN Vocoder Based On Conformer And Inverse Short-time Fourier Transform [J]. INTERSPEECH 2023, 2023, : 3043 - 3047
- [3] SingGAN: Generative Adversarial NetWork For High-Fidelity Singing Voice Generation [J]. PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 2525 - 2535
- [4] Ito K., 2017, LJSPEECH DATASET
- [6] UnivNet: A Neural Vocoder with Multi-Resolution Spectrogram Discriminators for High-FidelityWaveform Generation [J]. INTERSPEECH 2021, 2021, : 2207 - 2211
- [7] iSTFTNet2: Faster and More Lightweight iSTFT-Based Neural Vocoder Using 1D-2D CNN [J]. INTERSPEECH 2023, 2023, : 4369 - 4373
- [8] Karras T, 2021, ADV NEUR IN, V34
- [9] Fre-GAN: Adversarial Frequency-consistent Audio Synthesis [J]. INTERSPEECH 2021, 2021, : 2197 - 2201
- [10] SpecGrad: Diffusion Probabilistic Model based Neural Vocoder with Adaptive Noise Spectral Shaping [J]. INTERSPEECH 2022, 2022, : 803 - 807