JenGAN: Stacked Shifted Filters in GAN-Based Speech Synthesis

被引:0
|
作者
Cho, Hyunjae [1 ]
Lee, Junhyeok [2 ]
Jung, Wonbin [3 ]
机构
[1] Seoul Natl Univ SNU, Seoul, South Korea
[2] Supertone Inc, Seoul, South Korea
[3] Korea Adv Inst Sci & Technol KAIST, Daejeon, South Korea
来源
INTERSPEECH 2024 | 2024年
关键词
speech synthesis; vocoder; alias-free; GAN; shift-equivariant;
D O I
10.21437/Interspeech.2024-1447
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Non-autoregressive GAN-based neural vocoders are widely used due to their fast inference speed and high perceptual quality. However, they often suffer from audible artifacts such as tonal artifacts in their generated results. Therefore, we propose JenGAN, a new training strategy that involves stacking shifted low-pass filters to ensure the shift-equivariant property. This method helps prevent aliasing and reduce artifacts while preserving the model structure used during inference. In our experimental evaluation, JenGAN consistently enhances the performance of vocoder models, yielding significantly superior scores across the majority of evaluation metrics.
引用
收藏
页码:3879 / 3883
页数:5
相关论文
共 50 条
  • [21] A study on GaN-based betavoltaic batteries
    Toprak, A.
    Yilmaz, D.
    Ozbay, E.
    SEMICONDUCTOR SCIENCE AND TECHNOLOGY, 2022, 37 (12)
  • [22] GAN-Based Fusion Adversarial Training
    Cao, Yifan
    Lin, Ying
    Ning, Shengfu
    Pi, Huan
    Zhang, Junyuan
    Hu, Jianpeng
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, KSEM 2022, PT III, 2022, 13370 : 51 - 64
  • [23] Progress in GaN-based materials and optical devices
    Melngailis, I
    ADVANCED OPTICAL DEVICES, TECHNOLOGIES, AND MEDICAL APPLICATIONS, 2002, 5123 : 231 - 237
  • [24] GaN-Based GAA Vertical CMOS Inverter
    Liu, Xinke
    Yang, Jiaying
    Li, Jian
    Lin, Feng
    Li, Bo
    Zhang, Ziyue
    He, Wei
    Huang, Mark
    IEEE JOURNAL OF THE ELECTRON DEVICES SOCIETY, 2022, 10 : 224 - 228
  • [25] Loss Functions for GAN-based Style Transfer
    Bui, Nhat-Tan
    Nguyen, Hai-Dang
    Bui-Huynh, Trung-Nam
    Nguyen, Ngoc-Thao
    Cao, Xuan-Nam
    FIFTEENTH INTERNATIONAL CONFERENCE ON MACHINE VISION, ICMV 2022, 2023, 12701
  • [26] A Survey on GaN-Based Devices for Terahertz Photonics
    Ahi, Kiarash
    Anwar, Mehdi
    WIDE BANDGAP POWER DEVICES AND APPLICATIONS, 2016, 9957
  • [27] Design of GaN-based VCSEL with high performance
    Jasim, Farah Z.
    Abdul-Razzak, Mohammed J.
    Ahmed, Hisham M.
    OPTOELECTRONICS AND ADVANCED MATERIALS-RAPID COMMUNICATIONS, 2014, 8 (1-2): : 7 - 9
  • [28] GaN-based PIN alpha particle detectors
    Wang, Guo
    Fu, Kai
    Yao, Chang-sheng
    Su, Dan
    Zhang, Guo-guang
    Wang, Jin-yan
    Lu, Min
    NUCLEAR INSTRUMENTS & METHODS IN PHYSICS RESEARCH SECTION A-ACCELERATORS SPECTROMETERS DETECTORS AND ASSOCIATED EQUIPMENT, 2012, 663 (01) : 10 - 13
  • [29] Status of GaN-based Power Switching Devices
    Hikita, Masahiro
    Ueno, Hiroaki
    Matsuo, Hisayoshi
    Ueda, Tetsuzo
    Uemoto, Yasuhiro
    Inoue, Kaoru
    Tanaka, Tsuyoshi
    Ueda, Daisuke
    SILICON CARBIDE AND RELATED MATERIALS 2007, PTS 1 AND 2, 2009, 600-603 : 1257 - 1262
  • [30] GaN-based LEDs with Ar plasma treatment
    Kuo, D. S.
    Lam, K. T.
    Wen, K. H.
    Chang, S. J.
    Ko, T. K.
    Hon, S. J.
    MATERIALS SCIENCE IN SEMICONDUCTOR PROCESSING, 2012, 15 (01) : 52 - 55