JenGAN: Stacked Shifted Filters in GAN-Based Speech Synthesis

被引:0
|
作者
Cho, Hyunjae [1 ]
Lee, Junhyeok [2 ]
Jung, Wonbin [3 ]
机构
[1] Seoul Natl Univ SNU, Seoul, South Korea
[2] Supertone Inc, Seoul, South Korea
[3] Korea Adv Inst Sci & Technol KAIST, Daejeon, South Korea
来源
INTERSPEECH 2024 | 2024年
关键词
speech synthesis; vocoder; alias-free; GAN; shift-equivariant;
D O I
10.21437/Interspeech.2024-1447
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Non-autoregressive GAN-based neural vocoders are widely used due to their fast inference speed and high perceptual quality. However, they often suffer from audible artifacts such as tonal artifacts in their generated results. Therefore, we propose JenGAN, a new training strategy that involves stacking shifted low-pass filters to ensure the shift-equivariant property. This method helps prevent aliasing and reduce artifacts while preserving the model structure used during inference. In our experimental evaluation, JenGAN consistently enhances the performance of vocoder models, yielding significantly superior scores across the majority of evaluation metrics.
引用
收藏
页码:3879 / 3883
页数:5
相关论文
共 50 条
  • [11] Controlled synthesis of GaN-based nanowires for photoelectrochemical water splitting applications
    Ebaid, Mohamed
    Kang, Jin-Ho
    Ryu, Sang-Wan
    SEMICONDUCTOR SCIENCE AND TECHNOLOGY, 2017, 32 (01)
  • [12] GAN acoustic model for Kazakh speech synthesis
    Kaliyev, Arman
    Zeno, Bassel
    Rybin, Sergey V.
    Matveev, Yuri N.
    Lyakso, Elena E.
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2021, 24 (03) : 729 - 735
  • [13] Backdoor Attack is a Devil in Federated GAN-Based Medical Image Synthesis
    Jin, Ruinan
    Li, Xiaoxiao
    SIMULATION AND SYNTHESIS IN MEDICAL IMAGING, SASHIMI 2022, 2022, 13570 : 154 - 165
  • [14] Progress on the GaN-based LEDs and LDs
    Wang, Yong
    Zou, Yonggang
    Ma, Xiaohui
    Yu, Naisen
    Deng, Dongmei
    Lau, Kei May
    2012 INTERNATIONAL CONFERENCE ON OPTOELECTRONICS AND MICROELECTRONICS (ICOM), 2012, : 105 - 110
  • [15] Microscopic modeling of GaN-based heterostructures
    Sacconi, F
    Della Sala, F
    Di Carlo, A
    Lugli, P
    VLSI DESIGN, 2001, 13 (1-4) : 387 - 391
  • [16] GaN-based optoelectronics on silicon substrates
    Krost, A
    Dadgar, A
    MATERIALS SCIENCE AND ENGINEERING B-SOLID STATE MATERIALS FOR ADVANCED TECHNOLOGY, 2002, 93 (1-3): : 77 - 84
  • [17] GAN-Based Prediction of Time Series
    Festag, Sven
    Spreckelsen, Cord
    PUBLIC HEALTH AND INFORMATICS, PROCEEDINGS OF MIE 2021, 2021, 281 : 500 - 501
  • [18] GAN-Based LiDAR Intensity Simulation
    Marcus, Richard
    Gabel, Felix
    Knoop, Niklas
    Stamminger, Marc
    DEEP LEARNING THEORY AND APPLICATIONS, DELTA 2023, 2023, 1875 : 419 - 433
  • [19] Concept of the CW GaN-based VECSEL
    Sokol, Adam K.
    Piskorski, Lukasz
    Kuc, Maciej
    Wasiak, Michal
    Sarzala, Robert P.
    VERTICAL EXTERNAL CAVITY SURFACE EMITTING LASERS (VECSELS) VIII, 2018, 10515
  • [20] Generation-recombination noise in GaN and GaN-based devices
    Pala, N
    Rumyantsev, SL
    Shur, MS
    Levinshtein, ME
    Khan, MA
    Simin, G
    Gaska, R
    NOISE IN DEVICES AND CIRCUITS, 2003, 5113 : 217 - 231