CycleGAN-based speech enhancement for the unpaired training data

Cited by: 0
Authors
Yuan, Jing [1]
Bao, Changchun [1]
Affiliation
[1] Beijing Univ Technol, Beijing, Peoples R China
Source
2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC) | 2019
Funding
National Natural Science Foundation of China
DOI
10.1109/apsipaasc47483.2019.9023072
Chinese Library Classification (CLC) number
TP31 [Computer software]
Discipline classification code
081202; 0835
Abstract
Speech enhancement is an important task that aims to improve speech quality in noisy conditions. Many speech enhancement methods have achieved remarkable success when trained on paired data. For many tasks, however, paired training data are not available. In this paper, we present a speech enhancement method for unpaired data based on a cycle-consistent generative adversarial network (CycleGAN), which is trained to minimize the reconstruction loss as much as possible. The proposed model employs two generators and two discriminators to preserve speech components and reduce noise, so that the network maps features better for unseen noise. In this method, the generators produce the enhanced speech, and the two discriminators distinguish real inputs from the outputs of the generators. Experimental results show that the proposed method effectively improves performance compared with a traditional deep neural network (DNN) and recent GAN-based speech enhancement methods.
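The abstract does not give the loss functions, features, or network architectures, so the following is only a minimal sketch of how a cycle-consistent objective for unpaired noisy and clean features is commonly formed. It assumes the general CycleGAN formulation (an L1 reconstruction term plus a least-squares adversarial term) rather than the authors' exact method. All names here (`toy_denoiser`, `toy_noiser`, `toy_clean_discriminator`, `CYCLE_WEIGHT`) are hypothetical stand-ins, and plain Python lists replace real spectral features and neural networks.

```python
from typing import Callable, List

Vector = List[float]
Generator = Callable[[Vector], Vector]
Discriminator = Callable[[Vector], float]

# Assumed weight on the reconstruction term; not taken from the paper.
CYCLE_WEIGHT = 10.0


def l1_distance(a: Vector, b: Vector) -> float:
    """Mean absolute difference between two equal-length feature vectors."""
    return sum(abs(x - y) for x, y in zip(a, b)) / len(a)


def cycle_consistency_loss(noisy_to_clean: Generator, clean_to_noisy: Generator,
                           noisy: Vector, clean: Vector) -> float:
    """Reconstruction loss over both cycles: noisy->clean->noisy and clean->noisy->clean."""
    forward = l1_distance(clean_to_noisy(noisy_to_clean(noisy)), noisy)
    backward = l1_distance(noisy_to_clean(clean_to_noisy(clean)), clean)
    return forward + backward


def generator_adversarial_loss(discriminator: Discriminator, fake: Vector) -> float:
    """Least-squares loss: the generator wants its output scored as real (1.0)."""
    return (discriminator(fake) - 1.0) ** 2


def discriminator_loss(discriminator: Discriminator, real: Vector, fake: Vector) -> float:
    """Least-squares loss: score real inputs as 1.0 and generated ones as 0.0."""
    return 0.5 * ((discriminator(real) - 1.0) ** 2 + discriminator(fake) ** 2)


# Toy stand-ins for the two generators and one of the two discriminators.
def toy_denoiser(x: Vector) -> Vector:
    return [v - 0.5 for v in x]  # pretends noise is a constant offset


def toy_noiser(x: Vector) -> Vector:
    return [v + 0.5 for v in x]  # exact inverse of toy_denoiser


def toy_clean_discriminator(x: Vector) -> float:
    return 1.0 if sum(x) / len(x) <= 1.0 else 0.0  # "looks clean" score


# Unpaired samples: the two vectors need not come from the same utterance.
noisy_sample = [1.5, 2.5, 0.5]
clean_sample = [1.0, 2.0, 0.0]

cycle = cycle_consistency_loss(toy_denoiser, toy_noiser, noisy_sample, clean_sample)
adversarial = generator_adversarial_loss(toy_clean_discriminator, toy_denoiser(noisy_sample))
generator_objective = adversarial + CYCLE_WEIGHT * cycle
```

Because the toy generators are exact inverses of each other, both cycles reconstruct their inputs and the reconstruction term vanishes. With learned generators the term is positive, and it penalizes mappings that discard speech content which the reverse generator cannot recover.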
Pages: 878-883
Page count: 6