CycleGAN-based speech enhancement for the unpaired training data

被引:0
作者
Yuan, Jing [1 ]
Bao, Changchun [1 ]
机构
[1] Beijing Univ Technol, Beijing, Peoples R China
来源
2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC) | 2019年
基金
中国国家自然科学基金;
关键词
D O I
10.1109/apsipaasc47483.2019.9023072
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Speech enhancement is an important task of improving speech quality in noise scenario. Many speech enhancement methods have achieved remarkable success based on the paired data. However, for many tasks, the paired training data is not available. In this paper, we present a speech enhancement method for the unpaired data based on cycle-consistent generative adversarial network (CycleGAN) that can minimize the reconstruction loss as much as possible. The proposed model employs two discriminators and two generators to preserve speech components and reduce noise so that the network could map features better for the unseen noise. In this method, the generators are used to generate the enhanced speech, and two discriminators are employed to discriminate real inputs and the outputs of the generators. The experimental results showed that the proposed method effectively improved the performance compared to traditional deep neural network (DNN) and the recent GAN-based speech enhancement methods.
引用
收藏
页码:878 / 883
页数:6
相关论文
共 50 条
[41]   Improved CycleGAN-based feature recognition in young children and preschool education research [J].
Han J. ;
Yuchi X. ;
Su Y. .
Applied Mathematics and Nonlinear Sciences, 2024, 9 (01)
[42]   CYCLETRANSGAN-EVC: A cyclegan-based emotional voice conversion model with transformer [J].
Fu, Changzeng ;
Liu, Chaoran ;
Ishi, Carlos Toshinori ;
Ishiguro, Hiroshi .
arXiv, 2021,
[43]   CycleGAN-based realistic image dataset generation for forward-looking sonar [J].
Liu, Dingyu ;
Wang, Yusheng ;
Ji, Yonghoon ;
Tsuchiya, Hiroshi ;
Yamashita, Atsushi ;
Asama, Hajime .
ADVANCED ROBOTICS, 2021, 35 (3-4) :242-254
[44]   Cross-Modality Image Synthesis from Unpaired Data Using CycleGAN Effects of Gradient Consistency Loss and Training Data Size [J].
Hiasa, Yuta ;
Otake, Yoshito ;
Takao, Masaki ;
Matsuoka, Takumi ;
Takashima, Kazuma ;
Carass, Aaron ;
Prince, Jerry L. ;
Sugano, Nobuhiko ;
Sato, Yoshinobu .
SIMULATION AND SYNTHESIS IN MEDICAL IMAGING, 2018, 11037 :31-41
[45]   SM-CycleGAN: crop image data enhancement method based on self-attention mechanism CycleGAN [J].
Liu, Dian ;
Cao, Yang ;
Yang, Jing ;
Wei, Jianyu ;
Zhang, Jili ;
Rao, Chenglin ;
Wu, Banghong ;
Zhang, Dabin .
SCIENTIFIC REPORTS, 2024, 14 (01)
[46]   Augmented CycleGAN: Learning Many-to-Many Mappings from Unpaired Data [J].
Almahairi, Amjad ;
Rajeswar, Sai ;
Sordoni, Alessandro ;
Bachman, Philip ;
Courville, Aaron .
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80
[47]   Speech Intelligibility Enhancement By Non-Parallel Speech Style Conversion Using CWT and iMetricGAN Based CycleGAN [J].
Xiao, Jing ;
Liu, Jiaqi ;
Li, Dengshi ;
Zhao, Lanxin ;
Wang, Qianrui .
MULTIMEDIA MODELING (MMM 2022), PT I, 2022, 13141 :544-556
[48]   UHA-CycleGAN: Unpaired hybrid attention network based on CycleGAN for terahertz image super-resolution [J].
Liu, Huanyu ;
Guo, Haipeng ;
Liu, Xiaodong .
IET IMAGE PROCESSING, 2023, 17 (08) :2547-2559
[49]   SDCGAN: A CycleGAN-based single-domain generalization method for mechanical fault diagnosis [J].
Guo, Yu ;
Li, Xiangyu ;
Zhang, Jundong ;
Cheng, Ziyi .
RELIABILITY ENGINEERING & SYSTEM SAFETY, 2025, 258
[50]   IE-CycleGAN: improved cycle consistent adversarial network for unpaired PET image enhancement [J].
Cui, Jianan ;
Luo, Yi ;
Chen, Donghe ;
Shi, Kuangyu ;
Su, Xinhui ;
Liu, Huafeng .
EUROPEAN JOURNAL OF NUCLEAR MEDICINE AND MOLECULAR IMAGING, 2024, 51 (13) :3874-3887