DISCRIMINATIVE DEEP RECURRENT NEURAL NETWORKS FOR MONAURAL SPEECH SEPARATION

被引:0
|
作者
Wang, Guan-Xiang [1 ]
Hsu, Chung-Chien [1 ]
Chien, Jen-Tzung [1 ]
机构
[1] Natl Chiao Tung Univ, Dept Elect & Comp Engn, Hsinchu 30010, Taiwan
关键词
deep learning; discriminative learning; neural network; monaural speech separation; FACTORIZATION;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Deep neural network is now a new trend towards solving different problems in speech processing. In this paper, we propose a discriminative deep recurrent neural network (DRNN) model for monaural speech separation. Our idea is to construct DRNN as a regression model to discover the deep structure and regularity for signal reconstruction from a mixture of two source spectra. To reinforce the discrimination capability between two separated spectra, we estimate DRNN separation parameters by minimizing an integrated objective function which consists of two measurements. One is the within source reconstruction errors due to the individual source spectra while the other conveys the discrimination information which preserves the mutual difference between two source spectra during the supervised training procedure. This discrimination information acts as a kind of regularization so as to maintain between-source separation in monaural source separation. In the experiments, we demonstrate the effectiveness of the proposed method for speech separation compared with the other methods.
引用
收藏
页码:2544 / 2548
页数:5
相关论文
共 50 条
  • [1] Discriminative Learning for Monaural Speech Separation Using Deep Embedding Features
    Fan, Cunhang
    Liu, Bin
    Tao, Jianhua
    Yi, Jiangyan
    Wen, Zhengqi
    INTERSPEECH 2019, 2019, : 4599 - 4603
  • [2] Joint Optimization of Masks and Deep Recurrent Neural Networks for Monaural Source Separation
    Huang, Po-Sen
    Kim, Minje
    Hasegawa-Johnson, Mark
    Smaragdis, Paris
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2015, 23 (12) : 2136 - 2147
  • [3] Improvement of joint optimization of masks and deep recurrent neural networks for monaural speech separation using optimized activation functions
    MASOOD Asim
    YE Zhongfu
    Chinese Journal of Acoustics, 2020, 39 (03) : 420 - 432
  • [4] Monaural Source Separation Based on Adaptive Discriminative Criterion in Neural Networks
    Sun, Yang
    Zhu, Lei
    Chambers, Jonathon A.
    Naqvi, Syed Mohsen
    2017 22ND INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING (DSP), 2017,
  • [5] Deep Recurrent Neural Network based Monaural Speech Separation using Recurrent Temporal Restricted Boltzmann Machines
    Samui, Suman
    Chakrabarti, Indrajit
    Ghosh, Soumya K.
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 3622 - 3626
  • [6] PERCEPTUAL IMPROVEMENT OF DEEP NEURAL NETWORKS FOR MONAURAL SPEECH ENHANCEMENT
    Han, Wei
    Zhang, Xiongwei
    Sun, Meng
    Shi, Wenhua
    Chen, Xushan
    Hu, Yonggang
    2016 IEEE INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC), 2016,
  • [7] DEEP LEARNING FOR MONAURAL SPEECH SEPARATION
    Huang, Po-Sen
    Kim, Minje
    Hasegawa-Johnson, Mark
    Smaragdis, Paris
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [8] PROXIMAL DEEP RECURRENT NEURAL NETWORK FOR MONAURAL SINGING VOICE SEPARATION
    Yuan, Weitao
    Wang, Shengbei
    Li, Xiangrui
    Unoki, Masashi
    Wang, Wenwu
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 286 - 290
  • [9] A convolutional recurrent neural network with attention framework for speech separation in monaural recordings
    Chao Sun
    Min Zhang
    Ruijuan Wu
    Junhong Lu
    Guo Xian
    Qin Yu
    Xiaofeng Gong
    Ruisen Luo
    Scientific Reports, 11
  • [10] A convolutional recurrent neural network with attention framework for speech separation in monaural recordings
    Sun, Chao
    Zhang, Min
    Wu, Ruijuan
    Lu, Junhong
    Xian, Guo
    Yu, Qin
    Gong, Xiaofeng
    Luo, Ruisen
    SCIENTIFIC REPORTS, 2021, 11 (01)