Monaural Source Separation Based on Adaptive Discriminative Criterion in Neural Networks

Cited by: 0
Authors
Sun, Yang [1 ]
Zhu, Lei [2 ]
Chambers, Jonathon A. [1 ]
Naqvi, Syed Mohsen [1 ]
Affiliations
[1] Newcastle Univ, Sch Elect & Elect Engn, Newcastle Upon Tyne, Tyne & Wear, England
[2] Harbin Engn Univ, Sci Coll, Harbin, Heilongjiang, Peoples R China
Keywords
Monaural Source Separation; Deep Recurrent Neural Network; Penalty Factor; Adaptive
DOI
Not available
Chinese Library Classification
TP39 [Computer Applications]
Subject classification code
081203; 0835
Abstract
Monaural source separation is an important research area that can improve the performance of several real-world applications, such as speech recognition and assisted living systems. Huang et al. proposed deep recurrent neural networks (DRNNs) with a discriminative criterion in the objective function to improve source separation performance. However, the penalty factor in that objective function is selected randomly and empirically. We therefore introduce an approach that calculates the parameter in the discriminative term adaptively from the discrepancy between target features, so that the penalty factor changes with the inputs to improve separation performance. The proposed method is evaluated with different settings and neural network architectures. In these experiments, the TIMIT corpus is used as the dataset and the signal-to-distortion ratio (SDR) as the performance measure. Compared with the previous approach, our method shows improved robustness and better separation performance.
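The idea of a discriminative objective with an input-dependent penalty factor can be illustrated with a minimal sketch. The loss below follows the commonly cited two-source form attributed to Huang et al. (fit to the correct target minus a penalty on similarity to the other target). The abstract does not give the adaptive rule, so `adaptive_gamma`, its `gamma_max` parameter, and the normalisation it uses are purely hypothetical stand-ins, not the authors' formula.

```python
import numpy as np


def discriminative_loss(y1_hat, y2_hat, y1, y2, gamma):
    """Two-source discriminative objective with penalty factor gamma.

    The first term rewards estimates that match their own targets; the
    second term, weighted by gamma, rewards estimates that differ from
    the other source's target.
    """
    fit = np.sum((y1_hat - y1) ** 2) + np.sum((y2_hat - y2) ** 2)
    cross = np.sum((y1_hat - y2) ** 2) + np.sum((y2_hat - y1) ** 2)
    return fit - gamma * cross


def adaptive_gamma(y1, y2, gamma_max=0.1, eps=1e-8):
    """HYPOTHETICAL adaptive rule (an assumption, not the paper's formula).

    Scales the penalty factor by the discrepancy between the two target
    feature matrices, normalised so the result lies in [0, gamma_max).
    """
    discrepancy = np.sum((y1 - y2) ** 2)
    energy = np.sum(y1 ** 2) + np.sum(y2 ** 2) + eps
    return gamma_max * discrepancy / (discrepancy + energy)


# Toy target features for two sources (frames x frequency bins).
y1 = np.array([[1.0, 0.0], [0.0, 1.0]])
y2 = np.array([[0.0, 1.0], [1.0, 0.0]])

gamma = adaptive_gamma(y1, y2)
loss_correct = discriminative_loss(y1, y2, y1, y2, gamma)
loss_swapped = discriminative_loss(y2, y1, y1, y2, gamma)
```

In this toy case the penalty factor is recomputed from the targets of each input rather than fixed in advance, and estimates assigned to the wrong source receive a higher loss than correctly assigned ones.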
Pages: 5
Related papers
50 in total
  • [1] DISCRIMINATIVE DEEP RECURRENT NEURAL NETWORKS FOR MONAURAL SPEECH SEPARATION
    Wang, Guan-Xiang
    Hsu, Chung-Chien
    Chien, Jen-Tzung
    2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 2544 - 2548
  • [2] Joint Optimization of Masks and Deep Recurrent Neural Networks for Monaural Source Separation
    Huang, Po-Sen
    Kim, Minje
    Hasegawa-Johnson, Mark
    Smaragdis, Paris
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2015, 23 (12) : 2136 - 2147
  • [3] Multi-Resolution Fully Convolutional Neural Networks for Monaural Audio Source Separation
    Grais, Emad M.
    Wierstorf, Hagen
    Ward, Dominic
    Plumbley, Mark D.
    LATENT VARIABLE ANALYSIS AND SIGNAL SEPARATION (LVA/ICA 2018), 2018, 10891 : 340 - 350
  • [4] Mask Optimisation for Neural Network Monaural Source Separation
    Cant, Richard
    Langensiepen, Caroline
    Metcalf, William
    2017 19TH UKSIM-AMSS INTERNATIONAL CONFERENCE ON MATHEMATICAL MODELLING & COMPUTER SIMULATION (UKSIM), 2017, : 116 - 121
  • [5] SUPERVISED MONAURAL SOURCE SEPARATION BASED ON AUTOENCODERS
    Osako, Keiichi
    Mitsufuji, Yuki
    Singh, Rita
    Raj, Bhiksha
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 11 - 15
  • [6] Audio Source Separation with Discriminative Scattering Networks
    Sprechmann, Pablo
    Bruna, Joan
    LeCun, Yann
    LATENT VARIABLE ANALYSIS AND SIGNAL SEPARATION, LVA/ICA 2015, 2015, 9237 : 259 - 267
  • [7] Asymmetric PCA neural networks for adaptive blind source separation
    Diamantaras, KI
    NEURAL NETWORKS FOR SIGNAL PROCESSING VIII, 1998, : 103 - 112
  • [8] Discriminative Enhancement for Single Channel Audio Source Separation Using Deep Neural Networks
    Grais, Emad M.
    Roma, Gerard
    Simpson, Andrew J. R.
    Plumbley, Mark D.
    LATENT VARIABLE ANALYSIS AND SIGNAL SEPARATION (LVA/ICA 2017), 2017, 10169 : 236 - 246
  • [9] Two-Stage Monaural Source Separation in Reverberant Room Environments Using Deep Neural Networks
    Sun, Yang
    Wang, Wenwu
    Chambers, Jonathon
    Naqvi, Syed Mohsen
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2019, 27 (01) : 125 - 139
  • [10] ENHANCED TIME-FREQUENCY MASKING BY USING NEURAL NETWORKS FOR MONAURAL SOURCE SEPARATION IN REVERBERANT ROOM ENVIRONMENTS
    Sun, Yang
    Wang, Wenwu
    Chambers, Jonathon A.
    Naqvi, Syed Mohsen
    2018 26TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2018, : 1647 - 1651