Monaural Source Separation Based on Adaptive Discriminative Criterion in Neural Networks

被引:0
作者
Sun, Yang [1 ]
Zhu, Lei [2 ]
Chambers, Jonathon A. [1 ]
Naqvi, Syed Mohsen [1 ]
机构
[1] Newcastle Univ, Sch Elect & Elect Engn, Newcastle Upon Tyne, Tyne & Wear, England
[2] Harbin Engn Univ, Sci Coll, Harbin, Heilongjiang, Peoples R China
来源
2017 22ND INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING (DSP) | 2017年
关键词
Monaural Source Separation; Deep Recurrent Neural Network; Penalty Factor; Adaptive;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Monaural source separation is an important research area which can help to improve the performance of several real-world applications, such as speech recognition and assisted living systems. Huang et al. proposed deep recurrent neural networks (DRNNs) with discriminative criterion objective function to improve the performance of source separation. However, the penalty factor in the objective function is selected randomly and empirically. Therefore, we introduce an approach to calculate the parameter in the discriminative term adaptively via the discrepancy between target features. The penalty factor can be changed with inputs to improve the separation performance. The proposed method is evaluated with different settings and architectures of neural networks. In these experiments, the TIMIT corpus is explored as the database and the signal to distortion ratio (SDR) as the measurement. Comparing with the previous approach, our method has improved robustness and a better separation performance.
引用
收藏
页数:5
相关论文
共 50 条
  • [1] Joint Optimization of Masks and Deep Recurrent Neural Networks for Monaural Source Separation
    Huang, Po-Sen
    Kim, Minje
    Hasegawa-Johnson, Mark
    Smaragdis, Paris
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2015, 23 (12) : 2136 - 2147
  • [2] Two-Stage Monaural Source Separation in Reverberant Room Environments Using Deep Neural Networks
    Sun, Yang
    Wang, Wenwu
    Chambers, Jonathon
    Naqvi, Syed Mohsen
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2019, 27 (01) : 125 - 139
  • [3] A SI-SDR Loss Function based Monaural Source Separation
    Li, Shuai
    Liu, Hongqing
    Zhou, Yi
    Luo, Zhen
    PROCEEDINGS OF 2020 IEEE 15TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP 2020), 2020, : 356 - 360
  • [4] Bayesian Factorization and Learning for Monaural Source Separation
    Chien, Jen-Tzung
    Yang, Po-Kai
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2016, 24 (01) : 185 - 195
  • [5] Adaptive Weighted Performance Criterion for Artificial Neural Networks
    Bal, Cagatay
    Demir, Serdar
    2018 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND DATA PROCESSING (IDAP), 2018,
  • [6] Audio Source Separation from a Monaural Mixture Using Convolutional Neural Network in the Time Domain
    Zhang, Peng
    Ma, Xiaohong
    Ding, Shuxue
    ADVANCES IN NEURAL NETWORKS, PT II, 2017, 10262 : 388 - 395
  • [7] Monaural Source Separation Based on Sequentially Trained LSTMs in Real Room Environments
    Li, Yi
    Sun, Yang
    Naqvi, Syed Mohsen
    2019 27TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2019,
  • [8] SEQUENTIALLY TRAINED DNNS BASED MONAURAL SOURCE SEPARATION IN REAL ROOM ENVIRONMENTS
    Li, Yi
    Sun, Yang
    Naqvi, Syed Mohsen
    2019 SENSOR SIGNAL PROCESSING FOR DEFENCE CONFERENCE (SSPD), 2019,
  • [9] Monaural Source Separation Using a Random Forest Classifier
    Riday, Cosimo
    Bhargava, Saurabh
    Hahnloser, Richard H. R.
    Liu, Shih-Chii
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 3344 - 3348
  • [10] PROXIMAL DEEP RECURRENT NEURAL NETWORK FOR MONAURAL SINGING VOICE SEPARATION
    Yuan, Weitao
    Wang, Shengbei
    Li, Xiangrui
    Unoki, Masashi
    Wang, Wenwu
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 286 - 290