Monaural Source Separation Based on Adaptive Discriminative Criterion in Neural Networks

被引:0
|
作者
Sun, Yang [1 ]
Zhu, Lei [2 ]
Chambers, Jonathon A. [1 ]
Naqvi, Syed Mohsen [1 ]
机构
[1] Newcastle Univ, Sch Elect & Elect Engn, Newcastle Upon Tyne, Tyne & Wear, England
[2] Harbin Engn Univ, Sci Coll, Harbin, Heilongjiang, Peoples R China
关键词
Monaural Source Separation; Deep Recurrent Neural Network; Penalty Factor; Adaptive;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Monaural source separation is an important research area which can help to improve the performance of several real-world applications, such as speech recognition and assisted living systems. Huang et al. proposed deep recurrent neural networks (DRNNs) with discriminative criterion objective function to improve the performance of source separation. However, the penalty factor in the objective function is selected randomly and empirically. Therefore, we introduce an approach to calculate the parameter in the discriminative term adaptively via the discrepancy between target features. The penalty factor can be changed with inputs to improve the separation performance. The proposed method is evaluated with different settings and architectures of neural networks. In these experiments, the TIMIT corpus is explored as the database and the signal to distortion ratio (SDR) as the measurement. Comparing with the previous approach, our method has improved robustness and a better separation performance.
引用
收藏
页数:5
相关论文
共 50 条
  • [21] Discriminative Learning for Monaural Speech Separation Using Deep Embedding Features
    Fan, Cunhang
    Liu, Bin
    Tao, Jianhua
    Yi, Jiangyan
    Wen, Zhengqi
    INTERSPEECH 2019, 2019, : 4599 - 4603
  • [22] Depthwise Separable Convolutions Versus Recurrent Neural Networks for Monaural Singing Voice Separation
    Pyykkonen, Pyry
    Mimilakis, Styliannos, I
    Drossos, Konstantinos
    Virtanen, Tuomas
    2020 IEEE 22ND INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2020,
  • [23] Hybrid neural networks for ISFET source separation
    Bermejo, S
    Bedoya, G
    Cabestany, J
    SMART SENSORS, ACTUATORS, AND MEMS, PTS 1 AND 2, 2003, 5116 : 109 - 119
  • [24] Inference-Adaptive Steering of Neural Networks for Real-Time Area-Based Sound Source Separation
    Strauss, Martin
    Mack, Wolfgang
    Valero, Maria Luis
    Koepueklue, Okan
    IEEE SIGNAL PROCESSING LETTERS, 2025, 32 : 1041 - 1045
  • [25] Adaptive blind source separation using a risk-sensitive criterion
    Shimizu, J
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2003, E86A (07) : 1724 - 1731
  • [26] Monaural Source Separation Based on Sequentially Trained LSTMs in Real Room Environments
    Li, Yi
    Sun, Yang
    Naqvi, Syed Mohsen
    2019 27TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2019,
  • [27] Audio Source Separation from a Monaural Mixture Using Convolutional Neural Network in the Time Domain
    Zhang, Peng
    Ma, Xiaohong
    Ding, Shuxue
    ADVANCES IN NEURAL NETWORKS, PT II, 2017, 10262 : 388 - 395
  • [28] Discriminative adaptive training using the MPE criterion
    Wang, L
    Woodland, PC
    ASRU'03: 2003 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING ASRU '03, 2003, : 279 - 284
  • [29] Monaural Source Separation in Complex Domain With Long Short-Term Memory Neural Network
    Sun, Yang
    Xian, Yang
    Wang, Wenwu
    Naqvi, Syed Mohsen
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2019, 13 (02) : 359 - 369
  • [30] SEQUENTIALLY TRAINED DNNS BASED MONAURAL SOURCE SEPARATION IN REAL ROOM ENVIRONMENTS
    Li, Yi
    Sun, Yang
    Naqvi, Syed Mohsen
    2019 SENSOR SIGNAL PROCESSING FOR DEFENCE CONFERENCE (SSPD), 2019,