Monaural Source Separation Based on Adaptive Discriminative Criterion in Neural Networks

被引:0
|
作者
Sun, Yang [1 ]
Zhu, Lei [2 ]
Chambers, Jonathon A. [1 ]
Naqvi, Syed Mohsen [1 ]
机构
[1] Newcastle Univ, Sch Elect & Elect Engn, Newcastle Upon Tyne, Tyne & Wear, England
[2] Harbin Engn Univ, Sci Coll, Harbin, Heilongjiang, Peoples R China
关键词
Monaural Source Separation; Deep Recurrent Neural Network; Penalty Factor; Adaptive;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Monaural source separation is an important research area which can help to improve the performance of several real-world applications, such as speech recognition and assisted living systems. Huang et al. proposed deep recurrent neural networks (DRNNs) with discriminative criterion objective function to improve the performance of source separation. However, the penalty factor in the objective function is selected randomly and empirically. Therefore, we introduce an approach to calculate the parameter in the discriminative term adaptively via the discrepancy between target features. The penalty factor can be changed with inputs to improve the separation performance. The proposed method is evaluated with different settings and architectures of neural networks. In these experiments, the TIMIT corpus is explored as the database and the signal to distortion ratio (SDR) as the measurement. Comparing with the previous approach, our method has improved robustness and a better separation performance.
引用
收藏
页数:5
相关论文
共 50 条
  • [31] MaD TwinNet: Masker-Denoiser Architecture with Twin Networks for Monaural Sound Source Separation
    Drossos, Konstantinos
    Mimilakis, Stylianos Ioannis
    Serdyuk, Dmitriy
    Schuller, Gerald
    Virtanen, Tuomas
    Bengio, Yoshua
    2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018,
  • [32] GEOMETRIC INFORMATION BASED MONAURAL SPEECH SEPARATION USING DEEP NEURAL NETWORK
    Xian, Yang
    Sun, Yang
    Chambers, Jonathon A.
    Naqvi, Syed Mohsen
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 4454 - 4458
  • [33] Spatial Dispersion Constrained NMF for Monaural Source Separation
    Viet-Hang Duong
    Lee, Yuan-Shan
    Bach-Tung Pham
    Mathulaprangsan, Seksan
    Pham-The Bao
    Wang, Jia-Ching
    2016 10TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2016,
  • [34] Joint Amplitude and Phase Refinement for Monaural Source Separation
    Masuyama, Yoshiki
    Yatabe, Kohei
    Nagatomo, Kento
    Oikawa, Yasuhiro
    IEEE SIGNAL PROCESSING LETTERS, 2020, 27 : 1939 - 1943
  • [35] Monaural Source Separation Using a Random Forest Classifier
    Riday, Cosimo
    Bhargava, Saurabh
    Hahnloser, Richard H. R.
    Liu, Shih-Chii
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 3344 - 3348
  • [36] Iterative Monaural Audio Source Separation for Subspace Grouping
    Spiertz, Martin
    Gnann, Volker
    2008 INTERNATIONAL SYMPOSIUM ON INTELLIGENT SIGNAL PROCESSING AND COMMUNICATIONS SYSTEMS (ISPACS 2008), 2008, : 102 - 105
  • [37] AUTOCLIP: ADAPTIVE GRADIENT CLIPPING FOR SOURCE SEPARATION NETWORKS
    Seetharaman, Prem
    Wichern, Gordon
    Pardo, Bryan
    Le Roux, Jonathan
    PROCEEDINGS OF THE 2020 IEEE 30TH INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2020,
  • [38] Monaural Source Separation Using Ramanujan Subspace Dictionaries
    Liao, Hsueh-Wei
    Su, Li
    IEEE SIGNAL PROCESSING LETTERS, 2018, 25 (08) : 1156 - 1160
  • [39] Monaural Audio Source Separation using Variational Autoencoders
    Pandey, Laxmi
    Kumar, Anurendra
    Namboodiri, Vinay
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 3489 - 3493
  • [40] MONAURAL SOURCE SEPARATION: FROM ANECHOIC TO REVERBERANT ENVIRONMENTS
    Cord-Landwehr, Tobias
    Boeddeker, Christoph
    Von Neumann, Thilo
    Zorila, Catalin
    Doddipatla, Rama
    Haeb-Umbach, Reinhold
    2022 INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC 2022), 2022,