SEQUENTIALLY TRAINED DNNS BASED MONAURAL SOURCE SEPARATION IN REAL ROOM ENVIRONMENTS

被引:0
|
作者
Li, Yi [1 ]
Sun, Yang [1 ]
Naqvi, Syed Mohsen [1 ]
机构
[1] Newcastle Univ, Sch Engn, Intelligent Sensing & Commun Grp, Newcastle Upon Tyne NE1 7RU, Tyne & Wear, England
关键词
Deep neural networks; monaural source separation; dereverbertation mask; sequentially; FALL DETECTION SYSTEM; SPEECH SEPARATION; RECOGNITION; MASKING; NOISE;
D O I
10.1109/sspd.2019.8751658
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In recent studies, deep neural networks (DNN) have been introduced to solve monaural source separation (MSS) problem within real room environments. However, the separation performance of the existing methods is limited, especially for environments with larger RT60s. In this paper, we propose a system to train two DNNs sequentially, to mitigate the challenge and improve the separation performance. Our dereverberation mask (DM) is exploited as a training target for DNN1 and new enhanced ratio mask (ERM) is used as a training target for DNN2. The IEEE and the TIMIT corpora with real room impulse responses and noise interferences from the NOISEX dataset are used to generate speech mixtures for evaluations. The proposed method outperforms the state-of-the-art methods.
引用
收藏
页数:5
相关论文
共 50 条
  • [1] Monaural Source Separation Based on Sequentially Trained LSTMs in Real Room Environments
    Li, Yi
    Sun, Yang
    Naqvi, Syed Mohsen
    2019 27TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2019,
  • [2] MONAURAL SOURCE SEPARATION: FROM ANECHOIC TO REVERBERANT ENVIRONMENTS
    Cord-Landwehr, Tobias
    Boeddeker, Christoph
    Von Neumann, Thilo
    Zorila, Catalin
    Doddipatla, Rama
    Haeb-Umbach, Reinhold
    2022 INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC 2022), 2022,
  • [3] Two-Stage Monaural Source Separation in Reverberant Room Environments Using Deep Neural Networks
    Sun, Yang
    Wang, Wenwu
    Chambers, Jonathon
    Naqvi, Syed Mohsen
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2019, 27 (01) : 125 - 139
  • [4] ENHANCED TIME-FREQUENCY MASKING BY USING NEURAL NETWORKS FOR MONAURAL SOURCE SEPARATION IN REVERBERANT ROOM ENVIRONMENTS
    Sun, Yang
    Wang, Wenwu
    Chambers, Jonathon A.
    Naqvi, Syed Mohsen
    2018 26TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2018, : 1647 - 1651
  • [5] SUPERVISED MONAURAL SOURCE SEPARATION BASED ON AUTOENCODERS
    Osako, Keiichi
    Mitsufuji, Yuki
    Singh, Rita
    Raj, Bhiksha
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 11 - 15
  • [6] Blind source separation in real environments
    Ohnishi, Hideharu
    Nagasaka, Kenji
    WMSCI 2005: 9th World Multi-Conference on Systemics, Cybernetics and Informatics, Vol 5, 2005, : 371 - 375
  • [7] Efficiency of blind Source separation in a real room
    Libiszewski, Pawel
    Kocinski, Jedrzej
    ARCHIVES OF ACOUSTICS, 2007, 32 (04) : 337 - 342
  • [8] IVA ALGORITHMS USING A MULTIVARIATE STUDENT'S T SOURCE PRIOR FOR SPEECH SOURCE SEPARATION IN REAL ROOM ENVIRONMENTS
    Rafique, Waqas
    Naqvi, Syed Mohsen
    Jackson, Philip J. B.
    Chambers, Jonathon A.
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 474 - 478
  • [9] INFORMED MONAURAL SOURCE SEPARATION OF MUSIC BASED ON CONVOLUTIONAL SPARSE CODING
    Jao, Ping-Keng
    Yang, Yi-Hsuan
    Wohlberg, Brendt
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 236 - 240
  • [10] A SI-SDR Loss Function based Monaural Source Separation
    Li, Shuai
    Liu, Hongqing
    Zhou, Yi
    Luo, Zhen
    PROCEEDINGS OF 2020 IEEE 15TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP 2020), 2020, : 356 - 360