SEQUENTIALLY TRAINED DNNS BASED MONAURAL SOURCE SEPARATION IN REAL ROOM ENVIRONMENTS

Cited by: 0
Authors
Li, Yi [1]
Sun, Yang [1]
Naqvi, Syed Mohsen [1]
Affiliations
[1] Newcastle Univ, Sch Engn, Intelligent Sensing & Commun Grp, Newcastle Upon Tyne NE1 7RU, Tyne & Wear, England
Keywords
Deep neural networks; monaural source separation; dereverberation mask; sequentially; FALL DETECTION SYSTEM; SPEECH SEPARATION; RECOGNITION; MASKING; NOISE
DOI
10.1109/sspd.2019.8751658
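Chinese Library Classification (CLC) number
TM [Electrical technology]; TN [Electronic technology, communication technology]
Discipline classification code
0808; 0809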
Abstract
In recent studies, deep neural networks (DNNs) have been introduced to solve the monaural source separation (MSS) problem in real room environments. However, the separation performance of existing methods is limited, especially in environments with larger RT60s. In this paper, we propose a system that trains two DNNs sequentially to mitigate this challenge and improve separation performance. Our dereverberation mask (DM) is used as the training target for DNN1, and a new enhanced ratio mask (ERM) is used as the training target for DNN2. Speech mixtures for evaluation are generated from the IEEE and TIMIT corpora with real room impulse responses and noise interference from the NOISEX dataset. The proposed method outperforms the state-of-the-art methods.
Pages: 5