SEQUENTIALLY TRAINED DNNS BASED MONAURAL SOURCE SEPARATION IN REAL ROOM ENVIRONMENTS

Cited by: 0
Authors
Li, Yi [1]
Sun, Yang [1]
Naqvi, Syed Mohsen [1]
Affiliations
[1] Newcastle Univ, Sch Engn, Intelligent Sensing & Commun Grp, Newcastle Upon Tyne NE1 7RU, Tyne & Wear, England
Keywords
Deep neural networks; monaural source separation; dereverberation mask; sequentially; FALL DETECTION SYSTEM; SPEECH SEPARATION; RECOGNITION; MASKING; NOISE;
DOI
10.1109/sspd.2019.8751658
Chinese Library Classification
TM [Electrical engineering]; TN [Electronic technology, communication technology];
Discipline classification codes
0808 ; 0809 ;
Abstract
In recent studies, deep neural networks (DNNs) have been introduced to solve the monaural source separation (MSS) problem in real room environments. However, the separation performance of existing methods is limited, especially in environments with larger RT60s. In this paper, we propose a system that trains two DNNs sequentially to mitigate this challenge and improve separation performance. Our dereverberation mask (DM) is used as the training target for DNN1, and a new enhanced ratio mask (ERM) is used as the training target for DNN2. The IEEE and TIMIT corpora, with real room impulse responses and noise interferences from the NOISEX dataset, are used to generate speech mixtures for evaluation. The proposed method outperforms the state-of-the-art methods.
Pages: 5
Related papers
50 in total
  • [21] Blind Monaural Source Separation on Heart and Lung Sounds Based on Periodic-Coded Deep Autoencoder
    Tsai, Kun-Hsi
    Wang, Wei-Chien
    Cheng, Chui-Hsuan
    Tsai, Chan-Yen
    Wang, Jou-Kou
    Lin, Tzu-Hao
    Fang, Shih-Hau
    Chen, Li-Chin
    Tsao, Yu
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2020, 24 (11) : 3203 - 3214
  • [22] Model-based monaural source separation using a vector-quantized phase-vocoder representation
    Ellis, Daniel P. W.
    Weiss, Ron J.
    2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13, 2006, : 5815 - 5818
  • [23] A multichannel learning-based approach for sound source separation in reverberant environments
    You-Siang Chen
    Zi-Jie Lin
    Mingsian R. Bai
    EURASIP Journal on Audio, Speech, and Music Processing, 2021
  • [24] A multichannel learning-based approach for sound source separation in reverberant environments
    Chen, You-Siang
    Lin, Zi-Jie
    Bai, Mingsian R.
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2021, 2021 (01)
  • [25] Real-time Acoustic Source Separation Based on Kalman Filter
    Wei, Yangjie
    Wang, Yi
    He, Yuqing
    PROCEEDINGS OF THE 2015 10TH IEEE CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS, 2015, : 1278 - 1283
  • [26] Blind source separation method based on real coded genetic algorithm
    DSP Lab, Chongqing Communication Institute, Chongqing 400035, China
    Unknown
    Unknown
    Dianzi Keji Diaxue Xuebao, 2006, 3 (295-297+327):
  • [27] Moving Sound Source Localization Based on Sequential Subspace Estimation in Actual Room Environments
    Tsuji, Daisuke
    Suyama, Kenji
    ELECTRONICS AND COMMUNICATIONS IN JAPAN, 2011, 94 (07) : 17 - 26
  • [28] A real-time blind source separation scheme and its application to reverberant and noisy acoustic environments
    Aichner, R
    Buchner, H
    Yan, F
    Kellermann, W
    SIGNAL PROCESSING, 2006, 86 (06) : 1260 - 1277
  • [29] Sound Source Localization for Subband-based Two Speech Separation in Room Environment
    Hai Quang Hong Dam
    Nordholm, Sven
    2013 INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND INFORMATION SCIENCES (ICCAIS), 2013,
  • [30] Two Model-Based EM Algorithms for Blind Source Separation in Noisy Environments
    Schwartz, Boaz
    Gannot, Sharon
    Habets, Emanuel A. P.
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2017, 25 (11) : 2209 - 2222