Monaural Source Separation in Complex Domain With Long Short-Term Memory Neural Network

被引:29
|
作者
Sun, Yang [1 ]
Xian, Yang [1 ]
Wang, Wenwu [2 ]
Naqvi, Syed Mohsen [1 ]
机构
[1] Newcastle Univ, Sch Engn, Intelligent Sensing & Commun Res Grp, Newcastle Upon Tyne NE1 7RU, Tyne & Wear, England
[2] Univ Surrey, Ctr Vis Speech & Signal Proc, Dept Elect & Elect Engn, Surrey GU2 7XH, England
关键词
Deep neural networks; monaural speech separation; long short-term memory; complex signal approximation; SPEECH DEREVERBERATION; MASKING; RECOGNITION; FEATURES; NOISE;
D O I
10.1109/JSTSP.2019.2908760
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In recent research, deep neural network (DNN) has been used to solve the monaural source separation problem. According to the training objectives, DNN-based monaural speech separation is categorized into three aspects, namely masking, mapping, and signal approximation based techniques. However, the performance of the traditional methods is not robust due to variations in real-world environments. Besides, in the vanilla DNN-based methods, the temporal information cannot be fully utilized. Therefore, in this paper, the long short-term memory (LSTM) neural network is applied to exploit the long-term speech contexts. Then, we propose the complex signal approximation (cSA), which is operated in the complex domain to utilize the phase information of the desired speech signal to improve the separation performance. The IEEE and the TIMIT corpora are used to generate mixtures with noise and speech interferences to evaluate the efficacy of the proposed method. The experimental results demonstrate the advantages of the proposed cSA-based LSTM recurrent neural network method in terms of different objective performance measures.
引用
收藏
页码:359 / 369
页数:11
相关论文
共 50 条
  • [1] COMBINING MONAURAL SOURCE SEPARATION WITH LONG SHORT-TERM MEMORY FOR INCREASED ROBUSTNESS IN VOCALIST GENDER RECOGNITION
    Weninger, Felix
    Durrieu, Jean-Louis
    Eyben, Florian
    Richard, Gael
    Schuller, Bjoern
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 2196 - 2199
  • [2] Long Short-term Memory Neural Network for Network Traffic Prediction
    Zhuo, Qinzheng
    Li, Qianmu
    Yan, Han
    Qi, Yong
    2017 12TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS AND KNOWLEDGE ENGINEERING (IEEE ISKE), 2017,
  • [3] An FPGA Implementation of a Long Short-Term Memory Neural Network
    Ferreira, Joao Canas
    Fonseca, Jose
    2016 INTERNATIONAL CONFERENCE ON RECONFIGURABLE COMPUTING AND FPGAS (RECONFIG16), 2016,
  • [4] Long short-term memory neural network for glucose prediction
    Carrillo-Moreno, Jaime
    Perez-Gandia, Carmen
    Sendra-Arranz, Rafael
    Garcia-Saez, Gema
    Hernando, M. Elena
    Gutierrez, Alvaro
    NEURAL COMPUTING & APPLICATIONS, 2021, 33 (09): : 4191 - 4203
  • [5] Long short-term memory neural network for glucose prediction
    Jaime Carrillo-Moreno
    Carmen Pérez-Gandía
    Rafael Sendra-Arranz
    Gema García-Sáez
    M. Elena Hernando
    Álvaro Gutiérrez
    Neural Computing and Applications, 2021, 33 : 4191 - 4203
  • [6] Short-term neural network memory
    Morris, Robert J.T.
    Wong, Wing Shing
    SIAM Journal on Computing, 1988, 17 (06): : 1103 - 1118
  • [7] A SHORT-TERM NEURAL NETWORK MEMORY
    MORRIS, RJT
    WONG, WS
    SIAM JOURNAL ON COMPUTING, 1988, 17 (06) : 1103 - 1118
  • [8] Predicting Short-term Traffic Flow by Long Short-Term Memory Recurrent Neural Network
    Tian, Yongxue
    Pan, Li
    2015 IEEE INTERNATIONAL CONFERENCE ON SMART CITY/SOCIALCOM/SUSTAINCOM (SMARTCITY), 2015, : 153 - 158
  • [9] Short-term runoff forecasting in an alpine catchment with a long short-term memory neural network
    Frank, Corinna
    Russwurm, Marc
    Fluixa-Sanmartin, Javier
    Tuia, Devis
    FRONTIERS IN WATER, 2023, 5
  • [10] A Deep Neural Network Model for Short-Term Load Forecast Based on Long Short-Term Memory Network and Convolutional Neural Network
    Tian, Chujie
    Ma, Jian
    Zhang, Chunhong
    Zhan, Panpan
    ENERGIES, 2018, 11 (12)