Nonlinear enhancement of noisy speech, using continuous attractor dynamics formed in recurrent neural networks

被引:13
作者
Dehyadegary, Louiza [1 ]
Seyyedsalehi, Seyyed Ali [1 ]
Nejadgholi, Isar [1 ]
机构
[1] Amir Kabir Univ Technol, Dept Biomed Engn, Tehran, Iran
关键词
Noisy speech robust recognition; Recurrent neural networks; Nonlinear dynamics; Continuous attractors; Nonlinear filtering; RECOGNITION; MACHINES;
D O I
10.1016/j.neucom.2010.12.044
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Here, formation of continuous attractor dynamics in a nonlinear recurrent neural network is used to achieve a nonlinear speech denoising method, in order to implement robust phoneme recognition and information retrieval. Formation of attractor dynamics in recurrent neural network is first carried out by training the clean speech subspace as the continuous attractor. Then, it is used to recognize noisy speech with both stationary and nonstationary noise. In this work, the efficiency of a nonlinear feedforward network is compared to the same one with a recurrent connection in its hidden layer. The structure and training of this recurrent connection, is designed in such a way that the network learns to denoise the signal step by step, using properties of attractors it has formed, along with phone recognition. Using these connections, the recognition accuracy is improved 21% for the stationary signal and 14% for the nonstationary one with 0db SNR, in respect to a reference model which is a feedforward neural network. (C) 2011 Elsevier B.V. All rights reserved.
引用
收藏
页码:2716 / 2724
页数:9
相关论文
共 26 条
[1]  
ACKLEY DH, 1985, COGNITIVE SCI, V9, P147
[2]  
Amit DJ., 1989, MODELING BRAIN FUNCT, DOI DOI 10.1017/CBO9780511623257
[3]  
BENGIO Y, 1996, ADV NEUROL INFORM PR, V8
[4]  
BEROUTI M, 1997, ENHANCEMENT SPEECH C
[5]  
BIJANKHAN M, 1994, SST94 PERTH, P826
[6]   Robust automatic speech recognition with missing and unreliable acoustic data [J].
Cooke, M ;
Green, P ;
Josifovski, L ;
Vizinho, A .
SPEECH COMMUNICATION, 2001, 34 (03) :267-285
[7]  
ELTOFT T, 2001, ICA NONLINEAR TIME S
[8]  
EPHRAIM Y, 1992, P IEEE, V80
[9]  
FURUI S, 2003, EUROSPEECH
[10]  
GAAFAR M, 1998, P ICASSP