Neuromorphic detection of speech dynamics

被引:9
作者
Gomez-Vilda, Pedro [1 ]
Ferrandez-Vicente, Jose M. [2 ]
Rodellar-Biarge, Victoria [1 ]
Alvarez-Marquina, Agustin [1 ]
Miguel Mazaira-Fernandez, Luis [1 ]
Martinez Olalla, Rafael [1 ]
Munoz-Mulas, Cristina [1 ]
机构
[1] Univ Politecn Madrid, Fac Informat, E-28660 Madrid, Spain
[2] Univ Politecn Cartagena, Cartagena 30202, Spain
关键词
Neuromorphic computing; Auditory pathways; Phonetic labelling; Contextual speech information; TIME; INTEGRATION; PERCEPTION;
D O I
10.1016/j.neucom.2010.07.023
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Speech and voice technologies are experiencing a profound review as new paradigms are sought to overcome some specific problems which cannot be completely solved by classical approaches. Neuromorphic Speech Processing is an emerging area in which research is turning the face to understand the natural neural processing of speech by the Human Auditory System in order to capture the basic mechanisms solving difficult tasks in an efficient way. In the present paper a further step ahead is presented in the approach to mimic basic neural speech processing by simple neuromorphic units standing on previous work to show how formant dynamics - and henceforth consonantal features - can be detected by using a general neuromorphic unit which can mimic the functionality of certain neurons found in the upper auditory pathways. Using these simple building blocks a General Speech Processing Architecture can be synthesized as a layered structure. Results from different simulation stages are provided as well as a discussion on implementation details. Conclusions and future work are oriented to describe the functionality to be covered in the next research steps. (C) 2010 Elsevier B.V. All rights reserved.
引用
收藏
页码:1191 / 1202
页数:12
相关论文
共 27 条
[1]  
Ainsworth W., 2006, Listening to speech: An auditory perspective, P3
[2]  
Allen J. B., 2008, SPRINGER HDB SPEECH, P27, DOI [10.1007/978-3-540-49127-9_3, DOI 10.1007/978-3-540-49127-9_3]
[3]  
[Anonymous], 1993, Discrete-Time Processing of Speech Signals
[4]  
[Anonymous], 2004, The Synaptic Organization of the Brain, ed
[5]   Ultrastructure of dendritic spines: correlation between synaptic and spine morphologies [J].
Arellano, Jon I. ;
Benavides-Piccione, Ruth ;
DeFelipe, Javier ;
Yuste, Rafael .
FRONTIERS IN NEUROSCIENCE, 2007, 1 (01) :131-143
[6]   Time-critical integration of formants for perception of communication calls in mice [J].
Geissler, DB ;
Ehret, G .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2002, 99 (13) :9021-9025
[7]  
Gómez-Vilda P, 2007, LECT NOTES COMPUT SC, V4527, P132
[8]   Time-frequency representations in speech perception [J].
Gomez-Vilda, Pedro ;
Ferrandez-Vicente, Jose M. ;
Rodellar-Biarge, Victoria ;
Fernandez-Baillo, Roberto .
NEUROCOMPUTING, 2009, 72 (4-6) :820-830
[9]  
Greenberg Steven, 2004, VVolume 18, P1
[10]  
Hebb D. O., 1949, ORG BEHAV NEUROPSYCH