Neural Network Architectures for Speaker Independent Phoneme Recognition

被引:0
作者
Cutajar, M. [1 ]
Gatt, E. [1 ]
Grech, I [1 ]
Casha, O. [1 ]
Micallef, J. [1 ]
机构
[1] Univ Malta, Fac ICT, Dept Microelect & Nanoelect, MSD-2080 Msida, Malta
来源
PROCEEDINGS OF THE 7TH INTERNATIONAL SYMPOSIUM ON IMAGE AND SIGNAL PROCESSING AND ANALYSIS (ISPA 2011) | 2011年
关键词
SPEECH RECOGNITION;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Two different neural network architectures were designed for speaker independent phoneme recognition systems. The first architecture consists of the Radial Basis Function (RBF), while in the second architecture a Self-Organising Maps (SOM) neural network replaces the RBF. The Discrete Wavelet Transform (DWT) is used for feature extraction in both systems. Both systems were tested on the TIMIT database. The highest recognition rates obtained are 36.3% and 46.7%, for the RBF and SOM architectures respectively for multi-speaker unlimited vocabulary speech.
引用
收藏
页码:90 / 94
页数:5
相关论文
共 17 条
[1]   CDHMM Parameters Selection for Speaker-Independent Phone Recognition In Continuous Speech System [J].
Ben Messaoud, Zaineb ;
Ben Hamida, Ahmed .
MELECON 2010: THE 15TH IEEE MEDITERRANEAN ELECTROTECHNICAL CONFERENCE, 2010, :253-258
[2]   Digital Hardware Implementation of Self-Organising Maps [J].
Cutajar, M. ;
Gatt, E. ;
Micallef, J. ;
Grech, I ;
Casha, O. .
MELECON 2010: THE 15TH IEEE MEDITERRANEAN ELECTROTECHNICAL CONFERENCE, 2010, :1123-1128
[3]   Speech recognition moves from software to hardware [J].
Dailey Paulson, Linda .
COMPUTER, 2006, 39 (11) :15-18
[4]  
Du XP, 2006, LECT NOTES COMPUT SC, V3972, P150
[5]   A Fractal-based Approach for Speech Segmentation [J].
Fantinato, Paulo Cesar ;
Guido, Rodrigo Capobianco ;
Chen, Shi-Huang ;
Silveira Santos, Bruno Leonardo ;
Vieira, Lucimar Sasso ;
Barbon Junior, Sylvio ;
Rodrigues, Luciene Cavalcanti ;
Sanchez, Fabricio Lopes ;
Lemos Escola, Joao Paulo ;
Souza, Leonardo Mendes ;
Maciel, Carlos Dias ;
Scalassara, Paulo Rogerio ;
Pereira, Jose Carlos .
ISM: 2008 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA, 2008, :551-+
[6]  
Fransen E., RADIAL BASIS FUNCTIO
[7]  
Gowdy JN, 2000, INT CONF ACOUST SPEE, P1351, DOI 10.1109/ICASSP.2000.861829
[8]  
Krishnan V. R. Vimal, 2009, International Journal of Computer and Network Security, V1, P52
[9]   System for automatic collection, annotation and indexing of Czech broadcast speech with full-text search [J].
Nouza, Jan ;
Zdansky, Jindrich ;
Cerva, Petr .
MELECON 2010: THE 15TH IEEE MEDITERRANEAN ELECTROTECHNICAL CONFERENCE, 2010, :202-205
[10]   Interacting with computers by voice: Automatic speech recognition and synthesis [J].
O'Shaughnessy, D .
PROCEEDINGS OF THE IEEE, 2003, 91 (09) :1272-1305