HMM/ANN hybrid model for continuous Malayalam speech recognition

被引:11
作者
Mohamed, Anuj [1 ]
Nair, K. N. Ramachandran [2 ]
机构
[1] Viswajyothi Coll Engn & Technol, Dept Comp Sci & Engn, Muvattupuzha, Kerala, India
[2] Mahatma Gandhi Univ, Sch Comp Sci, Kottayam, Kerala, India
来源
INTERNATIONAL CONFERENCE ON COMMUNICATION TECHNOLOGY AND SYSTEM DESIGN 2011 | 2012年 / 30卷
关键词
Automatic Speech Recognition; Continuous Malayalam speech recognition; Hybrid speech recognition; Hidden Markov Models; Artificial Neural Networks;
D O I
10.1016/j.proeng.2012.01.906
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper describes the development of a context independent, small vocabulary, connectionist-statistical continuous Malayalam speech recognition system which combines the time normalization property of Hidden Markov Models (HMMs) with the superior discriminative ability of Artificial Neural Networks (ANNs). In this work, the HMM based phoneme models use the emission probabilities estimated from the posterior probabilities obtained through Multi Layer Perceptrons. We evaluated the performance of our proposed system on a small vocabulary, speaker independent continuous Malayalam speech corpus and our system has produced a promising result of 86.67% word and 66.67% sentence recognition rates. This is the first reported result for a Malayalam speaker independent continuous speech recognizer based on an HMM/ANN hybrid framework. (C) 2011 Published by Elsevier Ltd. Selection and/or peer-review under responsibility of ICCTSD 2011
引用
收藏
页码:616 / 622
页数:7
相关论文
共 10 条
[1]  
[Anonymous], 2000, Speech and language processing: An introduction to natural language processing, computational linguistics, and speech recognition
[2]  
Anuj M., 2010, P 1 AMR ACM W CEL WO, P326
[3]  
Bishop CM., 1995, NEURAL NETWORKS PATT
[4]   LINKS BETWEEN MARKOV-MODELS AND MULTILAYER PERCEPTRONS [J].
BOURLARD, H ;
WELLEKENS, CJ .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1990, 12 (12) :1167-1178
[5]  
Bourlard H., 1995, IEEE SIGNAL PROCESSI, P25
[6]  
Haton J. P., 2000, HDB NEURAL NETWORKS
[7]  
KRUGER SE, 2005, P EUR, P993
[8]  
Lee K.-F., 1989, AUTOMATIC SPEECH REC
[9]  
Rabiner L. R., 1993, Fundamentals of Speech Recognition
[10]   Neural networks for classification: A survey [J].
Zhang, GQP .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2000, 30 (04) :451-462