Performance Analysis of Speech Enhancement Algorithm for Robust Speech Recognition System

被引:0
作者
Babu, C. Ganesh [1 ]
Vanathi, P. T. [2 ]
Ramachandran, R. [1 ]
Rajaa, M. Senthil [1 ]
机构
[1] BIT, Sathyamangalam, India
[2] PSGCT, Coimbatore, Tamil Nadu, India
来源
RECENT ADVANCES IN NETWORKING, VLSI AND SIGNAL PROCESSING | 2010年
关键词
Hidden Markov Model; Vector Quantization; Speech Enhancement; Linear Predictive Coding; Speech Recognition; VECTOR QUANTIZATION;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Widely Speech Signal Processing has not been used much in the field of electronics and computers due to the complexity and variety of speech signals and sounds with the advent of new technology. However, with modern processes, algorithms, and methods which can process speech signals easily and also recognize the text. Demand for speech recognition technology is expected to raise dramatically over the next few years as people use their mobile phones as all purpose lifestyle devices. In this paper, an implementation of a speech-to-text system using isolated word recognition with a vocabulary of ten words (digits 0 to 9 with each 100 samples) and statistical modeling (Hidden Markov Model - HMM) for machine speech recognition was undertaken. In the training phase, the uttered digits are recorded using 8-bit Pulse Code Modulation (PCM) with a sampling rate of 8 KHz and saved as a wave file using sound recorder software. The system performs speech analysis using the Linear Predictive Coding (LPC) method of degree. From the LPC coefficients, the weighted cepstral coefficients and cepstral time derivatives are derived. From these variables the feature vector for a frame is arrived. Then, the system performs Vector Quantization (VQ) utilizing a vector codebook which result vectors form of the observation sequence. For a given word in the vocabulary, the system builds an HMM model and trains the model during the training phase. The training steps, from Speech Enhancement to HMM model building, are performed using PC-based Matlab programs. Our current framework uses a speech processing module includes Speech Enhancement algorithm with Hidden Markov Model (HMM)-based classification and noise language modeling to achieve effective noise knowledge estimation.
引用
收藏
页码:197 / +
页数:2
相关论文
共 14 条
[1]  
BALAMURAGAN MT, 2006, SOPC BASED SPEECH TE, P83
[2]   Hidden Markov processes [J].
Ephraim, Y ;
Merhav, N .
IEEE TRANSACTIONS ON INFORMATION THEORY, 2002, 48 (06) :1518-1569
[3]   Subjective comparison and evaluation of speech enhancement algorithms [J].
Hu, Yi ;
Loizou, Philipos C. .
SPEECH COMMUNICATION, 2007, 49 (7-8) :588-601
[4]  
JUANG BH, 1987, IEEE T ACOUST SPEECH, V35, P947, DOI 10.1109/TASSP.1987.1165237
[5]   VECTOR QUANTIZATION IN SPEECH CODING [J].
MAKHOUL, J ;
ROUCOS, S ;
GISH, H .
PROCEEDINGS OF THE IEEE, 1985, 73 (11) :1551-1588
[6]   LINEAR PREDICTION - TUTORIAL REVIEW [J].
MAKHOUL, J .
PROCEEDINGS OF THE IEEE, 1975, 63 (04) :561-580
[7]  
MARKEL JD, 1976, LINEAR PREDICTION SP, P71
[8]  
PORUBA J, 2002, P 4 IEEE INT C DEV C
[9]   ON THE APPLICATION OF VECTOR QUANTIZATION AND HIDDEN MARKOV-MODELS TO SPEAKER-INDEPENDENT, ISOLATED WORD RECOGNITION [J].
RABINER, LR ;
LEVINSON, SE ;
SONDHI, MM .
BELL SYSTEM TECHNICAL JOURNAL, 1983, 62 (04) :1075-1105
[10]   A TUTORIAL ON HIDDEN MARKOV-MODELS AND SELECTED APPLICATIONS IN SPEECH RECOGNITION [J].
RABINER, LR .
PROCEEDINGS OF THE IEEE, 1989, 77 (02) :257-286