A Discrete Wavelet Transform Based Approach to Hindi Speech Recognition

Cited by: 15
Author
Ranjan, Shivesh [1 ]
Affiliation
[1] Manipal Inst Technol, Manipal 576104, Karnataka, India
Source
2010 INTERNATIONAL CONFERENCE ON SIGNAL ACQUISITION AND PROCESSING: ICSAP 2010, PROCEEDINGS | 2010
Keywords
discrete wavelet transform; linear predictive coding; vector quantization; Hindi; speech recognition
DOI
10.1109/ICSAP.2010.21
Chinese Library Classification (CLC) number
TM [Electrical Engineering]; TN [Electronics and Communication Technology]
Subject classification code
0808; 0809
Abstract
In this paper, we propose a new scheme for recognition of isolated words in Hindi Language speech, based on the Discrete Wavelet Transform. We first compute the Discrete Wavelet Transform coefficients of the speech signal. Then, Linear Predictive Coding Coefficients of the Discrete Wavelet Transform coefficients are calculated. Our scheme then uses K Means Algorithm on the obtained Linear Predictive Coding Coefficients to form a Vector Quantized codebook. Recognition of a spoken Hindi word is carried out by first calculating its Discrete Wavelet Transform Coefficients, followed by Linear Predictive Coding Coefficient calculation of these Discrete Wavelet Transform Coefficients, and then deciding in favor of the Hindi word whose corresponding centroid (in the Vector Quantized codebook) gives a minimum squared Euclidean distance error with respect to the word under test.
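The pipeline described in the abstract can be sketched roughly as follows. This is only an illustrative sketch under stated assumptions, not the paper's implementation: the abstract does not name the wavelet, the decomposition depth, the LPC order, or the codebook size, so the code assumes a single-level Haar transform, LPC computed on the approximation coefficients only (autocorrelation method with the Levinson-Durbin recursion), and one centroid per word taken as the mean of that word's training vectors, which stands in for the K-Means step. All function names (`haar_dwt`, `lpc`, `features`, `train_codebook`, `recognize`) are hypothetical.

```python
import math

def haar_dwt(signal):
    """One level of the Haar DWT (assumed wavelet): (approximation, detail)."""
    n = len(signal) - len(signal) % 2  # drop a trailing odd sample
    s = math.sqrt(2.0)
    approx = [(signal[i] + signal[i + 1]) / s for i in range(0, n, 2)]
    detail = [(signal[i] - signal[i + 1]) / s for i in range(0, n, 2)]
    return approx, detail

def lpc(x, order):
    """LPC coefficients a[1..order] so that x[n] ~ sum_j a[j] * x[n - j].

    Autocorrelation method solved with the Levinson-Durbin recursion.
    """
    r = [sum(x[i] * x[i + k] for i in range(len(x) - k))
         for k in range(order + 1)]
    a = [0.0] * (order + 1)
    err = r[0]  # prediction error energy of the order-0 predictor
    for m in range(1, order + 1):
        if err <= 1e-12:  # silent or perfectly predictable input: stop early
            break
        k = (r[m] - sum(a[j] * r[m - j] for j in range(1, m))) / err
        new_a = a[:]
        new_a[m] = k
        for j in range(1, m):
            new_a[j] = a[j] - k * a[m - j]
        a = new_a
        err *= 1.0 - k * k
    return a[1:]

def features(signal, order):
    """Feature vector: LPC of the DWT approximation coefficients (assumption)."""
    approx, _detail = haar_dwt(signal)
    return lpc(approx, order)

def train_codebook(training, order):
    """Map each word to one centroid: the mean of its training feature vectors.

    Simplification: a single mean per word replaces the K-Means clustering.
    """
    codebook = {}
    for word, utterances in training.items():
        vectors = [features(u, order) for u in utterances]
        codebook[word] = [sum(col) / len(vectors) for col in zip(*vectors)]
    return codebook

def sq_dist(u, v):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((p - q) ** 2 for p, q in zip(u, v))

def recognize(signal, codebook, order):
    """Return the word whose centroid is closest to the test word's features."""
    f = features(signal, order)
    return min(codebook, key=lambda w: sq_dist(f, codebook[w]))
```

Here `train_codebook` takes a dictionary mapping each word label to a list of sampled utterances, and `recognize` returns the label with the minimum squared Euclidean distance, mirroring the decision rule stated in the abstract. A multi-level decomposition or a different wavelet family would change only `haar_dwt` and `features`.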
Pages: 345-348
Page count: 4