Performance Analysis of Speech Enhancement Algorithm for Robust Speech Recognition System

Cited by: 0

Authors:

Babu, C. Ganesh [1]

Vanathi, P. T. [2]

Ramachandran, R. [1]

Rajaa, M. Senthil [1]

Affiliations:

[1] BIT, Sathyamangalam, India

[2] PSGCT, Coimbatore, Tamil Nadu, India

Source:

RECENT ADVANCES IN NETWORKING, VLSI AND SIGNAL PROCESSING | 2010

Keywords:

Hidden Markov Model; Vector Quantization; Speech Enhancement; Linear Predictive Coding; Speech Recognition

DOI:

Not available

Chinese Library Classification (CLC) number:

TP3 [Computing technology, computer technology];

Discipline classification code:

0812;

Abstract:

Speech signal processing has not been widely used in electronics and computing because of the complexity and variety of speech signals and sounds. However, with the advent of new technology, modern processes, algorithms, and methods can process speech signals easily and also recognize the text. Demand for speech recognition technology is expected to rise dramatically over the next few years as people use their mobile phones as all-purpose lifestyle devices. This paper describes an implementation of a speech-to-text system that uses isolated word recognition with a vocabulary of ten words (the digits 0 to 9, with 100 samples each) and statistical modeling (Hidden Markov Model, HMM) for machine speech recognition. In the training phase, the uttered digits are recorded using 8-bit Pulse Code Modulation (PCM) at a sampling rate of 8 kHz and saved as a wave file using sound recorder software. The system performs speech analysis using the Linear Predictive Coding (LPC) method of degree. From the LPC coefficients, the weighted cepstral coefficients and cepstral time derivatives are derived, and together these form the feature vector for a frame. The system then performs Vector Quantization (VQ) using a vector codebook, and the resulting vectors form the observation sequence. For each word in the vocabulary, the system builds an HMM and trains it during the training phase. The training steps, from speech enhancement to HMM building, are performed using PC-based Matlab programs. The current framework uses a speech processing module that includes a speech enhancement algorithm with HMM-based classification and noise language modeling to achieve effective noise knowledge estimation.
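The front end outlined in the abstract (LPC analysis, cepstral features, vector quantization) can be illustrated with a minimal Python sketch. This is not the authors' Matlab code. The LPC order passed by the caller, the raised-sine lifter used as the cepstral weighting, the central-difference approximation of the time derivative, and the codebook contents are all assumptions made for illustration, since the abstract does not specify them.

```python
import math


def autocorrelation(frame, max_lag):
    """Return the autocorrelation values r[0..max_lag] of one speech frame."""
    n = len(frame)
    return [sum(frame[i] * frame[i + lag] for i in range(n - lag))
            for lag in range(max_lag + 1)]


def lpc(frame, order):
    """Levinson-Durbin recursion.

    Returns predictor coefficients a[1..order] for the model
    s[n] ~ sum_k a[k] * s[n - k], together with the residual energy.
    """
    r = autocorrelation(frame, order)
    a = [0.0] * (order + 1)
    err = r[0]
    for i in range(1, order + 1):
        if err <= 0.0:  # silent or degenerate frame: stop early
            break
        k = (r[i] - sum(a[j] * r[i - j] for j in range(1, i))) / err
        new_a = a[:]
        new_a[i] = k
        for j in range(1, i):
            new_a[j] = a[j] - k * a[i - j]
        a = new_a
        err *= 1.0 - k * k
    return a[1:], err


def lpc_to_cepstrum(a, num_ceps):
    """Convert predictor coefficients to cepstral coefficients c[1..num_ceps]."""
    p = len(a)
    c = [0.0] * (num_ceps + 1)
    for m in range(1, num_ceps + 1):
        acc = a[m - 1] if m <= p else 0.0
        for k in range(1, m):
            if m - k <= p:
                acc += (k / m) * c[k] * a[m - k - 1]
        c[m] = acc
    return c[1:]


def lifter(cepstrum):
    """Weight the cepstrum with a raised-sine window (an assumed choice)."""
    q = len(cepstrum)
    return [(1.0 + (q / 2.0) * math.sin(math.pi * m / q)) * c
            for m, c in enumerate(cepstrum, start=1)]


def delta(frames):
    """Approximate the cepstral time derivative by a central difference
    across neighbouring frames (edge frames are repeated)."""
    last = len(frames) - 1
    return [[(nxt - prv) / 2.0
             for prv, nxt in zip(frames[max(t - 1, 0)], frames[min(t + 1, last)])]
            for t in range(len(frames))]


def quantize(vector, codebook):
    """Return the index of the nearest codeword (squared Euclidean distance)."""
    return min(range(len(codebook)),
               key=lambda i: sum((v - c) ** 2
                                 for v, c in zip(vector, codebook[i])))
```

In such a sketch, each frame's feature vector would be the weighted cepstrum concatenated with its delta, and the sequence of `quantize` indices over all frames would serve as the discrete observation sequence for a per-word HMM.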


Pages: 197 / +

Page count: 2
