Discriminative Training using Heterogeneous Feature Vector for Hindi Automatic Speech Recognition System

Cited by: 0

Authors:

Dua, Mohit [1]

Aggarwal, Rajesh Kumar [1]

Biswas, Mantosh [1]

Affiliations:

[1] Natl Inst Technol, Dept Comp Engn, Kurukshetra, Haryana, India

Source:

2017 INTERNATIONAL CONFERENCE ON COMPUTER AND APPLICATIONS (ICCA) | 2017

Keywords:

automatic speech recognition; MFCC; PLP; minimum phone error; HMM; ACOUSTIC MODELING PROBLEM;

DOI:

Not available

Chinese Library Classification number:

TP39 [Computer applications];

Discipline classification code:

081203 ; 0835 ;

Abstract:

The statistical approach to designing an automatic speech recognition (ASR) system involves two phases: training and testing. The training phase includes parameterization of the input speech signal and acoustic modeling of the speech features. This paper proposes discriminative training of hidden Markov models (HMMs) using a heterogeneous feature vector for a continuous Hindi ASR system. A linear interpolation of mel-frequency cepstral coefficients (MFCC) and perceptual linear prediction (PLP) features is used to generate heterogeneous feature streams. The implemented work uses the maximum mutual information estimation (MMIE) and minimum phone error (MPE) discriminative techniques for acoustic model training. The results show that combining MF-PLP parameterization with MPE discriminative training outperforms the other combinations of feature extraction and discriminative training techniques.
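The linear interpolation of two feature streams mentioned in the abstract can be illustrated with a minimal sketch. The record does not give the interpolation weight, the feature dimensionality, or the toolkit used, so the function name `interpolate_features`, the `weight` parameter, and the sample values below are hypothetical and serve only as an illustration of frame-wise interpolation, not as the authors' implementation.

```python
from typing import List

Frame = List[float]


def interpolate_features(mfcc: List[Frame], plp: List[Frame],
                         weight: float = 0.5) -> List[Frame]:
    """Frame-wise linear interpolation: weight * mfcc + (1 - weight) * plp.

    Assumption: both streams are already time-aligned and have the same
    dimensionality per frame. The weight value is hypothetical.
    """
    if not 0.0 <= weight <= 1.0:
        raise ValueError("weight must lie in [0, 1]")
    if len(mfcc) != len(plp):
        raise ValueError("streams must have the same number of frames")

    combined: List[Frame] = []
    for m_frame, p_frame in zip(mfcc, plp):
        if len(m_frame) != len(p_frame):
            raise ValueError("frames must have the same dimensionality")
        # Weighted sum of corresponding coefficients in the two frames.
        combined.append([weight * m + (1.0 - weight) * p
                         for m, p in zip(m_frame, p_frame)])
    return combined


# Toy usage with made-up two-dimensional frames (not real cepstral data).
toy_mfcc = [[1.0, 2.0], [0.0, 4.0]]
toy_plp = [[3.0, 4.0], [2.0, 0.0]]
toy_combined = interpolate_features(toy_mfcc, toy_plp, weight=0.5)
```

With `weight=1.0` the result reduces to the MFCC stream alone, and with `weight=0.0` to the PLP stream alone, so the weight controls how much each parameterization contributes.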

Citation

Pages: 158-162

Page count: 5

22 references in total

[1] Acero Alejandro, 2012, Acoustical and environmental robustness in automatic speech recognition, V201

[2] Adiga A, 2013, TENCON IEEE REGION

[3] Aggarwal R., 2011, International Journal of Signal Processing, Image Processing and Pattern Recognition, V4, P157

[4] Aggarwal R.K.; Dave M. Acoustic modeling problem for automatic speech recognition system: Advances and refinements (Part II) [J]. International Journal of Speech Technology, 2011, 14 (4): 309-320

[5] Aggarwal, Rajesh; Dave, Mayank. Acoustic modeling problem for automatic speech recognition system: conventional methods (Part I) [J]. International Journal of Speech Technology, 2011, 14 (4): 297-308

[6] Aggarwal RK, 2011, COMM COM INF SC, V139, P261

[7] Aggarwal Rajesh Kumar, 2013, TELECOMMUNICATION SY, V1-10

[8] [Anonymous], 2002, The HTK book

[9] [Anonymous], 1993, Fundamentals of speech recognition

[10] [Anonymous], 2009, HTK BOOK VERSION 3 4
