Speech Recognition Approach Based on Speech Feature Clustering and HMM

Cited by: 3

Authors:

Li, XinGuang [1]

Yao, MinFeng [1]

Yang, JiaNeng [1]

Affiliations:

[1] Guangdong Univ Foreign Studies, Guangzhou 510006, Guangdong, Peoples R China

Source:

JOURNAL OF COMPUTERS | 2012 / Vol. 7 / No. 09

Keywords:

HMM; Speech Feature Parameters; Segment-Mean; K-Means Clustering; Model Cross-group;

DOI:

10.4304/jcp.7.9.2269-2275

Chinese Library Classification (CLC) number:

TP39 [Computer Applications];

Discipline classification code:

081203 ; 0835 ;

Abstract:

The paper presents a Segment-Mean method for reducing the dimension of speech feature parameters. The K-Means function is then used to group the dimension-reduced feature parameters, so that speech samples are classified into different clusters according to their features. The paper also proposes a cross-group training algorithm for clustering the speech feature parameters, which improves the accuracy of the clustering function. During recognition, the system uses a cross-group HMM algorithm for pattern matching, which reduces the computation by more than 50% without reducing the recognition rate of the small-vocabulary speech recognition system.
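The pipeline summarized in the abstract can be illustrated with a minimal sketch. The abstract does not give the exact formulas, so everything below is an assumption made for illustration: `segment_mean` averages the frames within equal time segments and concatenates the means into a fixed-length vector, `k_means` is plain Lloyd's algorithm with the first `k` vectors as initial centroids, and `candidate_clusters` (a hypothetical helper with a hypothetical `n_best` parameter) picks the closest clusters so that only the HMMs belonging to them would need to be scored. It is not the authors' implementation.

```python
from typing import List, Sequence

Vector = List[float]


def segment_mean(frames: Sequence[Sequence[float]], n_segments: int) -> Vector:
    """Assumed reading of Segment-Mean: split the frame sequence into
    n_segments roughly equal time segments, average each segment, and
    concatenate the means into one fixed-length vector."""
    n_frames = len(frames)
    if n_frames < n_segments:
        raise ValueError("need at least one frame per segment")
    dim = len(frames[0])
    out: Vector = []
    for s in range(n_segments):
        start = s * n_frames // n_segments
        end = (s + 1) * n_frames // n_segments
        seg = frames[start:end]
        out.extend(sum(f[d] for f in seg) / len(seg) for d in range(dim))
    return out


def sq_dist(a: Sequence[float], b: Sequence[float]) -> float:
    """Squared Euclidean distance between two vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def k_means(vectors: Sequence[Vector], k: int, n_iter: int = 20) -> List[Vector]:
    """Plain Lloyd's algorithm; the first k vectors seed the centroids."""
    centroids = [list(v) for v in vectors[:k]]
    for _ in range(n_iter):
        # Assign every vector to its nearest centroid.
        labels = [
            min(range(k), key=lambda c: sq_dist(v, centroids[c])) for v in vectors
        ]
        # Move each centroid to the mean of its members (skip empty clusters).
        for c in range(k):
            members = [v for v, lab in zip(vectors, labels) if lab == c]
            if members:
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return centroids


def candidate_clusters(vec: Vector, centroids: Sequence[Vector], n_best: int = 2) -> List[int]:
    """Hypothetical helper: indices of the n_best closest clusters.
    Only the HMMs assigned to these clusters would be evaluated."""
    order = sorted(range(len(centroids)), key=lambda c: sq_dist(vec, centroids[c]))
    return order[:n_best]
```

In this sketch, an utterance with any number of frames is mapped to a vector of length `n_segments * dim`, which is what makes K-Means applicable to utterances of different durations. Restricting the search to a few candidate clusters is one way the number of HMM evaluations could be reduced.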


Pages: 2269-2275

Page count: 7
