Robust Speech Recognition Using Improved Vector Taylor Series Algorithm for Embedded Systems

Cited by: 3

Authors:

Lue, Yong [1]

Wu, Haiyang [1]

Wu, Zhenyang [1]

Affiliations:

[1] Southeast Univ, Sch Informat Sci & Engn, Nanjing, Peoples R China

Source:

IEEE TRANSACTIONS ON CONSUMER ELECTRONICS | 2010 / Vol. 56 / Issue 02

Funding:

National Natural Science Foundation of China;

Keywords:

Robust speech recognition; vector Taylor series; feature compensation; hidden Markov model; ENVIRONMENTS;

DOI:

10.1109/TCE.2010.5505999

Chinese Library Classification (CLC) number:

TM [Electrical engineering]; TN [Electronic technology, communication technology];

Discipline classification codes:

0808 ; 0809 ;

Abstract:

This paper proposes a novel robust speech recognition technique for embedded systems, based on an improved vector Taylor series (VTS) algorithm. It uses a hidden Markov model (HMM) in place of the Gaussian mixture model (GMM) to estimate the clean speech features, and gives closed-form solutions for the noise parameters, including the mean and variance, at each expectation-maximization (EM) iteration. The experimental results show that the proposed algorithm strikes a good balance between computational complexity and recognition accuracy, and is thus more useful for embedded systems.

Citation

Pages: 764-769

Page count: 6

References: 14 in total

[1] [Anonymous], 1996, THESIS CARNEGIE MELL

[2] [Anonymous], 2000, INTERSPEECH, DOI 10.1016/S0167-6393(03)00016-5

[3] EFFECTIVENESS OF LINEAR PREDICTION CHARACTERISTICS OF SPEECH WAVE FOR AUTOMATIC SPEAKER IDENTIFICATION AND VERIFICATION [J].

ATAL, BS .

JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1974, 55 (06) :1304-1312

[4] MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM [J].

DEMPSTER, AP ;

LAIRD, NM ;

RUBIN, DB .

JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01) :1-38

[5] Robust Mandarin speech recognition in car environments for embedded navigation system [J].

Ding, Pei ;

He, Lei ;

Yan, Xiang ;

Zhao, Rui ;

Hao, Jie .

IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2008, 54 (02) :584-590

[6] Robust distributed speech recognition using speech enhancement [J].

Flynn, Ronan ;

Jones, Edward .

IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2008, 54 (03) :1267-1273

[7] ROBUST SPEECH RECOGNITION IN ADDITIVE AND CONVOLUTIONAL NOISE USING PARALLEL MODEL COMBINATION [J].

GALES, MJF ;

YOUNG, SJ .

COMPUTER SPEECH AND LANGUAGE, 1995, 9 (04) :289-307

[8] SPEECH RECOGNITION IN NOISY ENVIRONMENTS - A SURVEY [J].

GONG, YF .

SPEECH COMMUNICATION, 1995, 16 (03) :261-291

[9] RASTA Processing of Speech [J].

Hermansky, Hynek ;

Morgan, Nelson .

IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1994, 2 (04) :578-589

[10] A unified framework of HMM adaptation with joint compensation of additive and convolutive distortions [J].

Li, Jinyu ;

Deng, Li ;

Yu, Dong ;

Gong, Yifan ;

Acero, Alex .

COMPUTER SPEECH AND LANGUAGE, 2009, 23 (03) :389-405
