Robust Speech Recognition Using Improved Vector Taylor Series Algorithm for Embedded Systems

被引:3
作者
Lue, Yong [1 ]
Wu, Haiyang [1 ]
Wu, Zhenyang [1 ]
机构
[1] Southeast Univ, Sch Informat Sci & Engn, Nanjing, Peoples R China
基金
中国国家自然科学基金;
关键词
Robust speech recognition; vector Taylor series; feature compensation; hidden Markov model; ENVIRONMENTS;
D O I
10.1109/TCE.2010.5505999
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper proposes a novel robust speech recognition technique using improved vector Taylor series (VTS) algorithm for embedded systems. It uses a hidden Markov model (HMM) to replace the Gaussian mixture model (GMM) for estimating the clean speech feature, and gives the closed-form solutions of the noise parameters including the mean and variance at each expectation-maximization (EM) iteration. The experimental results show that the proposed algorithm makes a good balance between the computational complexity and recognition accuracy, and thus is more useful for embedded systems(1).
引用
收藏
页码:764 / 769
页数:6
相关论文
共 14 条
[1]  
[Anonymous], 1996, THESIS CARNEGIE MELL
[2]  
[Anonymous], 2000, INTERSPEECH, DOI DOI 10.1016/S0167-6393(03)00016-5
[3]   EFFECTIVENESS OF LINEAR PREDICTION CHARACTERISTICS OF SPEECH WAVE FOR AUTOMATIC SPEAKER IDENTIFICATION AND VERIFICATION [J].
ATAL, BS .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1974, 55 (06) :1304-1312
[4]   MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM [J].
DEMPSTER, AP ;
LAIRD, NM ;
RUBIN, DB .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01) :1-38
[5]   Robust Mandarin speech recognition in car environments for embedded navigation system [J].
Ding, Pei ;
He, Lei ;
Yan, Xiang ;
Zhao, Rui ;
Hao, Jie .
IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2008, 54 (02) :584-590
[6]   Robust distributed speech recognition using speech enhancement [J].
Flynn, Ronan ;
Jones, Edward .
IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2008, 54 (03) :1267-1273
[7]   ROBUST SPEECH RECOGNITION IN ADDITIVE AND CONVOLUTIONAL NOISE USING PARALLEL MODEL COMBINATION [J].
GALES, MJF ;
YOUNG, SJ .
COMPUTER SPEECH AND LANGUAGE, 1995, 9 (04) :289-307
[8]   SPEECH RECOGNITION IN NOISY ENVIRONMENTS - A SURVEY [J].
GONG, YF .
SPEECH COMMUNICATION, 1995, 16 (03) :261-291
[9]   RASTA Processing of Speech [J].
Hermansky, Hynek ;
Morgan, Nelson .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1994, 2 (04) :578-589
[10]   A unified framework of HMM adaptation with joint compensation of additive and convolutive distortions [J].
Li, Jinyu ;
Deng, Li ;
Yu, Dong ;
Gong, Yifan ;
Acero, Alex .
COMPUTER SPEECH AND LANGUAGE, 2009, 23 (03) :389-405