Speech Feature Compensation in Multiple Model Based Speech Recognition System Using VTS-based Environmental Parameter Estimation

被引：0

作者：

Chung, Yongjoo ^{[1
]}

机构：

[1] Keimyung Univ, Dept Elect, Taegu, South Korea

来源：

2013 INTERNATIONAL CONFERENCE ON COMPUTER APPLICATIONS TECHNOLOGY (ICCAT) | 2013年

关键词：

component; speech recognition; multiple-model frame; noise robustness; environmental sniffing;

D O I：

暂无

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

Multiple-model based speech recognition (MMSR) has been shown to be quite successful in noisy speech recognition. In this study, we propose a method to improve recognition performance by mitigating the mismatch in noise/channel type for an MMSR solution. We propose a novel method to reduce the effect of noise and channel mismatch by compensating the test noisy speech in the log-spectrum domain. We derive the relation between the log-spectrum vectors in the test and training noisy speech by using vector Taylor series (VTS) algorithm. Based on it, minimum mean square error estimation of the training log-spectrum vectors is obtained from the test noisy vectors by iteratively estimating environmental parameters. The estimated training vectors are used for recognition to reduce the noise and channel mismatch. We could find that the proposed method achieved WER reduction based on the Aurora2 task by +18.7% compared with a conventional MMSR method.

引用

页数：2

共 50 条

[1] A VTS-based Feature Compensation Method using Noisy Speech HMMs
Chung, Yongjoo
APPLIED MATHEMATICS & INFORMATION SCIENCES, 2014, 8 (06): : 2849 - 2856
[2] A VTS-BASED FEATURE COMPENSATION APPROACH TO NOISY SPEECH RECOGNITION USING MIXTURE MODELS OF DISTORTION
Du, Jun
Huo, Qiang
2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 7078 - 7082
[3] IVN-Based Joint Training Of GMM And HMMs Using An Improved VTS-Based Feature Compensation For Noisy Speech Recognition
Du, Jun
Huo, Qiang
13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 1226 - 1229
[4] A Comparative Study of Noise Estimation Algorithms for VTS-Based Robust Speech Recognition
Zhao, Yong
Juang, Biing-Hwang
11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2090 - 2093
[5] COMBINING EIGENVOICE SPEAKER MODELING AND VTS-BASED ENVIRONMENT COMPENSATION FOR ROBUST SPEECH RECOGNITION
Ou, Zhijian
Deng, Kan
2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4673 - 4676
[6] Model-based feature compensation for robust speech recognition
Shen, Haifeng
Li, Qunxia
Guo, Jun
Liu, Gang
FUNDAMENTA INFORMATICAE, 2006, 72 (04) : 529 - 539
[7] Model-based feature compensation for robust speech recognition
School of Information Engineering, Beijing University of Posts and Telecommunications, Beijing, 100876, China
不详
不详
Fundam Inf, 2006, 4 (529-539):
[8] VTS feature compensation based on two-layer GMM structure for robust speech recognition
Zhou, Lin
Li, Haijing
Chen, Ying
Wu, Zhenyang
Lu, Yong
2016 8TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS & SIGNAL PROCESSING (WCSP), 2016,
[9] Feature compensation based on independent noise estimation for robust speech recognition
Lu, Yong
Lin, Han
Wu, Pingping
Chen, Yitao
EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2021, 2021 (01)
[10] Feature compensation based on independent noise estimation for robust speech recognition
Yong Lü
Han Lin
Pingping Wu
Yitao Chen
EURASIP Journal on Audio, Speech, and Music Processing, 2021

← 1 2 3 4 5 →