Speech Feature Compensation in Multiple Model Based Speech Recognition System Using VTS-based Environmental Parameter Estimation

被引:0
|
作者
Chung, Yongjoo [1 ]
机构
[1] Keimyung Univ, Dept Elect, Taegu, South Korea
来源
2013 INTERNATIONAL CONFERENCE ON COMPUTER APPLICATIONS TECHNOLOGY (ICCAT) | 2013年
关键词
component; speech recognition; multiple-model frame; noise robustness; environmental sniffing;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Multiple-model based speech recognition (MMSR) has been shown to be quite successful in noisy speech recognition. In this study, we propose a method to improve recognition performance by mitigating the mismatch in noise/channel type for an MMSR solution. We propose a novel method to reduce the effect of noise and channel mismatch by compensating the test noisy speech in the log-spectrum domain. We derive the relation between the log-spectrum vectors in the test and training noisy speech by using vector Taylor series (VTS) algorithm. Based on it, minimum mean square error estimation of the training log-spectrum vectors is obtained from the test noisy vectors by iteratively estimating environmental parameters. The estimated training vectors are used for recognition to reduce the noise and channel mismatch. We could find that the proposed method achieved WER reduction based on the Aurora2 task by +18.7% compared with a conventional MMSR method.
引用
收藏
页数:2
相关论文
共 50 条
  • [1] A VTS-based Feature Compensation Method using Noisy Speech HMMs
    Chung, Yongjoo
    APPLIED MATHEMATICS & INFORMATION SCIENCES, 2014, 8 (06): : 2849 - 2856
  • [2] A VTS-BASED FEATURE COMPENSATION APPROACH TO NOISY SPEECH RECOGNITION USING MIXTURE MODELS OF DISTORTION
    Du, Jun
    Huo, Qiang
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 7078 - 7082
  • [3] IVN-Based Joint Training Of GMM And HMMs Using An Improved VTS-Based Feature Compensation For Noisy Speech Recognition
    Du, Jun
    Huo, Qiang
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 1226 - 1229
  • [4] A Comparative Study of Noise Estimation Algorithms for VTS-Based Robust Speech Recognition
    Zhao, Yong
    Juang, Biing-Hwang
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2090 - 2093
  • [5] COMBINING EIGENVOICE SPEAKER MODELING AND VTS-BASED ENVIRONMENT COMPENSATION FOR ROBUST SPEECH RECOGNITION
    Ou, Zhijian
    Deng, Kan
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4673 - 4676
  • [6] Model-based feature compensation for robust speech recognition
    Shen, Haifeng
    Li, Qunxia
    Guo, Jun
    Liu, Gang
    FUNDAMENTA INFORMATICAE, 2006, 72 (04) : 529 - 539
  • [7] Model-based feature compensation for robust speech recognition
    School of Information Engineering, Beijing University of Posts and Telecommunications, Beijing, 100876, China
    不详
    不详
    Fundam Inf, 2006, 4 (529-539):
  • [8] VTS feature compensation based on two-layer GMM structure for robust speech recognition
    Zhou, Lin
    Li, Haijing
    Chen, Ying
    Wu, Zhenyang
    Lu, Yong
    2016 8TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS & SIGNAL PROCESSING (WCSP), 2016,
  • [9] Feature compensation based on independent noise estimation for robust speech recognition
    Lu, Yong
    Lin, Han
    Wu, Pingping
    Chen, Yitao
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2021, 2021 (01)
  • [10] Feature compensation based on independent noise estimation for robust speech recognition
    Yong Lü
    Han Lin
    Pingping Wu
    Yitao Chen
    EURASIP Journal on Audio, Speech, and Music Processing, 2021