A Vector Space Approach to Environment Modeling for Robust Speech Recognition

Cited by: 0
Authors
Tsao, Yu [1]
Lee, Chin-Hui [1]
Affiliation
[1] Georgia Inst Technol, Sch Elect & Comp Engn, Atlanta, GA 30332 USA
Source
INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5 | 2006
Keywords
acoustic modeling; environment adaptation
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We propose a vector space approach to characterizing environments for robust speech recognition. We represent a given environment by a super-vector formed by concatenating all the mean vectors of the Gaussian mixture components of the state observation densities of all hidden Markov models trained in that environment. A super-vector for a new environment can then be obtained either by interpolating among a collection of super-vectors trained from many real or simulated environments, or by transforming an anchor super-vector for a specific environment, such as a clean condition. At a 5 dB signal-to-noise ratio (SNR), both the interpolation- and transformation-based approaches achieve a significant error rate reduction of close to 47% over a baseline system with cepstral mean subtraction (CMS), using only two adaptation utterances. When incorporating N-best information to perform unsupervised adaptation at 5 dB SNR with the same two utterances, we achieve a relative error reduction of about 40%, close to that achieved in the supervised mode.
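The super-vector construction described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the nested-list layout of the HMM means, the function names, and the toy numbers are all assumptions made for illustration. The abstract does not state the form of the transformation, so the scale-plus-bias mapping below is only a hypothetical stand-in, and the estimation of interpolation weights from adaptation utterances is not shown.

```python
from typing import Dict, List, Sequence

Vector = List[float]
# Assumed layout: model name -> list of states -> list of mixture mean vectors.
HmmMeans = Dict[str, List[List[Vector]]]


def build_supervector(hmm_means: HmmMeans) -> Vector:
    """Concatenate every Gaussian mean of every state of every HMM."""
    supervector: Vector = []
    # Sort model names so dimensions line up across environments.
    for model_name in sorted(hmm_means):
        for state_mixtures in hmm_means[model_name]:
            for mean in state_mixtures:
                supervector.extend(mean)
    return supervector


def interpolate(supervectors: Sequence[Vector], weights: Sequence[float]) -> Vector:
    """Weighted sum of environment super-vectors (weights assumed given)."""
    if len(supervectors) != len(weights):
        raise ValueError("need exactly one weight per super-vector")
    dim = len(supervectors[0])
    if any(len(sv) != dim for sv in supervectors):
        raise ValueError("all super-vectors must have the same dimension")
    return [sum(w * sv[i] for w, sv in zip(weights, supervectors))
            for i in range(dim)]


def transform_anchor(anchor: Vector, scale: float, bias: Vector) -> Vector:
    """Hypothetical transformation of an anchor super-vector: scale * anchor + bias."""
    if len(anchor) != len(bias):
        raise ValueError("bias must match the anchor dimension")
    return [scale * a + b for a, b in zip(anchor, bias)]


# Toy data: one HMM "a" with two states, one 2-dimensional mixture per state.
clean_sv = build_supervector({"a": [[[0.0, 1.0]], [[2.0, 3.0]]]})
noisy_sv = build_supervector({"a": [[[1.0, 1.0]], [[2.0, 5.0]]]})

# New environment halfway between the two known environments.
mixed_sv = interpolate([clean_sv, noisy_sv], [0.5, 0.5])

# New environment reached by shifting the clean (anchor) super-vector.
shifted_sv = transform_anchor(clean_sv, 1.0, [1.0, 0.0, 0.0, 2.0])
```

In this toy setting each environment reduces to a single point in a 4-dimensional space, which is what lets new environments be reached either by combining known points or by moving an anchor point.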
Pages: 785-788
Page count: 4
Related papers
50 in total
  • [41] A perceptual masking approach for noise robust speech recognition
    Maganti, Hari Krishna
    Matassoni, Marco
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2012
  • [42] Robust speech recognition based on a Bayesian prediction approach
    Jiang, H
    Hirose, K
    Huo, Q
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1999, 7 (04): 426-440
  • [43] Approach of feature with confident weight for robust speech recognition
    Ge, YB
    Song, J
    Ge, LN
    Shirai, K
    2004 IEEE 6TH WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING, 2004: 11-14
  • [44] AN ECONOMICAL APPROACH TO MODELING SPEECH RECOGNITION ACCURACY
    SPINE, TM
    WILLIGES, BH
    MAYNARD, JF
    INTERNATIONAL JOURNAL OF MAN-MACHINE STUDIES, 1984, 21 (03): 191-202
  • [45] SECOND ORDER VECTOR TAYLOR SERIES BASED ROBUST SPEECH RECOGNITION
    Bu, Suliang
    Qian, Yanmin
    Sim, Khe Chai
    You, Yongbin
    Yu, Kai
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014
  • [46] ON NOISE ESTIMATION FOR ROBUST SPEECH RECOGNITION USING VECTOR TAYLOR SERIES
    Zhao, Yong
    Juang, Biing-Hwang
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010: 4290-4293
  • [47] Matrix quantization with vector quantization error compensation for robust speech recognition
    Cong, L
    Asghar, S
    1998 IEEE SECOND WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING, 1998: 131-136
  • [48] Structured Support Vector Machines for Noise Robust Continuous Speech Recognition
    Zhang, Shi-Xiong
    Gales, M. J. F.
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011: 996-999
  • [49] Cepstral vector normalization based on stereo data for robust speech recognition
    Buera, Luis
    Lleida, Eduardo
    Miguel, Antonio
    Ortega, Alfonso
    Saz, Oscar
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (03): 1098-1113
  • [50] Robust Speech Recognition under Noisy Environment using Speech Rate Training System
    Dhas, Edwin D.
    Ruban, Bency L.
    King, Arul J.
    2012 THIRD INTERNATIONAL CONFERENCE ON COMPUTING COMMUNICATION & NETWORKING TECHNOLOGIES (ICCCNT), 2012