Monte Carlo Model-Space Noise Adaptation for Speech Recognition

被引:0
|
作者
Povey, Daniel [1 ]
Kingsbury, Brian [1 ]
机构
[1] IBM TJ Watson Res Ctr, Yorktown Hts, NY USA
来源
INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5 | 2008年
关键词
speech recognition; noise adaptation;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We describe a Monte Carlo method for model-space noise adaptation of Gaussian mixture models (GMMs). This method combines a single-Gaussian noise model with the GMM speech model to produce an adapted model. It is similar to Parallel Model Combination or model-space Joint, except that it applies to spliced and projected MFCC features rather than to MFCC plus dynamic features. We demonstrate the necessity of re-estimating the noise using both the silence and speech frames rather than just estimating it from silence frames, and obtain improvements on a matched test set without added noise using a system that includes all standard adaptation techniques.
引用
收藏
页码:1281 / 1284
页数:4
相关论文
共 50 条
  • [31] Comparing Jacorian adaptation with cepstral mean normalization and parallel model combination for noise robust speech recognition
    Pärssinen, K
    Salmela, P
    Harju, M
    Kiss, I
    2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 193 - 196
  • [32] Noise Robust Speech Recognition Based on Parallel Model Combination Adaptation Using Frequency-Variant
    Choi, Sook-Nam
    Chung, Hyun-Yeol
    JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2013, 32 (03): : 252 - 261
  • [33] Towards non-stationary model-based noise adaptation for large vocabulary speech recognition
    Kristjansson, T
    Frey, B
    Deng, L
    Acero, A
    2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING, 2001, : 337 - 340
  • [34] A temporal auditory model with adaptation for automatic speech recognition
    Haque, Serajul
    Togneri, Roberto
    Zaknich, Anthony
    2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 1141 - +
  • [35] Histogram Equalization to Model Adaptation for Robust Speech Recognition
    Suh, Youngjoo
    Kim, Hoirin
    EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2010,
  • [36] Model adaptation method for recognition of speech with missing frames
    Lee, Lee-Min
    Jean, Fu-Rong
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2014, 135 (03): : EL166 - EL171
  • [37] IMPACT OF DYNAMIC MODEL ADAPTATION BEYOND SPEECH RECOGNITION
    Batista, Fernando
    Amaral, Rui
    Trancoso, Isabel
    Mamede, Nuno
    2008 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY: SLT 2008, PROCEEDINGS, 2008, : 277 - 280
  • [38] Instantaneous model adaptation method for reverberant speech recognition
    Ban, Sung Min
    Kim, Hyung Soon
    ELECTRONICS LETTERS, 2015, 51 (06) : 528 - 529
  • [39] Model-Space Localization in Serial Ensemble Filters
    Shlyaeva, Anna
    Whitaker, Jeffrey S.
    Snyder, Chris
    JOURNAL OF ADVANCES IN MODELING EARTH SYSTEMS, 2019, 11 (06) : 1627 - 1636
  • [40] STATISTICAL LANGUAGE MODEL ADAPTATION FOR ESTONIAN SPEECH RECOGNITION
    Alumaee, Tanel
    EESTI RAKENDUSLINGVISTIKA UHINGU AASTARAAMAT, 2008, 4 : 5 - 16