A Simulated-Data Adaptation Technique for Robust Speech Recognition

被引:0
|
作者
Thatphithakkul, Nattanun [1 ]
Kruatrachue, Boontee [1 ]
Wutiwiwatchai, Chai [2 ]
Marukatat, Sanparith [2 ]
Boonpiam, Vataya
机构
[1] King Mongkuts Inst Technol Ladkrabang, Dept Comp Engn, Fac Engn, Bangkok 10520, Thailand
[2] Natl Elect & Comp Technol Ctr, Pathum Thani 12120, Thailand
关键词
robust speech recognition; MLLR; online-adaptation;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes an efficient acoustic model adaptation method based on the use of simulated-data in maximum likelihood linear regression (MLLR) adaptation for robust speech recognition. Online MLLR adaptation is an unsupervised process which requires an input speech with phone labels transcribed automatically. Instead of using only the input signal in adaptation, our proposed simulated data method increases the size of adaptation data by adding noise portions extracted from the input speech to a set of pre-recorded clean speech, whose correct transcriptions are known. Various configurations of the proposed method are explored. Evaluations are performed with both additive and real noisy speech. The experimental results show that the proposed system achieves higher recognition rate than the system using only the input speech in adaptation and the system using a multi-conditioned acoustic model.
引用
收藏
页码:777 / +
页数:2
相关论文
共 50 条
  • [1] Tree-structured model selection and simulated-data adaptation for environmental and speaker robust speech recognition
    Thatphithakkul, Nattanun
    Kruatrachue, Boontee
    Wutiwiwatchai, Chai
    Marukatat, Sanparith
    Boonpiam, Vataya
    2007 INTERNATIONAL SYMPOSIUM ON COMMUNICATIONS AND INFORMATION TECHNOLOGIES, VOLS 1-3, 2007, : 1570 - +
  • [2] NOISE ADAPTATION ALGORITHMS FOR ROBUST SPEECH RECOGNITION
    CUNG, HM
    NORMANDIN, Y
    SPEECH COMMUNICATION, 1993, 12 (03) : 267 - 276
  • [3] Feature Adaptation for Robust Mobile Speech Recognition
    Lee, Hyeopwoo
    Yook, Dongsuk
    IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2012, 58 (04) : 1393 - 1398
  • [4] An environment adaptation method for robust speech recognition
    Han, JQ
    Zhang, L
    Wang, CF
    2000 5TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS I-III, 2000, : 726 - 729
  • [5] NORMALIZATION AND ADAPTATION OF SPEECH DATA FOR AUTOMATIC SPEECH RECOGNITION
    SCARR, RWA
    INTERNATIONAL JOURNAL OF MAN-MACHINE STUDIES, 1970, 2 (01): : 41 - 59
  • [6] A PITCH BASED NOISE ESTIMATION TECHNIQUE FOR ROBUST SPEECH RECOGNITION WITH MISSING DATA
    Morales-Cordovilla, Juan A.
    Ma, Ning
    Sanchez, Victoria
    Carmona, Jose L.
    Peinado, Antonio M.
    Barker, Jon
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 4808 - 4811
  • [7] Robust Statistic Estimates for Adaptation in the Task of Speech Recognition
    Zajic, Zbynek
    Machlica, Lukas
    Mueller, Ludek
    TEXT, SPEECH AND DIALOGUE, 2010, 6231 : 464 - 471
  • [8] ROBUST FEATURE SPACE ADAPTATION FOR TELEPHONY SPEECH RECOGNITION
    Lei, Xin
    Hamaker, Jon
    He, Xiaodong
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 773 - +
  • [9] Histogram Equalization to Model Adaptation for Robust Speech Recognition
    Suh, Youngjoo
    Kim, Hoirin
    EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2010,
  • [10] Channel and speaker adaptation techniques for robust speech recognition
    Chen, Jingdong
    Yao, Lei
    Huang, Taiyi
    Shengxue Xuebao/Acta Acustica, 1998, 23 (06): : 537 - 544