A Simulated-Data Adaptation Technique for Robust Speech Recognition

被引:0
|
作者
Thatphithakkul, Nattanun [1 ]
Kruatrachue, Boontee [1 ]
Wutiwiwatchai, Chai [2 ]
Marukatat, Sanparith [2 ]
Boonpiam, Vataya
机构
[1] King Mongkuts Inst Technol Ladkrabang, Dept Comp Engn, Fac Engn, Bangkok 10520, Thailand
[2] Natl Elect & Comp Technol Ctr, Pathum Thani 12120, Thailand
关键词
robust speech recognition; MLLR; online-adaptation;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes an efficient acoustic model adaptation method based on the use of simulated-data in maximum likelihood linear regression (MLLR) adaptation for robust speech recognition. Online MLLR adaptation is an unsupervised process which requires an input speech with phone labels transcribed automatically. Instead of using only the input signal in adaptation, our proposed simulated data method increases the size of adaptation data by adding noise portions extracted from the input speech to a set of pre-recorded clean speech, whose correct transcriptions are known. Various configurations of the proposed method are explored. Evaluations are performed with both additive and real noisy speech. The experimental results show that the proposed system achieves higher recognition rate than the system using only the input speech in adaptation and the system using a multi-conditioned acoustic model.
引用
收藏
页码:777 / +
页数:2
相关论文
共 50 条
  • [21] UNSUPERVISED ADAPTATION WITH DOMAIN SEPARATION NETWORKS FOR ROBUST SPEECH RECOGNITION
    Meng, Zhong
    Chen, Zhuo
    Mazalov, Vadim
    Li, Jinyu
    Gong, Yifan
    2017 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU), 2017, : 214 - 221
  • [22] Multi-Channel Feature Adaptation for Robust Speech Recognition
    Zhang, Zhaofeng
    Xiao, Xiong
    Wang, Longbiao
    Dang, Jianwu
    Iwahashi, Masahiro
    Chng, Eng Siong
    Li, Haizhou
    2016 10TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2016,
  • [23] Unsupervised Adaptation with Adversarial Dropout Regularization for Robust Speech Recognition
    Guo, Pengcheng
    Sun, Sining
    Xie, Lei
    INTERSPEECH 2019, 2019, : 749 - 753
  • [24] An unsupervised deep domain adaptation approach for robust speech recognition
    Sun, Sining
    Zhang, Binbin
    Xie, Lei
    Zhang, Yanning
    NEUROCOMPUTING, 2017, 257 : 79 - 87
  • [25] Unsupervised speaker adaptation for robust speech recognition in real environments
    Yamade, S
    Baba, A
    Yoshikawa, S
    Lee, A
    Saruwatari, H
    Shikano, K
    ELECTRONICS AND COMMUNICATIONS IN JAPAN PART II-ELECTRONICS, 2005, 88 (08): : 30 - 41
  • [26] Study on the Use and Adaptation of Bottleneck Features for Robust Speech Recognition of Nonlinearly Distorted Speech
    Malek, Jiri
    Cerva, Petr
    Seps, Ladislav
    Nouza, Jan
    SIGMAP: PROCEEDINGS OF THE 13TH INTERNATIONAL JOINT CONFERENCE ON E-BUSINESS AND TELECOMMUNICATIONS - VOL. 5, 2016, : 65 - 71
  • [27] UNSUPERVISED DOMAIN ADAPTATION FOR ROBUST SPEECH RECOGNITION VIA VARIATIONAL AUTOENCODER-BASED DATA AUGMENTATION
    Hsu, Wei-Ning
    Zhang, Yu
    Glass, James
    2017 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU), 2017, : 16 - 23
  • [28] Unsupervised Data-Driven Feature Vector Normalization With Acoustic Model Adaptation for Robust Speech Recognition
    Buera, Luis
    Miguel, Antonio
    Saz, Oscar
    Ortega, Alfonso
    Lleida, Eduardo
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (02): : 296 - 309
  • [29] Combining feature compensation and Weighted Viterbi Decoding for noise robust speech recognition with limited adaptation data
    Cui, XD
    Alwan, A
    2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING, 2004, : 969 - 972
  • [30] Robust Submodular Data Partitioning for Distributed Speech Recognition
    Qi, Jun
    Tejedor, Javier
    2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 2254 - 2258