Tree-structured model selection and simulated-data adaptation for environmental and speaker robust speech recognition

被引:0
|
作者
Thatphithakkul, Nattanun [1 ]
Kruatrachue, Boontee [1 ]
Wutiwiwatchai, Chai [2 ]
Marukatat, Sanparith [2 ]
Boonpiam, Vataya [2 ]
机构
[1] King Mongkuts Inst Technol Ladkrabang, Dept Comp Engn, Bangkok 10520, Thailand
[2] Natl Elect & Comp Technol Ctr, Human Language Technol Lab, Pathum Thani 12120, Thailand
关键词
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper proposes the use of tree-structured model selection and simulated-data in maximum likelihood linear regression (MLLR) adaptation for environment and speaker robust speech recognition. The objective of this work is to solve major problems in robust speech recognition system, namely unknown speaker and unknown environmental noise. The proposed solution is composed of two components. The first one is based on a tree-structured model for selecting a speaker-dependent model that best matches to the input speech. The second component uses simulated-data to adapt the selected acoustic model to fit with the unknown noise. The proposed technique can thus alleviate both problems simultaneously. Experimental results show that the proposed system achieves a higher recognition rate than the system using only the input speech in adaptation and the system using a multi-conditioned acoustic model.
引用
收藏
页码:1570 / +
页数:2
相关论文
共 50 条
  • [1] A Simulated-Data Adaptation Technique for Robust Speech Recognition
    Thatphithakkul, Nattanun
    Kruatrachue, Boontee
    Wutiwiwatchai, Chai
    Marukatat, Sanparith
    Boonpiam, Vataya
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 777 - +
  • [2] Speaker-independent speech recognition based on tree-structured speaker clustering
    Kosaka, T
    Matsunaga, S
    Sagayama, S
    COMPUTER SPEECH AND LANGUAGE, 1996, 10 (01): : 55 - 74
  • [3] Online Bayesian tree-structured transformation of HMMs with optimal model selection for speaker adaptation
    Wang, SJ
    Zhao, YX
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2001, 9 (06): : 663 - 677
  • [4] ROBUST TREE-STRUCTURED NAMED ENTITIES RECOGNITION FROM SPEECH
    Raymond, Christian
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 8475 - 8479
  • [5] On-line Bayesian speaker adaptation using tree-structured transformation and robust priors
    Wang, SJ
    Zhao, YX
    2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 977 - 980
  • [6] Tree-structured vector quantization for speech recognition
    Barszcz, M
    Chen, W
    Boulianne, G
    Kenny, P
    COMPUTER SPEECH AND LANGUAGE, 2000, 14 (03): : 227 - 239
  • [7] Tree-structured models of parameter dependence for rapid adaptation in large vocabulary conversational speech recognition
    Kannan, A
    Khudanpur, S
    ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI, 1999, : 769 - 772
  • [8] Channel and speaker adaptation techniques for robust speech recognition
    Chen, Jingdong
    Yao, Lei
    Huang, Taiyi
    Shengxue Xuebao/Acta Acustica, 1998, 23 (06): : 537 - 544
  • [9] A decision tree-structured algorithm of speaker adaptation based on Gaussian Similarity Analysis
    Wu, J
    Wang, ZY
    CHINESE JOURNAL OF ELECTRONICS, 2001, 10 (02): : 166 - 169
  • [10] Tree-Structured Model with Unbiased Variable Selection and Interaction Detection for Ranking Data
    Shih, Yu-Shan
    Kung, Yi-Hung
    MACHINE LEARNING AND KNOWLEDGE EXTRACTION, 2023, 5 (02): : 448 - 459