Tree-structured model selection and simulated-data adaptation for environmental and speaker robust speech recognition

被引:0
|
作者
Thatphithakkul, Nattanun [1 ]
Kruatrachue, Boontee [1 ]
Wutiwiwatchai, Chai [2 ]
Marukatat, Sanparith [2 ]
Boonpiam, Vataya [2 ]
机构
[1] King Mongkuts Inst Technol Ladkrabang, Dept Comp Engn, Bangkok 10520, Thailand
[2] Natl Elect & Comp Technol Ctr, Human Language Technol Lab, Pathum Thani 12120, Thailand
关键词
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper proposes the use of tree-structured model selection and simulated-data in maximum likelihood linear regression (MLLR) adaptation for environment and speaker robust speech recognition. The objective of this work is to solve major problems in robust speech recognition system, namely unknown speaker and unknown environmental noise. The proposed solution is composed of two components. The first one is based on a tree-structured model for selecting a speaker-dependent model that best matches to the input speech. The second component uses simulated-data to adapt the selected acoustic model to fit with the unknown noise. The proposed technique can thus alleviate both problems simultaneously. Experimental results show that the proposed system achieves a higher recognition rate than the system using only the input speech in adaptation and the system using a multi-conditioned acoustic model.
引用
收藏
页码:1570 / +
页数:2
相关论文
共 50 条
  • [31] ON COMBINING DNN AND GMM WITH UNSUPERVISED SPEAKER ADAPTATION FOR ROBUST AUTOMATIC SPEECH RECOGNITION
    Liu, Shilin
    Sim, Khe Chai
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [32] Joint speaker and environment adaptation using Tensor Voice for robust speech recognition
    Jeong, Yongwon
    SPEECH COMMUNICATION, 2014, 58 : 1 - 10
  • [33] Tree-Structured Named Entity Recognition on OCR Data: Analysis, Processing and Results
    Dinarelli, Marco
    Rosset, Sophie
    LREC 2012 - EIGHTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2012, : 1266 - 1272
  • [34] Tree-structured subgroup analysis for censored survival data: Validation of computationally inexpensive model selection criteria
    Negassa, A
    Ciampi, A
    Abrahamowicz, M
    Shapiro, S
    Boivin, JF
    STATISTICS AND COMPUTING, 2005, 15 (03) : 231 - 239
  • [35] Tree-structured subgroup analysis for censored survival data: Validation of computationally inexpensive model selection criteria
    Abdissa Negassa
    Antonio Ciampi
    Michal Abrahamowicz
    Stanley Shapiro
    Jean-François Boivin
    Statistics and Computing, 2005, 15 : 231 - 239
  • [36] An integrated study of speaker normalisation and HMM adaptation for noise robust speaker-independent speech recognition
    Hariharan, R
    Viikki, O
    SPEECH COMMUNICATION, 2002, 37 (3-4) : 349 - 361
  • [37] TREE-STRUCTURED PREDICTION FOR CENSORED SURVIVAL-DATA AND THE COX MODEL
    CIAMPI, A
    NEGASSA, A
    LOU, ZY
    JOURNAL OF CLINICAL EPIDEMIOLOGY, 1995, 48 (05) : 675 - 689
  • [38] Histogram Equalization to Model Adaptation for Robust Speech Recognition
    Suh, Youngjoo
    Kim, Hoirin
    EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2010,
  • [39] Histogram Equalization to Model Adaptation for Robust Speech Recognition
    Youngjoo Suh
    Hoirin Kim
    EURASIP Journal on Advances in Signal Processing, 2010
  • [40] ACOUSTIC MODEL SELECTION USING LIMITED DATA FOR ACCENT ROBUST SPEECH RECOGNITION
    Najafian, Maryam
    Safavi, Saeid
    Hanani, Abualsoud
    Russell, Martin
    2014 PROCEEDINGS OF THE 22ND EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2014, : 1786 - 1790