Tree-structured model selection and simulated-data adaptation for environmental and speaker robust speech recognition

被引:0
|
作者
Thatphithakkul, Nattanun [1 ]
Kruatrachue, Boontee [1 ]
Wutiwiwatchai, Chai [2 ]
Marukatat, Sanparith [2 ]
Boonpiam, Vataya [2 ]
机构
[1] King Mongkuts Inst Technol Ladkrabang, Dept Comp Engn, Bangkok 10520, Thailand
[2] Natl Elect & Comp Technol Ctr, Human Language Technol Lab, Pathum Thani 12120, Thailand
关键词
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper proposes the use of tree-structured model selection and simulated-data in maximum likelihood linear regression (MLLR) adaptation for environment and speaker robust speech recognition. The objective of this work is to solve major problems in robust speech recognition system, namely unknown speaker and unknown environmental noise. The proposed solution is composed of two components. The first one is based on a tree-structured model for selecting a speaker-dependent model that best matches to the input speech. The second component uses simulated-data to adapt the selected acoustic model to fit with the unknown noise. The proposed technique can thus alleviate both problems simultaneously. Experimental results show that the proposed system achieves a higher recognition rate than the system using only the input speech in adaptation and the system using a multi-conditioned acoustic model.
引用
收藏
页码:1570 / +
页数:2
相关论文
共 50 条
  • [41] Extending the UML Standards to Model Tree-Structured Data and Their Access Control Requirements
    Algarin, Alberto De la Rosa
    Demurjian, Steven A.
    SECURITY STANDARDISATION RESEARCH, SSR 2016, 2016, 10074 : 187 - 204
  • [42] Cross-Model Conjunctive Queries over Relation and Tree-Structured Data
    Chen, Yuxing
    Uotila, Valter
    Lu, Jiaheng
    Liu, Zhen Hua
    Das, Souripriya
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, DASFAA 2022, PT I, 2022, : 21 - 37
  • [43] Towards Structured Approaches to Arbitrary Data Selection and Performance Prediction for Speaker Recognition
    Lei, Howard
    ADVANCES IN BIOMETRICS, 2009, 5558 : 513 - 522
  • [44] DYNAMIC ADAPTATION OF HIDDEN MARKOV MODEL FOR ROBUST SPEECH RECOGNITION
    GAO, YQ
    CHEN, YB
    WU, BX
    1989 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1-3, 1989, : 1336 - 1339
  • [45] Tree distributions approximation model for robust discrete speech recognition
    Nacereddine Hammami
    Mouldi Bedda
    Nadir Farah
    International Journal of Speech Technology, 2012, 15 (4) : 455 - 462
  • [46] Tree distributions approximation model for robust discrete speech recognition
    Hammami, Nacereddine
    Bedda, Mouldi
    Farah, Nadir
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2012, 15 (04) : 455 - 462
  • [47] Model transformation for robust speaker recognition from telephone data
    Beaufays, F
    Weintraub, M
    1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 1063 - 1066
  • [49] Environmental robust speech and speaker recognition through multi-channel histogram equalization
    Squartini, Stefano
    Principi, Emanuele
    Rotili, Rudy
    Piazza, Francesco
    NEUROCOMPUTING, 2012, 78 (01) : 111 - 120
  • [50] UTTERANCE-WISE RECURRENT DROPOUT AND ITERATIVE SPEAKER ADAPTATION FOR ROBUST MONAURAL SPEECH RECOGNITION
    Wang, Peidong
    Wang, DeLiang
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 4814 - 4818