Tree-structured model selection and simulated-data adaptation for environmental and speaker robust speech recognition

被引：0

作者：

Thatphithakkul, Nattanun ^{[1
]}

Kruatrachue, Boontee ^{[1
]}

Wutiwiwatchai, Chai ^{[2
]}

Marukatat, Sanparith ^{[2
]}

Boonpiam, Vataya ^{[2
]}

机构：

[1] King Mongkuts Inst Technol Ladkrabang, Dept Comp Engn, Bangkok 10520, Thailand

[2] Natl Elect & Comp Technol Ctr, Human Language Technol Lab, Pathum Thani 12120, Thailand

来源：

2007 INTERNATIONAL SYMPOSIUM ON COMMUNICATIONS AND INFORMATION TECHNOLOGIES, VOLS 1-3 | 2007年

关键词：

D O I：

暂无

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

This paper proposes the use of tree-structured model selection and simulated-data in maximum likelihood linear regression (MLLR) adaptation for environment and speaker robust speech recognition. The objective of this work is to solve major problems in robust speech recognition system, namely unknown speaker and unknown environmental noise. The proposed solution is composed of two components. The first one is based on a tree-structured model for selecting a speaker-dependent model that best matches to the input speech. The second component uses simulated-data to adapt the selected acoustic model to fit with the unknown noise. The proposed technique can thus alleviate both problems simultaneously. Experimental results show that the proposed system achieves a higher recognition rate than the system using only the input speech in adaptation and the system using a multi-conditioned acoustic model.

引用

页码：1570 / +

页数：2

共 50 条

[1] A Simulated-Data Adaptation Technique for Robust Speech Recognition
Thatphithakkul, Nattanun
Kruatrachue, Boontee
Wutiwiwatchai, Chai
Marukatat, Sanparith
Boonpiam, Vataya
INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 777 - +
[2] Speaker-independent speech recognition based on tree-structured speaker clustering
Kosaka, T
Matsunaga, S
Sagayama, S
COMPUTER SPEECH AND LANGUAGE, 1996, 10 (01): : 55 - 74
[3] Online Bayesian tree-structured transformation of HMMs with optimal model selection for speaker adaptation
Wang, SJ
Zhao, YX
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2001, 9 (06): : 663 - 677
[4] ROBUST TREE-STRUCTURED NAMED ENTITIES RECOGNITION FROM SPEECH
Raymond, Christian
2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 8475 - 8479
[5] On-line Bayesian speaker adaptation using tree-structured transformation and robust priors
Wang, SJ
Zhao, YX
2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 977 - 980
[6] Tree-structured vector quantization for speech recognition
Barszcz, M
Chen, W
Boulianne, G
Kenny, P
COMPUTER SPEECH AND LANGUAGE, 2000, 14 (03): : 227 - 239
[7] Tree-structured models of parameter dependence for rapid adaptation in large vocabulary conversational speech recognition
Kannan, A
Khudanpur, S
ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI, 1999, : 769 - 772
[8] Channel and speaker adaptation techniques for robust speech recognition
Chen, Jingdong
Yao, Lei
Huang, Taiyi
Shengxue Xuebao/Acta Acustica, 1998, 23 (06): : 537 - 544
[9] A decision tree-structured algorithm of speaker adaptation based on Gaussian Similarity Analysis
Wu, J
Wang, ZY
CHINESE JOURNAL OF ELECTRONICS, 2001, 10 (02): : 166 - 169
[10] Tree-Structured Model with Unbiased Variable Selection and Interaction Detection for Ranking Data
Shih, Yu-Shan
Kung, Yi-Hung
MACHINE LEARNING AND KNOWLEDGE EXTRACTION, 2023, 5 (02): : 448 - 459

← 1 2 3 4 5 →