Confusion-Based Entropy-Weighted Decoding for Robust Speech Recognition

被引：0

作者：

Chen, Yi ^{[1
]}

Wan, Chia-yu ^{[1
]}

Lee, Lin-shan ^{[1
]}

机构：

[1] Natl Taiwan Univ, Grad Inst Commun Engn, Taipei 10764, Taiwan

来源：

INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5 | 2008年

关键词：

speech recognition; robustness;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

An entropy-based feature parameter weighting scheme was proposed previously [1], in which the scores obtained from different feature parameters are weighted differently in the decoding process according to an entropy measure. In this paper, we propose a more delicate entropy measure for this purpose considering the inherent confusion among different acoustic classes. If a set of acoustic classes are easily confused, those feature parameters which can distinguish them should be emphasized. Extensive experiments with the Aurora 2 testing environment verified that this approach is equally useful for different types of features, and can be easily integrated with typical existing robust speech recognition approaches.

引用

页码：1008 / 1011

页数：4

共 50 条

[1] Speech confusion index (Φ): A confusion-based speech quality indicator and recognition rate prediction for dysarthria
Kayasith, Prakasith
Theeramunkong, Thanaruk
COMPUTERS & MATHEMATICS WITH APPLICATIONS, 2009, 58 (08) : 1534 - 1549
[2] Impostor detection in speaker recognition using confusion-based confidence measures
Kim, Kyuhong
Kim, Hoirin
Hahn, Minsoo
ETRI JOURNAL, 2006, 28 (06) : 811 - 814
[3] Multilingual Non-Native Speech Recognition using Phonetic Confusion-Based Acoustic Model Modification and Graphemic Constraints
Bouselmi, G.
Fohr, D.
Illina, I.
Haton, J. -P.
INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 109 - +
[4] Fully automated non-native speech recognition using confusion-based acoustic model integration and graphemic constraints
Bouselmi, Ghazi
Fohr, Dominique
Illina, Irina
Haton, Jean Paul
2006 IEEE International Conference on Acoustics, Speech and Signal Processing, Vols 1-13, 2006, : 345 - 348
[5] Joint decoding of multiple speech patterns for robust speech recognition
Nair, Nishanth Ulhas
Sreenivas, T. V.
2007 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, VOLS 1 AND 2, 2007, : 93 - 98
[6] TWO-DIMENSIONAL FRAME-AND-FEATURE WEIGHTED VITERBI DECODING FOR ROBUST SPEECH RECOGNITION
Chang, Yang
Lee, Lin-shan
2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4689 - 4692
[7] Pronouncibility index (Π): a distance-based and confusion-based speech quality measure for dysarthric speakers
Kayasith, Prakasith
Theeramunkong, Thanaruk
KNOWLEDGE AND INFORMATION SYSTEMS, 2011, 27 (03) : 367 - 391
[8] TANKER SELECTION BASED ON AN ENTROPY-WEIGHTED FUZZY MATTER APPROACH
Li, Haoqiang
Chen, Jihong
Wan, Zheng
Cao, Xiao
Shu, Yaqing
Bai, Yun
INTERNATIONAL JOURNAL OF MARITIME ENGINEERING, 2021, 163 : A17 - A28
[9] TANKER SELECTION BASED ON AN ENTROPY-WEIGHTED FUZZY MATTER APPROACH
Li H.
Chen J.
Wan Z.
Cao X.
Shu Y.
Bai Y.
Transactions of the Royal Institution of Naval Architects Part A: International Journal of Maritime Engineering, 2021, 163 (A1): : A17 - A28
[10] Robust Speech Recognition over Mobile Networks Using Combined Weighted Viterbi Decoding and Subvector Based Error Concealment
Tan, Zheng-Hua
Dalsgaard, Paul
Lindberg, Borge
INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1137 - 1140

← 1 2 3 4 5 →