Confusion-Based Entropy-Weighted Decoding for Robust Speech Recognition

Cited by: 0
Authors
Chen, Yi [1]
Wan, Chia-yu [1]
Lee, Lin-shan [1]
Affiliations
[1] Natl Taiwan Univ, Grad Inst Commun Engn, Taipei 10764, Taiwan
Source
INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5 | 2008
Keywords
speech recognition; robustness;
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Subject classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
An entropy-based feature parameter weighting scheme was proposed previously [1], in which the scores obtained from different feature parameters are weighted differently in the decoding process according to an entropy measure. In this paper, we propose a more refined entropy measure for this purpose that takes into account the inherent confusion among different acoustic classes. If a set of acoustic classes is easily confused, the feature parameters that can distinguish them should be emphasized. Extensive experiments with the Aurora 2 testing environment verified that this approach is equally useful for different types of features and can be easily integrated with typical existing robust speech recognition approaches.
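The weighting idea in the abstract can be illustrated with a minimal sketch. This is not the authors' formulation, which the record does not give. It assumes each feature stream supplies a posterior distribution over acoustic classes, that a stream's weight is one minus its normalized entropy, and that the confusion-aware variant simply restricts the entropy to a given set of easily confused classes. The names `entropy`, `stream_weights`, `weighted_log_score`, and the `confusable` argument are all hypothetical.

```python
import math


def entropy(probs):
    """Shannon entropy (in nats) of a probability distribution."""
    return -sum(p * math.log(p) for p in probs if p > 0.0)


def stream_weights(posteriors_per_stream, confusable=None):
    """Turn per-stream class posteriors into weights that sum to 1.

    posteriors_per_stream: list of dicts mapping class label -> posterior.
    confusable: optional set of class labels (an assumption of this sketch);
        if given, entropy is computed only over those classes after
        renormalizing their posteriors.
    """
    weights = []
    for post in posteriors_per_stream:
        classes = [c for c in post if confusable is None or c in confusable]
        total = sum(post[c] for c in classes)
        if total <= 0.0 or len(classes) < 2:
            # Nothing to discriminate in this stream: give it no emphasis.
            weights.append(0.0)
            continue
        probs = [post[c] / total for c in classes]
        max_h = math.log(len(probs))
        # Low entropy means the stream separates the classes well,
        # so it receives a larger weight.
        weights.append(max(0.0, 1.0 - entropy(probs) / max_h))
    norm = sum(weights)
    if norm == 0.0:
        # Fall back to equal weights when no stream is informative.
        return [1.0 / len(weights)] * len(weights)
    return [w / norm for w in weights]


def weighted_log_score(log_likelihoods, weights):
    """Combine per-stream log-likelihoods with the given stream weights."""
    return sum(w * ll for w, ll in zip(weights, log_likelihoods))


# Two hypothetical feature streams scoring three acoustic classes.
stream_a = {"a": 0.9, "b": 0.05, "c": 0.05}
stream_b = {"a": 0.4, "b": 0.5, "c": 0.1}

plain = stream_weights([stream_a, stream_b])
confusion_aware = stream_weights([stream_a, stream_b], confusable={"b", "c"})
```

In this toy example, stream A is sharper overall and gets the larger weight when all classes are considered. When only the confusable pair "b" and "c" is considered, stream A cannot tell them apart, so the emphasis shifts to stream B, which is the behavior the abstract describes.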
Pages: 1008-1011
Number of pages: 4
Related papers
10 items in total
[1]  
[Anonymous], IEEE SIGNAL PROCESSI
[2]   Low-bitrate distributed speech recognition for packet-based and wireless communication [J].
Bernard, A ;
Alwan, A .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2002, 10 (08) :570-579
[3]  
CHEN Y, ROBUST FEATURES SPEE
[4]  
CHEN Y, ENTROPY BASED FEATUR
[5]   Noise robust speech recognition using feature compensation based on polynomial regression of utterance SNR [J].
Cui, XD ;
Alwan, A .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2005, 13 (06) :1161-1172
[6]   PERCEPTUAL LINEAR PREDICTIVE (PLP) ANALYSIS OF SPEECH [J].
HERMANSKY, H .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1990, 87 (04) :1738-1752
[7]  
HIRSCH HG, 2000, AURORA EXPT FRAMEWOR
[8]  
Viikki O., 1998, SPEECH COMMUNICATION
[9]   Speaker verification in noise using a stochastic version of the weighted Viterbi algorithm [J].
Yoma, NB ;
Villar, M .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2002, 10 (03) :158-166
[10]  
YOMA NB, STOCHASTIC WEIGHTED