Confusion-Based Entropy-Weighted Decoding for Robust Speech Recognition

被引:0
|
作者
Chen, Yi [1 ]
Wan, Chia-yu [1 ]
Lee, Lin-shan [1 ]
机构
[1] Natl Taiwan Univ, Grad Inst Commun Engn, Taipei 10764, Taiwan
来源
INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5 | 2008年
关键词
speech recognition; robustness;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
An entropy-based feature parameter weighting scheme was proposed previously [1], in which the scores obtained from different feature parameters are weighted differently in the decoding process according to an entropy measure. In this paper, we propose a more delicate entropy measure for this purpose considering the inherent confusion among different acoustic classes. If a set of acoustic classes are easily confused, those feature parameters which can distinguish them should be emphasized. Extensive experiments with the Aurora 2 testing environment verified that this approach is equally useful for different types of features, and can be easily integrated with typical existing robust speech recognition approaches.
引用
收藏
页码:1008 / 1011
页数:4
相关论文
共 50 条
  • [1] Speech confusion index (Φ): A confusion-based speech quality indicator and recognition rate prediction for dysarthria
    Kayasith, Prakasith
    Theeramunkong, Thanaruk
    COMPUTERS & MATHEMATICS WITH APPLICATIONS, 2009, 58 (08) : 1534 - 1549
  • [2] Impostor detection in speaker recognition using confusion-based confidence measures
    Kim, Kyuhong
    Kim, Hoirin
    Hahn, Minsoo
    ETRI JOURNAL, 2006, 28 (06) : 811 - 814
  • [3] Multilingual Non-Native Speech Recognition using Phonetic Confusion-Based Acoustic Model Modification and Graphemic Constraints
    Bouselmi, G.
    Fohr, D.
    Illina, I.
    Haton, J. -P.
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 109 - +
  • [4] Fully automated non-native speech recognition using confusion-based acoustic model integration and graphemic constraints
    Bouselmi, Ghazi
    Fohr, Dominique
    Illina, Irina
    Haton, Jean Paul
    2006 IEEE International Conference on Acoustics, Speech and Signal Processing, Vols 1-13, 2006, : 345 - 348
  • [5] Joint decoding of multiple speech patterns for robust speech recognition
    Nair, Nishanth Ulhas
    Sreenivas, T. V.
    2007 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, VOLS 1 AND 2, 2007, : 93 - 98
  • [6] TWO-DIMENSIONAL FRAME-AND-FEATURE WEIGHTED VITERBI DECODING FOR ROBUST SPEECH RECOGNITION
    Chang, Yang
    Lee, Lin-shan
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4689 - 4692
  • [7] Pronouncibility index (Π): a distance-based and confusion-based speech quality measure for dysarthric speakers
    Kayasith, Prakasith
    Theeramunkong, Thanaruk
    KNOWLEDGE AND INFORMATION SYSTEMS, 2011, 27 (03) : 367 - 391
  • [8] TANKER SELECTION BASED ON AN ENTROPY-WEIGHTED FUZZY MATTER APPROACH
    Li, Haoqiang
    Chen, Jihong
    Wan, Zheng
    Cao, Xiao
    Shu, Yaqing
    Bai, Yun
    INTERNATIONAL JOURNAL OF MARITIME ENGINEERING, 2021, 163 : A17 - A28
  • [9] TANKER SELECTION BASED ON AN ENTROPY-WEIGHTED FUZZY MATTER APPROACH
    Li H.
    Chen J.
    Wan Z.
    Cao X.
    Shu Y.
    Bai Y.
    Transactions of the Royal Institution of Naval Architects Part A: International Journal of Maritime Engineering, 2021, 163 (A1): : A17 - A28
  • [10] Robust Speech Recognition over Mobile Networks Using Combined Weighted Viterbi Decoding and Subvector Based Error Concealment
    Tan, Zheng-Hua
    Dalsgaard, Paul
    Lindberg, Borge
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1137 - 1140