Diagnostics of speech recognition using classification phoneme diagnostic trees

被引:0
|
作者
Cernak, Milos [1 ]
Wellekens, Christian [1 ]
机构
[1] Inst Eurecom, Dept Multimedia Commun, 2229 Route Cretes,BP 193, F-06904 Sophia Antioplis, France
来源
PROCEEDINGS OF THE SECOND IASTED INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE | 2006年
关键词
fault diagnosis; speech recognition; intrinsic speech variabilities;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
More than three decades of speech recognition research resulted in a very sophisticated statistical framework. However, less attention was still devoted to diagnostics of speech recognition; most previous research report on results in terms of ever-lower WER in various intrinsic or environmental conditions. This paper presents a diagnostics of the decoding process of ASR systems. The purpose of our diagnostics is to go beyond standard evaluation in terms of WERs and confusion matrices, and to look at the recognized output in more details. During the decoding phase, some specific data are collected at the decoder as possible causes of errors, and later are statistically analyzed using classification and regression trees. Focusing on pure acoustic phone decoding without language modeling, we present and discuss the results of the diagnostics that is used for an analysis of impact of intrinsic speech variabilities on speech recognition.
引用
收藏
页码:459 / +
页数:2
相关论文
共 50 条
  • [1] Hierarchical Phoneme Classification for Improved Speech Recognition
    Oh, Donghoon
    Park, Jeong-Sik
    Kim, Ji-Hwan
    Jang, Gil-Jin
    APPLIED SCIENCES-BASEL, 2021, 11 (01): : 1 - 17
  • [2] Myoclectric signal classification for phoneme-based speech recognition
    Scheme, Erik J.
    Hudgins, Bernard
    Parker, Phillip A.
    IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2007, 54 (04) : 694 - 699
  • [3] SPEECH RECOGNITION USING PHONEME HMM CONSTRAINED BY FRAME CORRELATION
    TAKAHASHI, S
    MATSUOKA, T
    MINAMI, Y
    SHIKANO, K
    ELECTRONICS AND COMMUNICATIONS IN JAPAN PART III-FUNDAMENTAL ELECTRONIC SCIENCE, 1994, 77 (06): : 58 - 69
  • [4] Speech Recognition using Soft Decision Trees
    Ajmera, Jitendra
    Akamine, Masami
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 940 - 943
  • [5] Feature Selection Using Game Theory for Phoneme Based Speech Recognition
    Rekha, J. Ujwala
    Chatrapati, K. Shahu
    Babu, A. Vinaya
    2014 INTERNATIONAL CONFERENCE ON CONTEMPORARY COMPUTING AND INFORMATICS (IC3I), 2014, : 962 - 966
  • [6] Phoneme and tonal accent recognition for Thai speech
    Theera-Umpon, Nipon
    Chansareewittaya, Suppakarn
    Auephanwiriyakul, Sansanee
    EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (10) : 13254 - 13259
  • [7] Phoneme fuzzy characterization in speech recognition systems
    Beritelli, F
    Borrometi, L
    Cuce, A
    APPLICATIONS OF SOFT COMPUTING, 1997, 3165 : 305 - 306
  • [8] Mouth Shape Sequence Recognition Based on Speech Phoneme Recognition
    Xu, Ming
    Hu, Ruimin
    2006 FIRST INTERNATIONAL CONFERENCE ON COMMUNICATIONS AND NETWORKING IN CHINA, 2006,
  • [9] Using Vector of Fractal Dimensions for Feature Reduction and Phoneme Recognition and Classification
    Hosseini, S. Abolfazl
    Ghassemian, Hassan
    Alizadeh, Roya
    2012 20TH TELECOMMUNICATIONS FORUM (TELFOR), 2012, : 748 - 751
  • [10] Integration of phoneme-subspaces using ICA for speech feature extraction and recognition
    Park, Hyunsin
    Takiguchi, Tetsuya
    Ariki, Yasuo
    2008 HANDS-FREE SPEECH COMMUNICATION AND MICROPHONE ARRAYS, 2008, : 149 - 152