Diagnostics of speech recognition using classification phoneme diagnostic trees

被引：0

作者：

Cernak, Milos ^{[1
]}

Wellekens, Christian ^{[1
]}

机构：

[1] Inst Eurecom, Dept Multimedia Commun, 2229 Route Cretes,BP 193, F-06904 Sophia Antioplis, France

来源：

PROCEEDINGS OF THE SECOND IASTED INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE | 2006年

关键词：

fault diagnosis; speech recognition; intrinsic speech variabilities;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

More than three decades of speech recognition research resulted in a very sophisticated statistical framework. However, less attention was still devoted to diagnostics of speech recognition; most previous research report on results in terms of ever-lower WER in various intrinsic or environmental conditions. This paper presents a diagnostics of the decoding process of ASR systems. The purpose of our diagnostics is to go beyond standard evaluation in terms of WERs and confusion matrices, and to look at the recognized output in more details. During the decoding phase, some specific data are collected at the decoder as possible causes of errors, and later are statistically analyzed using classification and regression trees. Focusing on pure acoustic phone decoding without language modeling, we present and discuss the results of the diagnostics that is used for an analysis of impact of intrinsic speech variabilities on speech recognition.

引用

页码：459 / +

页数：2

共 50 条

[1] Hierarchical Phoneme Classification for Improved Speech Recognition
Oh, Donghoon
Park, Jeong-Sik
Kim, Ji-Hwan
Jang, Gil-Jin
APPLIED SCIENCES-BASEL, 2021, 11 (01): : 1 - 17
[2] Myoclectric signal classification for phoneme-based speech recognition
Scheme, Erik J.
Hudgins, Bernard
Parker, Phillip A.
IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2007, 54 (04) : 694 - 699
[3] SPEECH RECOGNITION USING PHONEME HMM CONSTRAINED BY FRAME CORRELATION
TAKAHASHI, S
MATSUOKA, T
MINAMI, Y
SHIKANO, K
ELECTRONICS AND COMMUNICATIONS IN JAPAN PART III-FUNDAMENTAL ELECTRONIC SCIENCE, 1994, 77 (06): : 58 - 69
[4] Speech Recognition using Soft Decision Trees
Ajmera, Jitendra
Akamine, Masami
INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 940 - 943
[5] Feature Selection Using Game Theory for Phoneme Based Speech Recognition
Rekha, J. Ujwala
Chatrapati, K. Shahu
Babu, A. Vinaya
2014 INTERNATIONAL CONFERENCE ON CONTEMPORARY COMPUTING AND INFORMATICS (IC3I), 2014, : 962 - 966
[6] Phoneme and tonal accent recognition for Thai speech
Theera-Umpon, Nipon
Chansareewittaya, Suppakarn
Auephanwiriyakul, Sansanee
EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (10) : 13254 - 13259
[7] Phoneme fuzzy characterization in speech recognition systems
Beritelli, F
Borrometi, L
Cuce, A
APPLICATIONS OF SOFT COMPUTING, 1997, 3165 : 305 - 306
[8] Mouth Shape Sequence Recognition Based on Speech Phoneme Recognition
Xu, Ming
Hu, Ruimin
2006 FIRST INTERNATIONAL CONFERENCE ON COMMUNICATIONS AND NETWORKING IN CHINA, 2006,
[9] Using Vector of Fractal Dimensions for Feature Reduction and Phoneme Recognition and Classification
Hosseini, S. Abolfazl
Ghassemian, Hassan
Alizadeh, Roya
2012 20TH TELECOMMUNICATIONS FORUM (TELFOR), 2012, : 748 - 751
[10] Integration of phoneme-subspaces using ICA for speech feature extraction and recognition
Park, Hyunsin
Takiguchi, Tetsuya
Ariki, Yasuo
2008 HANDS-FREE SPEECH COMMUNICATION AND MICROPHONE ARRAYS, 2008, : 149 - 152

← 1 2 3 4 5 →