Speech recognition using randomized relational decision trees

被引:6
|
作者
Amit, Y [1 ]
Murua, A [1 ]
机构
[1] Univ Chicago, Dept Stat, Chicago, IL 60637 USA
来源
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING | 2001年 / 9卷 / 04期
关键词
classification; decision trees; labeled graphs; spectogram; speech recognition;
D O I
10.1109/89.917679
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
We explore the possibility of recognizing speech signals using a large collection of coarse acoustic events, which describe temporal relations between a small number of local features of the spectrogram, The major issue of invariance to changes in duration of speech signal events is addressed by defining temporal relations in a rather coarse manner, allowing for a large degree of slack. The approach is greedy in that it does not offer an "explanation" of the entire signal as the hidden Markov models (HMMs) approach does; rather, it accesses small amounts of relational information to determine a speech unit or class. This implies that we recognize words as units, without recognizing their subcomponents, Multiple randomized decision trees are used to access the large pool of acoustic events in a systematic manner and are aggregated to produce the classifier.
引用
收藏
页码:333 / 341
页数:9
相关论文
共 50 条
  • [1] Speech Recognition using Soft Decision Trees
    Ajmera, Jitendra
    Akamine, Masami
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 940 - 943
  • [2] HMM-based Speech Recognition Using Decision Trees Instead of GMMs
    Teunen, Remco
    Akamine, Masami
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 617 - 620
  • [3] Modelling speech emotion recognition using logistic regression and decision trees
    Jacob A.
    International Journal of Speech Technology, 2017, 20 (4) : 897 - 905
  • [4] Noisy speech recognition failure diagnosis using Minimum Message Length decision trees
    Cernak, Milos
    Darjaa, Sakhia
    PROCEEDINGS OF IWSSIP 2008: 15TH INTERNATIONAL CONFERENCE ON SYSTEMS, SIGNALS AND IMAGE PROCESSING, 2008, : 409 - 412
  • [5] Using decision trees to learn ontology taxonomies from relational databases
    Sbai, Sara
    Chabih, Oussama
    Louhdi, Mohammed Reda Chbihi
    Behja, Hicham
    Zemmouri, El Moukhtar
    Trousse, Brigitte
    2020 6TH IEEE CONGRESS ON INFORMATION SCIENCE AND TECHNOLOGY (IEEE CIST'20), 2020, : 54 - 58
  • [6] Intracranial hypertension prediction using extremely randomized decision trees
    Scalzo, Fabien
    Hamilton, Robert
    Asgari, Shadnaz
    Kim, Sunghan
    Hu, Xiao
    MEDICAL ENGINEERING & PHYSICS, 2012, 34 (08) : 1058 - 1065
  • [7] Accent- and Speaker-Specific Polyphone Decision Trees for Non-Native Speech Recognition
    Telaar, Dominic
    Fuhs, Mark C.
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 3312 - 3315
  • [8] SEMANTIC OBJECT RECOGNITION USING CLUSTERING AND DECISION TREES
    Schmidsberger, Falk
    Stolzenburg, Frieder
    ICAART 2011: PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE, VOL 1, 2011, : 670 - 673
  • [9] Diagnostics of speech recognition using classification phoneme diagnostic trees
    Cernak, Milos
    Wellekens, Christian
    PROCEEDINGS OF THE SECOND IASTED INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE, 2006, : 459 - +
  • [10] A COMPARISON OF DECISION TREE CLASSIFIERS FOR AUTOMATIC DIAGNOSIS OF SPEECH RECOGNITION ERRORS
    Cernak, Milos
    COMPUTING AND INFORMATICS, 2010, 29 (03) : 489 - 501