Speech recognition using randomized relational decision trees

Cited by: 6

Authors:

Amit, Y [1]

Murua, A [1]

Affiliations:

[1] Univ Chicago, Dept Stat, Chicago, IL 60637 USA

Source:

IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING | 2001 / Vol. 9 / No. 4

Keywords:

classification; decision trees; labeled graphs; spectrogram; speech recognition

DOI:

10.1109/89.917679

Chinese Library Classification (CLC) number:

O42 [Acoustics]

Discipline classification codes:

070206 ; 082403 ;

Abstract:

We explore the possibility of recognizing speech signals using a large collection of coarse acoustic events, which describe temporal relations between a small number of local features of the spectrogram. The major issue of invariance to changes in the duration of speech signal events is addressed by defining temporal relations in a rather coarse manner, allowing for a large degree of slack. The approach is greedy in that it does not offer an "explanation" of the entire signal as the hidden Markov model (HMM) approach does; rather, it accesses small amounts of relational information to determine a speech unit or class. This implies that we recognize words as units, without recognizing their subcomponents. Multiple randomized decision trees are used to access the large pool of acoustic events in a systematic manner and are aggregated to produce the classifier.

Pages: 333-341

Page count: 9