Speech recognition using randomized relational decision trees

Cited by: 6
Authors
Amit, Y [1]
Murua, A [1]
Affiliations
[1] Univ Chicago, Dept Stat, Chicago, IL 60637 USA
Source
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING | 2001 / Vol. 9 / No. 4
Keywords
classification; decision trees; labeled graphs; spectrogram; speech recognition
DOI
10.1109/89.917679
Chinese Library Classification (CLC) number
O42 [Acoustics];
Subject classification codes
070206 ; 082403 ;
Abstract
We explore the possibility of recognizing speech signals using a large collection of coarse acoustic events, which describe temporal relations between a small number of local features of the spectrogram. The major issue of invariance to changes in duration of speech signal events is addressed by defining temporal relations in a rather coarse manner, allowing for a large degree of slack. The approach is greedy in that it does not offer an "explanation" of the entire signal as the hidden Markov model (HMM) approach does; rather, it accesses small amounts of relational information to determine a speech unit or class. This implies that we recognize words as units, without recognizing their subcomponents. Multiple randomized decision trees are used to access the large pool of acoustic events in a systematic manner and are aggregated to produce the classifier.
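The idea in the abstract can be illustrated with a small toy sketch. It is not the authors' implementation: the event definition (a feature `b` occurring within a wide window of frames after a feature `a`), the Gini split criterion, the feature names `rise` and `burst`, and the helper names `has_event`, `grow_tree`, and `classify` are all assumptions made for illustration. The sketch only shows how coarse temporal relations tolerate changes in duration and how several randomized trees might be aggregated.

```python
import random
from collections import Counter

# An utterance is a list of (frame_index, feature_label) detections.
# A hypothetical "event" (a, b, lo, hi) asks: does some b occur between
# lo and hi frames after some a? A wide [lo, hi] window gives the slack.
def has_event(utt, event):
    a, b, lo, hi = event
    times_a = [t for t, f in utt if f == a]
    times_b = [t for t, f in utt if f == b]
    return any(lo <= tb - ta <= hi for ta in times_a for tb in times_b)

def gini(labels):
    n = len(labels)
    return 1.0 - sum((c / n) ** 2 for c in Counter(labels).values())

def grow_tree(data, events, rng, n_candidates=5, max_depth=4):
    labels = [y for _, y in data]
    if max_depth == 0 or len(set(labels)) == 1:
        return Counter(labels)  # leaf: class counts
    best = None
    # Randomization: each node only looks at a random sample of events.
    for ev in rng.sample(events, min(n_candidates, len(events))):
        yes = [(u, y) for u, y in data if has_event(u, ev)]
        no = [(u, y) for u, y in data if not has_event(u, ev)]
        if not yes or not no:
            continue
        score = (len(yes) * gini([y for _, y in yes])
                 + len(no) * gini([y for _, y in no])) / len(data)
        if best is None or score < best[0]:
            best = (score, ev, yes, no)
    if best is None:
        return Counter(labels)
    _, ev, yes, no = best
    return (ev,
            grow_tree(yes, events, rng, n_candidates, max_depth - 1),
            grow_tree(no, events, rng, n_candidates, max_depth - 1))

def tree_posterior(tree, utt):
    while isinstance(tree, tuple):  # internal node: (event, yes, no)
        ev, yes, no = tree
        tree = yes if has_event(utt, ev) else no
    total = sum(tree.values())
    return {c: k / total for c, k in tree.items()}

def classify(forest, utt):
    # Aggregate by summing the leaf class distributions over all trees.
    votes = Counter()
    for tree in forest:
        for c, p in tree_posterior(tree, utt).items():
            votes[c] += p
    return max(votes, key=votes.get)

# Toy data: two "words" that differ only in the order of two features,
# spoken at several durations (gaps between the features).
def make_utt(first, second, gap):
    return [(0, first), (gap, second)]

train = ([(make_utt("rise", "burst", g), "word1") for g in (4, 8, 12)]
         + [(make_utt("burst", "rise", g), "word2") for g in (4, 8, 12)])
features = ["rise", "burst"]
events = [(a, b, 1, 20) for a in features for b in features if a != b]
forest = [grow_tree(train, events, random.Random(seed)) for seed in range(10)]

# A duration never seen in training is still handled by the coarse window.
print(classify(forest, make_utt("rise", "burst", 16)))
```

In this sketch each word is classified as a whole unit from a few relational tests, with no segmentation into subcomponents, which mirrors the description in the abstract at a much smaller scale.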
Pages: 333-341
Number of pages: 9