Rapid discriminative learning based on misclassification measure

被引:1
作者
Interdisciplinary Faculty of Science and Engineering, Shimane University, Matsue, 690-8504, Japan [1 ]
不详 [2 ]
不详 [3 ]
不详 [4 ]
机构
[1] Interdisciplinary Faculty of Science and Engineering, Shimane University, Matsue
[2] Mathematics and Information Systems, Interdisciplinary Faculty of Science and Engineering, Shimane University
来源
Syst Comput Jpn | 2006年 / 3卷 / 58-68期
关键词
Generalization ability; Generalized Probabilistic Descent (GPD); Minimum Classification Error (MCE); Overtraining; Regularization;
D O I
10.1002/scj.20353
中图分类号
学科分类号
摘要
The current research is based on Minimum Classification Error Learning (MCE/GPD) using Generalized Probabilistic Descent (GPD), which is known as a high-performance discriminative learning method. MCE/GPD is an excellent recognition technique that has been applied to speech recognition because of its high recognition performance and its ability to deal with variable-length vectors. However, like other recognition techniques, it suffers from the problem that recognition performance drops for untrained data (generalization ability problem). There is also the practical fault that training time is lengthy due to the complexity of the algorithm. In the current research, the authors propose a new learning method that improves the generalization ability by introducing regularized learning to avoid ill-posed problems and increases learning speed according to a hierarchical model arrangement, which should solve these two problems. They used a hierarchical neural network for performance evaluation. © 2006 Wiley Periodicals, Inc.
引用
收藏
页码:58 / 68
页数:10
相关论文
共 25 条
[1]  
Amari S., A theory of adaptive pattern classifiers, IEEE Trans EC, 16, pp. 299-307, (1967)
[2]  
Juang B.-H., Katagiri S., Discriminative learning for minimum error classification, IEEE Trans Signal Process, 40, 12, (1992)
[3]  
Kurosawa Y., Probabilistic descent method applied to similarity and discrepancy of quadratic form for pattern recognition, Tech Rep IEICE, (1997)
[4]  
Rabiner L., Jung B.-H., Fundamentals of Speech Recognition, (1993)
[5]  
Chou W., Lee C.-H., Juang B.-H., Minimum error rate training of inter-word context dependent acoustic model units in speech recognition, Proc ICASSP94
[6]  
Yonezawa Y., Akagi M., Modeling of contextual effects based on the minimum classification error criterion, Tech Rep IEICE, (1994)
[7]  
MacDermott E., Katagiri S., Prototype-based minimum classification error/generalized probabilistic descent training for various speech units, Computer Speech and Language, pp. 351-368, (1994)
[8]  
Takahashi J., Sagayama S., An efficient HMM learning method for a small amount of data using minimum classification error, Tech Rep IEICE, (1995)
[9]  
Uemoto N., Matsuoka T., Matsui T., Furui S., Detection of an acoustic model learning method in connected digit speech recognition, Trans Acoust Soc Japan, pp. 121-122, (1995)
[10]  
Schapire R., Freund Y., Barlett P., Boosting the margin: A new explanation for the effectiveness of voting methods, Ann Statist, 26, pp. 1651-1686, (1998)