Importance sampling based discriminative learning for large scale offline handwritten Chinese character recognition

被引:3
作者
Wang, Yanwei [1 ]
Fu, Qiang [2 ]
Ding, Xiaoqing [1 ]
Liu, Changsong [1 ]
机构
[1] Tsinghua Univ, Dept Elect Engn, Tsinghua Natl Lab Informat Sci & Technol, State Key Lab Intelligent Technol & Syst, Beijing 100084, Peoples R China
[2] Microsoft Res Asia, Beijing, Peoples R China
关键词
Importance sampling; Discriminative learning; Sample importance weight; Handwritten Chinese character recognition; IMPROVEMENT; CLASSIFIER;
D O I
10.1016/j.patcog.2014.09.014
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The development of a discriminative learning framework based on importance sampling for large-scale classification tasks is reported in this paper. The framework involves the assignment of samples with different weights according to the sample importance weight function derived from the Bayesian classification rule. Three methods are used to calculate the sample importance weights for learning the modified quadratic discriminant function (MQDF). (1) Rejection sampling method. The method selects important samples as a training subset and trains different levels of MQDFs by focusing on different types of samples. (2) Boosting algorithm. The algorithm modifies the sample importance weights iteratively according to the recognition performance. (3) Minimum classification error (MCE) rule. The parameter of the importance weight function is estimated using the MCE rule. In general, the cursive samples are usually misclassified or prone to be misclassified by the MQDF learned under the maximum likelihood estimation (MLE) rule. The proposed importance sampling framework thereby makes the MQDF classifier focus more on cursive samples than on normal samples. Such a strategy allows the MQDF to achieve higher accuracy while maintaining lower computational complexity. Comprehensive experiments on three Chinese handwritten character datasets demonstrated that the proposed framework exhibits promising character recognition accuracy. (C) 2014 Elsevier Ltd. All rights reserved.
引用
收藏
页码:1225 / 1234
页数:10
相关论文
共 45 条
[1]   Reducing multiclass to binary: A unifying approach for margin classifiers [J].
Allwein, EL ;
Schapire, RE ;
Singer, Y .
JOURNAL OF MACHINE LEARNING RESEARCH, 2001, 1 (02) :113-141
[2]  
Anderson E.C., 1999, LECT NOTES STAT C
[3]  
[Anonymous], 2001, Pattern Classification
[4]  
[Anonymous], 2006, Pattern recognition and machine learning
[5]  
Bahl L. R., 1986, ICASSP 86 Proceedings. IEEE-IECEJ-ASJ International Conference on Acoustics, Speech and Signal Processing (Cat. No.86CH2243-4), P49
[6]   Another look at rejection sampling through importance sampling [J].
Chen, YG .
STATISTICS & PROBABILITY LETTERS, 2005, 72 (04) :277-283
[7]   SUPPORT-VECTOR NETWORKS [J].
CORTES, C ;
VAPNIK, V .
MACHINE LEARNING, 1995, 20 (03) :273-297
[8]   Training invariant support vector machines [J].
Decoste, D ;
Schölkopf, B .
MACHINE LEARNING, 2002, 46 (1-3) :161-190
[9]  
Dietterich T. G., 1995, Journal of Artificial Intelligence Research, V2, P263
[10]   An improved handwritten Chinese character recognition system using support vector machine [J].
Dong, JX ;
Krzyzak, A ;
Suen, CY .
PATTERN RECOGNITION LETTERS, 2005, 26 (12) :1849-1856