Minimum-risk training for semi-Markov conditional random fields with application to handwritten Chinese/Japanese text recognition

被引:23
作者
Zhou, Xiang-Dong [1 ]
Zhang, Yan-Ming [2 ]
Tian, Feng [3 ]
Wang, Hong-An [3 ]
Liu, Cheng-Lin [2 ]
机构
[1] Chinese Acad Sci, Chongqing Inst Green & Intelligent Technol, Intelligent Media Tech Res Ctr, Chongqing 400714, Peoples R China
[2] Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100190, Peoples R China
[3] Chinese Acad Sci, Beijing Key Lab Human Comp Interact, Inst Software, Beijing 100190, Peoples R China
基金
中国国家自然科学基金;
关键词
Semi-Markov conditional random fields; Minimum-risk training; Character string recognition; ERROR; MINIMIZATION;
D O I
10.1016/j.patcog.2013.12.002
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Semi-Markov conditional random fields (semi-CRFs) are usually trained with maximum a posteriori (MAP) criterion which adopts the 0/1 cost for measuring the loss of misclassification. In this paper, based on our previous work on handwritten Chinese/Japanese text recognition (HCTR) using semi-CRFs, we propose an alternative parameter learning method by minimizing the risk on the training set, which has unequal misclassification costs depending on the hypothesis and the ground-truth. Based on this framework, three non-uniform cost functions are compared with the conventional 0/1 cost, and training data selection is incorporated to reduce the computational complexity. In experiments of online handwriting recognition on databases CASIA-OLHWDB and THAT Kondate, we compared the performances of the proposed method with several widely used learning criteria, including conditional log-likelihood (CLL), softmax-margin (SMM), minimum classification error (MCE), large-margin MCE (LM-MCE) and max-margin (MM). On the test set (online handwritten texts) of ICDAR 2011 Chinese handwriting recognition competition, the proposed method outperforms the best system in competition. (C) 2013 Elsevier Ltd. All rights reserved.
引用
收藏
页码:1904 / 1916
页数:13
相关论文
共 53 条
[1]  
[Anonymous], P EUROSPEECH
[2]  
[Anonymous], 2008, P CVPR
[3]  
[Anonymous], PATTERN RECOGNIT
[4]  
[Anonymous], 2012, IEEE T PATTERN ANAL, DOI DOI 10.1109/TPAMI.2011.264
[5]  
[Anonymous], 1999, The Nature Statist. Learn. Theory
[6]  
[Anonymous], 2006, P COLING ACL 2006 MA
[7]  
[Anonymous], THESIS CAMBRIDGE U
[8]  
[Anonymous], P EMNLP
[9]  
[Anonymous], NEURAL INF PROCESS S
[10]  
[Anonymous], COMPUT SPEECH LANG