A hybrid post-processing system for handwritten Chinese character recognition

被引:5
作者
Xu, RF [1 ]
Yeung, D
Shu, WH
Liu, JF
机构
[1] Hong Kong Polytech Univ, Dept Comp, Hong Kong, Hong Kong, Peoples R China
[2] Harbin Inst Technol, Dept Comp Sci & Technol, Harbin 150006, Peoples R China
关键词
post-processing; handwritten Chinese character recognition; confusing character set; dictionary-based approximate matching; word BI-gram model;
D O I
10.1142/S0218001402001964
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, a hybrid post-processing system for improving the performance of Handwritten Chinese Character Recognition is presented. In order to remove two kinds of frequently encountered errors in the recognition result, namely mis-recognized character and unrecognized character, both confusing character characteristics of the recognizer and the contextual linguistic information are utilized in our hybrid three-stage post-processing system. In the first stage, the confusing character set and a statistical Noisy-Channel model are employed to identify the most promising candidate character and append possible unrecognized similar-shaped characters into candidate character set when a candidate sequence is given. Secondly, dictionary-based approximate word matching is conducted to further append contextual linguistic-prone characters into candidate character set and bind the candidate characters into a word-lattice. Finally, a Chinese word BI-Gram Markov model is employed in the third stage to identify a most promising sentence by selecting plausible words from the word-lattice. On the average, our system achieves a 5.1% recognition rate improvement for the first candidate when the original character recognition rate is 90% for the first candidate and 95% for the top-10 candidates by an online HCCR engine.
引用
收藏
页码:657 / 679
页数:23
相关论文
共 14 条
[1]  
CHANG CH, 1997, COMMUN COLIPS, V7, P1
[2]   OPTICAL RECOGNITION OF HANDWRITTEN CHINESE CHARACTERS - ADVANCES SINCE 1980 [J].
HILDEBRANDT, TH ;
LIU, WT .
PATTERN RECOGNITION, 1993, 26 (02) :205-225
[3]  
Lee HJ, 1998, IEEE SYS MAN CYBERN, P4195, DOI 10.1109/ICSMC.1998.727503
[4]  
LEE HJ, 1995, P IEEE 3 ICDAR, P450
[5]  
LIU JF, 1996, THESIS HARBIN I TECH
[6]   INTERACTION OF INFORMATION IN WORD RECOGNITION [J].
MORTON, J .
PSYCHOLOGICAL REVIEW, 1969, 76 (02) :165-&
[7]   HYBRID CONTEXTUAL TEXT RECOGNITION WITH STRING-MATCHING [J].
SINHA, RMK ;
PRASADA, B ;
HOULE, GF ;
SABOURIN, M .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1993, 15 (09) :915-925
[8]  
SU KY, 1996, INT J COMPUT LING C, V1, P101
[9]  
Tai J.-W., 1993, Proceedings of the Second International Conference on Document Analysis and Recognition (Cat. No.93TH0578-5), P826, DOI 10.1109/ICDAR.1993.395610
[10]   Offline recognition of Chinese handwriting by multifeature and multilevel classification [J].
Tang, YY ;
Tu, LT ;
Liu, JM ;
Lee, SW ;
Lin, WW ;
Shyu, IS .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1998, 20 (05) :556-561