Contextual post-processing based on the confusion matrix in offline handwritten Chinese script recognition

被引:32
作者
Li, YX
Tan, CL [1 ]
Ding, XQ
Liu, CS
机构
[1] Natl Univ Singapore, Dept Comp Sci, Sch Comp, Singapore 117543, Singapore
[2] Tsinghua Univ, Dept Elect Engn, Beijing 100084, Peoples R China
基金
美国国家科学基金会;
关键词
Chinese character recognition; contextual post-processing; statistical language model; confusion matrix; candidate expansion; combination; approximate word-matching;
D O I
10.1016/j.patcog.2004.03.002
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The inclusion of potentially correct characters in candidate sets is key to improving accuracy in the recognition of Chinese scripts in the aspect of contextual post-processing. This paper presents two methods based on a confusion matrix to recall the correct characters. The first method uses original candidates to conjecture the most likely correct characters, and then combines the conjectured set with the original candidates to produce a new candidate set. The second method performs an approximate matching of adjoining characters in a sentence with Chinese words so as to recall the most likely correct character. Experimental results demonstrate the effectiveness of our proposed methods. (C) 2004 Pattern Recognition Society. Published by Elsevier Ltd. All rights reserved.
引用
收藏
页码:1901 / 1912
页数:12
相关论文
共 22 条
[1]   Simulated annealing clustering of Chinese words for contextual text recognition [J].
Chang, CH .
PATTERN RECOGNITION LETTERS, 1996, 17 (01) :57-66
[2]   An empirical study of smoothing techniques for language modeling [J].
Chen, SF ;
Goodman, J .
COMPUTER SPEECH AND LANGUAGE, 1999, 13 (04) :359-394
[3]  
CHEN Y, 1997, THESIS TSINGHUA U CH
[4]  
GU HY, 1991, COMPUT SPEECH LANG, V15, P363
[5]  
HO TK, 1994, IEEE T PATTERN ANAL, V16, P66, DOI 10.1109/34.273716
[6]  
Hosmer D. W., 1989, APPL LOGISTIC REGRES, DOI DOI 10.1097/00019514-200604000-00003
[7]  
Ishidera E., 2001, Proceedings of Sixth International Conference on Document Analysis and Recognition, P8, DOI 10.1109/ICDAR.2001.953745
[8]   Statistical pattern recognition: A review [J].
Jain, AK ;
Duin, RPW ;
Mao, JC .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2000, 22 (01) :4-37
[9]  
Kernighan M., 1990, Proceedings of COLING-90, The 13th International Conference on Computational Linguistics, V2, P205
[10]   Multi-level post-processing for Korean character recognition using morphological analysis and linguistic evaluation [J].
Lee, G ;
Lee, JH ;
Yoo, J .
PATTERN RECOGNITION, 1997, 30 (08) :1347-1360