A multiple classifier approach to detect Chinese character recognition errors

被引:3
作者
Hung, KY
Luk, RWP [1 ]
Yeung, DS
Chung, KFL
Shu, W
机构
[1] Hong Kong Polytech Univ, Dept Comp, Kowloon, Hong Kong, Peoples R China
[2] Hong Kong Polytech Univ, Ctr Multimedia Signal Proc, Kowloon, Hong Kong, Peoples R China
[3] Harbin Inst Technol, Dept Comp Sci & Engn, Harbin, Peoples R China
关键词
character recognition; error detection; pattern recognition and language modeling;
D O I
10.1016/j.patcog.2004.09.004
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Detection of recognition errors is important in many areas, such as improving recognition performance, saving manual effort for proof-reading and post-editing, and assigning appropriate weights for retrieval in constructing digital libraries. We propose a novel application of multiple classifiers for the detection of recognition errors. A need for multiple classifiers emerges when a single classifier cannot improve recognition-error detection performance compared with the current detection scheme using a simple threshold mechanism. Although the single classifier does not improve recognition error performance, it serves as a baseline for comparison and the related study of useful features for error detection suggests three distinct cases where improvement is needed. For each case, the multiple classifier approach assigns a classifier to detect the presence or absence of errors and additional features are considered for each case. Our results show that the recall rate (70-80%) of recognition errors, the precision rate (80-90%) of recognition error detection and the saving in manual effort (75%) were better than the corresponding performance using a single classifier or a simple threshold detection scheme. (C) 2004 Pattern Recognition Society. Published by Elsevier Ltd. All rights reserved.
引用
收藏
页码:723 / 738
页数:16
相关论文
共 32 条
[1]  
AKARI T, 1994, P 15 INT C COMP LING, V1, P187
[2]   An overview of character recognition focused on off-line handwriting [J].
Arica, N ;
Yarman-Vural, FT .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2001, 31 (02) :216-233
[3]  
Brown P. F., 1993, Computational Linguistics, V19, P263
[4]  
CHANG CH, 1994, COMMUNICATION COLIPS, V4, P143
[5]  
CHEN KC, 1993, 9305 AC SIN I INF SC
[6]  
Chen SF, 1998, INT CONF ACOUST SPEE, P681, DOI 10.1109/ICASSP.1998.675356
[7]   A Bayesian framework for deformable pattern recognition with application to handwritten character recognition [J].
Cheung, KW ;
Yeung, DY ;
Chin, RT .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1998, 20 (12) :1382-1388
[8]   Statistical estimation of the number of hidden units for feedforward neural networks [J].
Fujita, O .
NEURAL NETWORKS, 1998, 11 (05) :851-859
[9]   Introduction to the special issue on language modeling and dialogue systems [J].
Glass, JR ;
Rosenfeld, R .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2000, 8 (01) :1-2
[10]  
GOLDING AR, 1996, P 13 INT C MACH LEAR, P182