Stroke Detector and Structure Based Models for Character Recognition: A Comparative Study

Cited by: 27
Authors
Shi, Cun-Zhao [1 ]
Gao, Song [1 ]
Liu, Meng-Tao [1 ]
Qi, Cheng-Zuo [1 ]
Wang, Chun-Heng [1 ]
Xiao, Bai-Hua [1 ]
Affiliations
[1] Chinese Acad Sci, Inst Automat, State Key Lab Management & Control Complex Syst, Beijing 100190, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Character recognition; stroke detector; structure; part-based model; tree-structure; spatiality embedded codeword; scene images; text; representation; cooccurrence; extraction; histogram
DOI
10.1109/TIP.2015.2473105
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Characters are man-made symbols composed of strokes arranged in a certain structure; they provide semantic information and play an indispensable role in daily life. In this paper, we make use of these intrinsic characteristics and explore stroke- and structure-based methods for character recognition. First, we introduce two existing part-based models that recognize characters by detecting elastic stroke-like parts. To utilize strokes of various scales, we propose learning a discriminative multi-scale stroke detector-based representation (DMSDR) for characters. However, both the part-based models and DMSDR require parts or key points to be labeled manually for training. To learn discriminative stroke detectors automatically, we further propose the discriminative spatiality embedded dictionary learning-based representation (DSEDR) for character recognition. We compare the performance of the tree-structured model (TSM), mixtures-of-parts TSM, DMSDR, and DSEDR on three challenging scene character recognition (SCR) data sets and two handwritten digit recognition data sets, under a range of experimental setups. The experimental results demonstrate the suitability of stroke detector-based models for recognizing characters with deformations and distortions, especially when training samples are limited.
Pages: 4952-4964
Number of pages: 13