Stroke Detector and Structure Based Models for Character Recognition: A Comparative Study

被引:27
作者
Shi, Cun-Zhao [1 ]
Gao, Song [1 ]
Liu, Meng-Tao [1 ]
Qi, Cheng-Zuo [1 ]
Wang, Chun-Heng [1 ]
Xiao, Bai-Hua [1 ]
机构
[1] Chinese Acad Sci, Inst Automat, State Key Lab Management & Control Complex Syst, Beijing 100190, Peoples R China
基金
中国国家自然科学基金;
关键词
Character recognition; stroke detector; structure; part-based model; tree-structure; spatiality embedded codeword; SCENE IMAGES; TEXT; REPRESENTATION; COOCCURRENCE; EXTRACTION; HISTOGRAM;
D O I
10.1109/TIP.2015.2473105
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Characters, which are man-made symbols composed of strokes arranged in a certain structure, could provide semantic information and play an indispensable role in our daily life. In this paper, we try to make use of the intrinsic characteristics of characters and explore the stroke and structure-based methods for character recognition. First, we introduce two existing part-based models to recognize characters by detecting the elastic strokelike parts. In order to utilize strokes of various scales, we propose to learn the discriminative multi-scale stroke detector-based representation (DMSDR) for characters. However, the part-based models and DMSDR need to manually label the parts or key points for training. In order to learn the discriminative stroke detectors automatically, we further propose the discriminative spatiality embedded dictionary learning-based representation (DSEDR) for character recognition. We make a comparative study of the performance of the tree-structured model (TSM), mixtures-of-parts TSM, DMSDR, and DSEDR for character recognition on three challenging scene character recognition (SCR) data sets as well as two handwritten digits recognition data sets. A series of experiments is done on these data sets with various experimental setup. The experimental results demonstrate the suitability of stroke detector-based models for recognizing characters with deformations and distortions, especially in the case of limited training samples.
引用
收藏
页码:4952 / 4964
页数:13
相关论文
共 68 条
[31]   Scene Text Recognition using Higher Order Language Priors [J].
Mishra, Anand ;
Alahari, Karteek ;
Jawahar, C. V. .
PROCEEDINGS OF THE BRITISH MACHINE VISION CONFERENCE 2012, 2012,
[32]  
Mishra A, 2012, PROC CVPR IEEE, P2687, DOI 10.1109/CVPR.2012.6247990
[33]  
Netzer Y., 2011, NIPSW
[34]   Scene Text Localization and Recognition with Oriented Stroke Detection [J].
Neumann, Lukas ;
Matas, Jiri .
2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, :97-104
[35]   On Combining Multiple Segmentations in Scene Text Recognition [J].
Neumann, Lukas ;
Matas, Jiri .
2013 12TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2013, :523-527
[36]  
Neumann L, 2012, PROC CVPR IEEE, P3538, DOI 10.1109/CVPR.2012.6248097
[37]  
Neumann L, 2011, LECT NOTES COMPUT SC, V6494, P770, DOI 10.1007/978-3-642-19318-7_60
[38]   Multiscale Histogram of Oriented Gradient Descriptors for Robust Character Recognition [J].
Newell, Andrew J. ;
Griffin, Lewis D. .
11TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR 2011), 2011, :1085-1089
[39]   Large-Lexicon Attribute-Consistent Text Recognition in Natural Images [J].
Novikova, Tatiana ;
Barinova, Olga ;
Kohli, Pushmeet ;
Lempitsky, Victor .
COMPUTER VISION - ECCV 2012, PT VI, 2012, 7577 :752-765
[40]   A Hybrid Approach to Detect and Localize Texts in Natural Scene Images [J].
Pan, Yi-Feng ;
Hou, Xinwen ;
Liu, Cheng-Lin .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2011, 20 (03) :800-813