Multi-script Identification from Printed Words

被引:3
作者
Jetley, Saumya [1 ]
Mehrotra, Kapil [1 ]
Vaze, Atish [1 ]
Belhe, Swapnil [1 ]
机构
[1] Ctr Dev Adv Comp C DAC, Pune, Maharashtra, India
来源
IMAGE ANALYSIS AND RECOGNITION, ICIAR 2014, PT I | 2014年 / 8814卷
关键词
D O I
10.1007/978-3-319-11758-4_39
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In today's multi-script scenario, documents contain page, paragraph, line and up to word level intermixing of different scripts. We need a script recognition approach that can perform well even at the lowest semantically-valid level of words so as to serve as a generic solution. The present paper proposes a combination of Histogram of Oriented Gradients (HoG) and Local Binary Patterns (LBP), extracted over words, to capture the unique and discriminative structural formations of different scripts. Tested over MILE printed-word data set, this concatenated feature descriptor yields a state-of-the-art average recognition accuracy of 97.4% over a set of 11 Indian scripts. In an end-to-end document recognition system it is correct to assume a skew correction unit prior to script identification. Depending on the amount of skew, the skew correction unit can either yield a correctly aligned document or an inverted one. For script identification in such scenarios, we introduce novel modifications over existing HoG and LBP features to propose - Inversion Invariant HoG (II-HoG) and Inversion Invariant LBP (II-LBP) in order to achieve text inversion invariance. Once the script is recognized, script-specific HoG and LBP feature combination can be used to find the text alignment i.e. 0. or 180. for correction. For the MILE database, first-level inversion-invariant script-identification accuracy for 11 script-set is 95.8% (1% gain over the existing best) while the second-level script-specific orientation-detection accuracy is averaged at 97.7%.
引用
收藏
页码:359 / 368
页数:10
相关论文
共 19 条
[1]  
[Anonymous], 2005, PROC CVPR IEEE
[2]  
[Anonymous], 2010, 2010 2 INT C COMPUTI, DOI DOI 10.1109/ICCCNT.2010.5592562
[3]  
[Anonymous], P INT C DOC REC RETR
[4]  
BLOOMBERG DS, 1995, P SOC PHOTO-OPT INS, V2422, P302, DOI 10.1117/12.205832
[5]  
Chanda S, 2010, P ICPR, P2708
[6]   Identification of Indic Scripts on Torn-Documents [J].
Chanda, Sukalpa ;
Franke, Katrin ;
Pal, Umapada .
11TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR 2011), 2011, :713-717
[7]   LIBSVM: A Library for Support Vector Machines [J].
Chang, Chih-Chung ;
Lin, Chih-Jen .
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)
[8]  
Das M. S., 2012, Proceedings of the 2012 1st International Conference on Recent Advances in Information Technology (RAIT 2012), P487, DOI 10.1109/RAIT.2012.6194627
[9]   Composite Script Identification and Orientation Detection for Indian Text Images [J].
Ghosh, Shamita ;
Chaudhuri, Bidyut B. .
11TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR 2011), 2011, :294-298
[10]  
Gopakumar Rajesh, 2010, 2010 5th International Conference on Industrial and Information Systems (ICIIS 2010), P420, DOI 10.1109/ICIINFS.2010.5578668