RETRACTED: An efficient recognition system for preserving ancient historical documents of English characters (Retracted Article)

被引:6
作者
Sathya Narayanan, V. [1 ]
Kasthuri, N. [1 ]
机构
[1] Kongu Engn Coll, Dept Elect & Commun Engn, Erode, Tamil Nadu, India
关键词
Character recognition; HDLA; 2011; dataset; Bounding box method; Local binary patterns; SPM classifier; REPRESENTATION; CLASSIFICATION; SEGMENTATION; RETRIEVAL; DECISION;
D O I
10.1007/s12652-020-02201-w
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The clusters of historical documents are of great importance in terms of cultural and scientific. In order to access the documents, originality should be maintained. So conversion of digital form is highly required for recognition. While converting, those documents may be due to poor quality, overlapping of characters, complex background and so on. In this paper, an efficient system for recognizing English characters from degraded historical document images is proposed. Initially, modified Adaptive Thresholding based binarization process is performed to eliminate the noise content in the input image. The characters are segmented through the rectangular bounding box method. Then Local binary pattern (LBP) algorithm is enforced to extricate the features of each characters. Finally, Spatial Pyramid Matching (SPM) classifier is used for texture classification. HDLA 2011 dataset is employed to validate the proposed method. The proposed method achieves 94.6% recognition accuracy and 0.34 s computation time for Lucida Black-letter font. This method also outperforms better than the existing recognition techniques.
引用
收藏
页码:6275 / 6283
页数:9
相关论文
共 49 条
[1]   Shape retrieval using triangle-area representation and dynamic space warping [J].
Alajlan, Naif ;
El Rube, Ibrahim ;
Kamel, Mohamed S. ;
Freeman, George .
PATTERN RECOGNITION, 2007, 40 (07) :1911-1920
[2]   Multi-resident activity tracking and recognition in smart environments [J].
Alemdar, Hande ;
Ersoy, Cem .
JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2017, 8 (04) :513-529
[3]  
Amit C., 2014, International Journal of Information & Computation Technology, V4, P559
[4]   A Robust Segmentation Technique for Line, Word and Character Extraction from Kannada Text in Low Resolution Display Board Images [J].
Angadi, S. A. ;
Kodabagi, M. M. .
INTERNATIONAL JOURNAL OF IMAGE AND GRAPHICS, 2014, 14 (1-2)
[5]  
[Anonymous], 2010, P INT C INF EM TECHN
[6]  
[Anonymous], 2008, INT J IMAGE PROCESS
[7]   BAS: a perceptual shape descriptor based on the beam angle statistics [J].
Arica, N ;
Vural, FTY .
PATTERN RECOGNITION LETTERS, 2003, 24 (9-10) :1627-1639
[8]  
Asaari MSM, 2013, INT C MACH VIS APPL, P256
[9]   Devnagari numeral recognition by combining decision of multiple connectionist classifiers [J].
Bajaj, R ;
Dey, L ;
Chaudhury, S .
SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES, 2002, 27 (1) :59-72
[10]   A complete OCR for printed Hindi text in Devanagari script [J].
Bansal, V ;
Sinha, RMK .
SIXTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, PROCEEDINGS, 2001, :800-804