Script pattern identification of word images using multi-directional and multi-scalable textures

被引:4
作者
Sahare, Parul [1 ]
Dhok, Sanjay B. [2 ]
机构
[1] Indian Inst Informat Technol, Dept Elect & Commun Engn, Nagpur, Maharashtra, India
[2] Visvesvaraya Natl Inst Technol, Ctr VLSI & Nanotechnol, Nagpur, Maharashtra, India
关键词
Document analysis; Log-polar transform; Rotational invariant; Script identification; Wavelet transform; LANGUAGE IDENTIFICATION; CHARACTER-RECOGNITION; CURVELET TRANSFORM; HANDWRITTEN; SEPARATION; MACHINE; NOISY; SEGMENTATION; ALGORITHMS; FILTER;
D O I
10.1007/s12652-020-02718-0
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As a precursor of optical character recognition (OCR) technology, script identification finds many applications like sorting and indexing of document images. Classifying these scripts, especially at different scales and orientations, is one of the interesting and vital problems in the field of document image analysis. In this paper, an algorithm is proposed for the identification of scripts using scale and rotation robust log-polar wavelet and semi decimated wavelet features. Initially, words are segmented from document images in the form of text-blobs by the Gaussian filter. Then, texture features are calculated using a combination of discrete wavelet and semi decimated discrete wavelet transforms in log-polar domain. Here, most of the rotational and scale variations are removed in log-polar domain, whereas wavelet transform is capable of extracting the information at different resolution levels. This helps in the formation of significant textures for the purpose of characterization. At last, k-nearest neighbor classifier is used for the identification of scripts. Comprehensive experiments on different databases illustrate the effectiveness of the proposed algorithm. Benchmarking analysis shows that a maximum recall rate of 98.96% is obtained, and demonstrates better performance compared to the other contemporary approaches.
引用
收藏
页码:9739 / 9755
页数:17
相关论文
共 66 条
[1]   Handwritten Arabic numerals recognition using convolutional neural network [J].
Ahamed, Pratik ;
Kundu, Soumyadeep ;
Khan, Tauseef ;
Bhateja, Vikrant ;
Sarkar, Ram ;
Mollah, Ayatullah Faruk .
JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2020, 11 (11) :5445-5457
[2]  
[Anonymous], 2010, J AUTOM CONTROL, DOI DOI 10.2298/JAC1001017B
[3]  
[Anonymous], 2013, IOSR J, DOI DOI 10.9790/0661-12597102
[4]   An approach to the script discrimination in the Slavic documents [J].
Brodic, Darko ;
Milivojevic, Zoran N. ;
Maluckov, Cedomir A. .
SOFT COMPUTING, 2015, 19 (09) :2655-2665
[5]   Texture for script identification [J].
Busch, A ;
Boles, WW ;
Sridharan, S .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2005, 27 (11) :1720-1732
[6]  
Busch A, 2004, 2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL III, PROCEEDINGS, P569
[7]  
Busch A, 2002, INT CONF ACOUST SPEE, P3584
[8]   Image retrieval using BDIP and BVLC moments [J].
Chun, YD ;
Seo, SY ;
Kim, NC .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2003, 13 (09) :951-957
[9]  
Echi A.K., 2014, Electronic Letters on Computer Vision and Image Analysis, V13, P1
[10]   Script Recognition-A Review [J].
Ghosh, Debashis ;
Dube, Tulika ;
Shivaprasad, Adamane P. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2010, 32 (12) :2142-2161