Script pattern identification of word images using multi-directional and multi-scalable textures

Cited by: 4
Authors
Sahare, Parul [1]
Dhok, Sanjay B. [2]
Affiliations
[1] Indian Inst Informat Technol, Dept Elect & Commun Engn, Nagpur, Maharashtra, India
[2] Visvesvaraya Natl Inst Technol, Ctr VLSI & Nanotechnol, Nagpur, Maharashtra, India
Keywords
Document analysis; Log-polar transform; Rotational invariant; Script identification; Wavelet transform; LANGUAGE IDENTIFICATION; CHARACTER-RECOGNITION; CURVELET TRANSFORM; HANDWRITTEN; SEPARATION; MACHINE; NOISY; SEGMENTATION; ALGORITHMS; FILTER
DOI
10.1007/s12652-020-02718-0
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
As a precursor to optical character recognition (OCR), script identification has many applications, such as sorting and indexing document images. Classifying scripts, especially at different scales and orientations, is an important problem in document image analysis. This paper proposes an algorithm that identifies scripts using scale- and rotation-robust log-polar wavelet and semi-decimated wavelet features. First, words are segmented from document images as text blobs using a Gaussian filter. Texture features are then computed by combining the discrete wavelet transform and the semi-decimated discrete wavelet transform in the log-polar domain. Most rotational and scale variations are removed in the log-polar domain, while the wavelet transform extracts information at different resolution levels, which yields distinctive textures for characterizing each script. Finally, a k-nearest neighbor classifier identifies the script. Comprehensive experiments on different databases illustrate the effectiveness of the proposed algorithm. Benchmarking shows a maximum recall rate of 98.96% and better performance than other contemporary approaches.
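The pipeline described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: it skips the Gaussian-filter word segmentation, substitutes a plain decimated Haar wavelet for both wavelet transforms (the semi-decimated transform is not reproduced), and uses sub-band energies as texture features. All function names and parameters (log_polar, haar_dwt2, texture_features, knn_predict, n_rho, n_theta, levels, k) are hypothetical and chosen for illustration only.

```python
import math
from collections import Counter


def log_polar(img, n_rho=16, n_theta=16):
    """Resample a grayscale image (list of rows) onto a log-polar grid
    around its centre, using nearest-neighbour lookup."""
    h, w = len(img), len(img[0])
    cy, cx = (h - 1) / 2.0, (w - 1) / 2.0
    r_max = min(cy, cx)
    out = []
    for i in range(n_rho):
        # Radii grow exponentially up to r_max (uniform steps in log-radius).
        r = math.exp(math.log(r_max) * (i + 1) / n_rho)
        row = []
        for j in range(n_theta):
            t = 2.0 * math.pi * j / n_theta
            y = min(max(int(round(cy + r * math.sin(t))), 0), h - 1)
            x = min(max(int(round(cx + r * math.cos(t))), 0), w - 1)
            row.append(float(img[y][x]))
        out.append(row)
    return out


def haar_dwt2(a):
    """One level of the orthonormal 2-D Haar DWT.
    Returns the approximation band and three detail bands."""
    rows, cols = len(a) // 2, len(a[0]) // 2
    ll, lh, hl, hh = ([[0.0] * cols for _ in range(rows)] for _ in range(4))
    for i in range(rows):
        for j in range(cols):
            p, q = a[2 * i][2 * j], a[2 * i][2 * j + 1]
            r, s = a[2 * i + 1][2 * j], a[2 * i + 1][2 * j + 1]
            ll[i][j] = (p + q + r + s) / 2
            lh[i][j] = (p + q - r - s) / 2
            hl[i][j] = (p - q + r - s) / 2
            hh[i][j] = (p - q - r + s) / 2
    return ll, lh, hl, hh


def energy(band):
    """Mean squared coefficient of one sub-band."""
    return sum(v * v for row in band for v in row) / (len(band) * len(band[0]))


def texture_features(img, levels=2):
    """Detail-band energies at each level plus the final approximation energy,
    all computed on the log-polar version of the image."""
    cur = log_polar(img)
    feats = []
    for _ in range(levels):
        cur, lh, hl, hh = haar_dwt2(cur)
        feats += [energy(lh), energy(hl), energy(hh)]
    feats.append(energy(cur))
    return feats


def knn_predict(train, query, k=3):
    """Majority vote among the k training vectors closest to the query.
    `train` is a list of (feature_vector, label) pairs."""
    nearest = sorted(train, key=lambda fv: math.dist(fv[0], query))[:k]
    return Counter(label for _, label in nearest).most_common(1)[0][0]
```

In the log-polar domain, a rotation of the word image about its centre becomes a cyclic shift along the angle axis and a change of scale becomes a shift along the log-radius axis, which is why features computed there vary less with orientation and size. A classifier would be used by calling texture_features on each labelled word image to build the training list and then passing the features of an unseen word image to knn_predict.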
Pages: 9739-9755
Page count: 17