Novel Geometrical Shape Feature Extraction Techniques for Multilingual Character Recognition

被引:13
作者
Soora, Narasimha Reddy [1 ]
Deshpande, Parag S. [1 ]
机构
[1] Visvesvaraya Natl Inst Technol, Comp Sci & Engn, Nagpur, Maharashtra, India
关键词
Crossing count features; Edit distance; Feature extraction; Multilingual character recognition; Shape geometry; Shape symbol; LICENSE PLATE-RECOGNITION; SYSTEM; TEXT;
D O I
10.1080/02564602.2016.1229583
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Multilingual character recognition from the images of aged Indian documents is challenging because of the complex character grapheme of the Indian language scripts. Feature extraction plays the most important role in recognition of such images. In this paper, we have proposed a set of feature vectors (FVs) which are based on shape geometry (SG) decoding of the input character. The first FV is based on SG decoding of the input character using triangular area (TA) calculation. The second FV, namely, SG using perpendicular distance is extracted by dividing the input image into individual components and the shape of the individual component is decoded into shape symbols by comparing the normalized perpendicular distances of the individual pixels of the component onto the line joining the end points of the component. Apart from the proposed FVs, we have used crossing count features. These FVs are represented as the string of shape operators; hence, we have used minimum edit distance classifier to recognize the input character. The proposed character recognition technique is evaluated using the characters extracted from printed aged multilingual Indian documents having English, Devanagari, and Marathi scripts and achieved encouraging results. To further assess the performance of the proposed system, we have considered publicly available media-lab license plate benchmark database and achieved significant performance.
引用
收藏
页码:612 / 621
页数:10
相关论文
共 17 条
[1]   A license plate-recognition algorithm for intelligent transportation system applications [J].
Anagnostopoulos, Christos Nikolaos E. ;
Anagnostopoulos, Ioannis E. ;
Loumos, Vassili ;
Kayafas, Eleftherios .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2006, 7 (03) :377-392
[2]  
Anagnostopoulos I. E., MEDIALAB LPR DATABAS
[3]   Multilingual OCR system for South Indian scripts and English documents: An approach based on Fourier transform and principal component analysis [J].
Aradhya, V. N. Manjunath ;
Kumar, G. Hemantha ;
Noushath, S. .
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2008, 21 (04) :658-668
[4]  
Bag S., PATTERN RECOGNIT, V47, P1187
[5]  
Basu S., 2005, P 2 IND INT C ART IN, P407
[6]  
Gaurav D. D., 2012, ARXIV12023884V1 CORN, V1202, P1
[7]   Application-Oriented License Plate Recognition [J].
Hsu, Gee-Sern ;
Chen, Jiun-Chang ;
Chung, Yu-Zu .
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2013, 62 (02) :552-561
[8]  
Iqbal A, 2008, P SCIS ISIS NAG, P1367
[9]  
Kale Karbhari V., 2014, International Journal of Advanced Research in Artificial Intelligence, V3, P68
[10]   A novel feature extraction technique for offline handwritten Gurmukhi character recognition [J].
Kumar, Munish ;
Sharma, R. K. ;
Jindal, Manish Kumar .
IETE JOURNAL OF RESEARCH, 2013, 59 (06) :687-692