A Scale and Rotation Invariant Urdu Nastalique Ligature Recognition Using Cascade Forward Backpropagation Neural Network

被引:7
作者
Rehman, Khawaja Ubaid Ur [1 ]
Khan, Yaser Daanial [1 ]
机构
[1] Univ Management & Technol, Dept Comp Sci, Lahore 54770, Pakistan
关键词
Deep neural network (DNN); optical character recognition (OCR); scale invariant classifier; rotation invariant classifier; OPTICAL CHARACTER-RECOGNITION; MOMENTS;
D O I
10.1109/ACCESS.2019.2936363
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In the emerging age of technologies, machines are becoming more and more skilled and capable just like humans. Despite the fact that machines do not have their own intelligence, but still due to advancement in Artificial Intelligence (AI), machines are rapidly advancing. The area of Pattern Recognition (PR) deals with bringing enhancements to identify obscure patterns corresponding to specific classes. Optical Character Recognition (OCR) is a subfield of PR which deals with the recognition of characters. A great work has been done for Japanese, Hindi, Arabic and Chinese scripts, but only a diminutive work has been done for Urdu script. The Urdu language is highly cursive and is written in different calligraphic styles like Naskh, Nastalique, Kofi, Devani and Riqa. The Nastalique font is very calligraphic with aesthetic beauty. The ligature segmentation of Urdu Nastalique is also more difficult as compared to other languages. Urdu Nastalique has some characteristics like stacking of ligatures and cursiveness which makes its ligature segmentation a difficult task. Cursiveness means ligatures are joined together to form a new shape. It contains connected ligatures which makes it more complicated as compared to other languages. The ligature recognition of Urdu text by an OCR is a strenuous task due to variants of scaling, rotation, orientation and font style. In this study, a scale and rotation invariant classifier for Urdu Nastalique OCR is proposed. A combination of scale and location invariant moments is used for feature extraction and the classification is performed using Cascade Forward Backpropagation Neural Network. The model is validated through independent dataset testing and 5-fold cross-validation which gave 96.474% and 96.922% accuracy. The results depict the adaptability of the proposed model due to its high accuracy for recognition of Urdu Nastalique Ligature.
引用
收藏
页码:120648 / 120669
页数:22
相关论文
共 66 条
  • [1] Acharya J, 2015, 2015 INTERNATIONAL CONFERENCE ON COMPUTING, NETWORKING AND COMMUNICATIONS (ICNC), P1, DOI 10.1109/ICCNC.2015.7069284
  • [2] Ahmad Z, 2007, PROC WRLD ACAD SCI E, V26, P249
  • [3] Ahmad Z, 2009, 2009 2ND IEEE INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND INFORMATION TECHNOLOGY, VOL 3, P452
  • [4] Offline handwritten Arabic cursive text recognition using Hidden Markov Models and re-ranking
    AlKhateeb, Jawad H.
    Ren, Jinchang
    Jiang, Jianmin
    Al-Muhtaseb, Husni
    [J]. PATTERN RECOGNITION LETTERS, 2011, 32 (08) : 1081 - 1088
  • [5] [Anonymous], 2007, P MVA
  • [6] [Anonymous], 2012, P C LANG TECHN
  • [7] [Anonymous], INT J COMPUT TRENDS
  • [8] [Anonymous], 2010, P INT C INF EM TECHN
  • [9] [Anonymous], 2002, MULT TOP C 2002 INMI, DOI DOI 10.1109/INMIC.2002.1310191
  • [10] [Anonymous], 2012, PROCEEDING WORKSHOP