Urdu handwritten text recognition: a survey

被引:10
作者
Husnain, Mujtaba [1 ]
Saad Missen, Malik Muhammad [1 ]
Mumtaz, Shahzad [1 ]
Coustaty, Mickael [2 ]
Luqman, Muzzamil [2 ]
Ogier, Jean-Marc [2 ]
机构
[1] Islamia Univ Bahawalpur, Dept Comp Sci & IT, Bahawalpur 63100, Pakistan
[2] Univ La Rochelle, L3i Lab, Ave Michel Crepeau, F-17000 La Rochelle, France
关键词
optical character recognition; handwritten character recognition; text analysis; natural language processing; document image processing; Urdu script; interesting field; challenging field; online handwritten text recognition systems; recognition process; different granularity levels; surveyed articles; granularity level; Urdu handwritten text recognition approaches; SCRIPT; IMPLEMENTATION; SEGMENTATION; FEATURES;
D O I
10.1049/iet-ipr.2019.0401
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Work on the problem of handwritten text recognition in Urdu script has been an active research area. A significant progress is made in this interesting and challenging field in the last few years. In this study, the authors presented a comprehensive survey for a number of offline and online handwritten text recognition systems for Urdu script written in Nastaliq font style from 2004 to 2019. Following features make their contribution worthwhile and unique among the reviews of a similar kind: (i) their review classifies the existing studies based on types of recognition systems used for Urdu handwritten text, (ii) it covers a very different outlook of the recognition process of the Urdu handwritten text at different granularity levels (e.g. character, word, ligature, or sentence level), (iii) this review article also presents each of surveyed articles in following dimensions: the task performed, its granularity level, dataset used, results obtained, and future dimensions, and (iv) lastly it gives the summary of the surveyed articles according to the granularity levels, publishing years, related tasks or subtasks, and types of classifiers used. In the end, major challenges and tasks related to Urdu handwritten text recognition approaches are also discussed in detail.
引用
收藏
页码:2291 / 2300
页数:10
相关论文
共 76 条
  • [1] Ligature based Urdu Nastaleeq sentence recognition using gated bidirectional long short term memory
    Ahmad, Ibrar
    Wang, Xiaojie
    Mao, Yuz Hao
    Liu, Guang
    Ahmad, Haseeb
    Ullah, Rahat
    [J]. CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2018, 21 (01): : 703 - 714
  • [2] Ahmad Z, 2007, PROC WRLD ACAD SCI E, V26, P249
  • [3] Akram QU, 2017, 2017 1ST INTERNATIONAL WORKSHOP ON ARABIC SCRIPT ANALYSIS AND RECOGNITION (ASAR), P129, DOI 10.1109/ASAR.2017.8067774
  • [4] Multi-Language Handwritten Digits Recognition based on Novel Structural Features
    Alghazo, Jaafar M.
    Latif, Ghazanfar
    Alzubaidi, Loay
    Elhassan, Ammar
    [J]. JOURNAL OF IMAGING SCIENCE AND TECHNOLOGY, 2019, 63 (02)
  • [5] Ali A, 2004, INMIC 2004: 8th International Multitopic Conference, Proceedings, P79
  • [6] [Anonymous], THESIS
  • [7] [Anonymous], 2007, P MVA
  • [8] [Anonymous], 2012, ACM Transactions on Asian Language Information Processing, DOI [DOI 10.1145/2090176, DOI 10.1145/2090176.2090177, 10.1145/2090176.2090177]
  • [9] [Anonymous], 1975, Computer Graph-ics and Image Processing
  • [10] [Anonymous], 2019, P FUTURE INFORM COMM