Analysis of Cursive Text Recognition Systems: A Systematic Literature Review

被引:2
作者
Khan, Sulaiman [1 ,2 ]
Nazir, Shah [2 ]
Khan, Habib Ullah [3 ]
机构
[1] Hamad Bin Khalifa Univ, Qatar Fdn, Coll Sci & Engn, Doha, Qatar
[2] Univ Swabi, Dept Comp Sci, Swabi, Pakistan
[3] Qatar Univ, Coll Business & Econ, Dept Accounting & Informat Syst, Doha, Qatar
关键词
Cursive languages; recognition algorithms; feature techniques; systematic literature review; HANDWRITTEN TEXT; WORD RECOGNITION; SEGMENTATION; LINE; CHARACTERS; FEATURES; EXTRACTION; FRAMEWORK; ALGORITHM; DATABASE;
D O I
10.1145/3592600
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Regional and cultural diversities around the world have given birth to a large number of writing systems and scripts, which consist of varying character sets. Developing an optimal character recognition for such a varying and large character set is a challenging task. Unlimited variations in handwritten text due to mood swings, varying writing styles, changes in medium of writing, and many more puzzle the research community. To overcome this problem, researchers have proposed various techniques for the automatic recognition of cursive languages like Urdu, Pashto, and Arabic. With the passage of time, the field of text recognition matured, and the number of publications exponentially increased in the targeted field. It is very difficult to find all the techniques developed, calculate the time and resource consumptions, and understand the cost-benefit tradeoffs among these techniques. These tradeoffs resist making this technology able for practical use. To address these tradeoffs, this article systematic analysis to identify gaps in the literature and suggest new enhanced solution accordingly. A total of 153 of the most relevant articles from 2008 to 2022 are analyzed in this systematic literature review (SLR) work. This systematic review process shows (1) the list of techniques suggested for cursive text recognition purposes and its capabilities, (2) set of feature extraction techniques proposed, and (3) implementation tools used to design and simulate the empirical studies in this specialized field. We have also discussed the emerging trends and described their implications for the research community in this specialized domain. This systematic assessment will ultimately help researchers to perform an overview of the existing character/text recognition approaches, recognition capabilities, and time consumption and subsequently identify the areas that requires a significant attention in the near future.
引用
收藏
页数:30
相关论文
共 180 条
  • [1] Recognizing handwritten Arabic words using grapheme segmentation and recurrent neural networks
    Abandah, Gheith A.
    Jamour, Fuad T.
    Qaralleh, Esam A.
    [J]. INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2014, 17 (03) : 275 - 291
  • [2] Recognition for old Arabic manuscripts using spatial gray level dependence (SGLD)
    Abd Al-Aziz, Ahmad M.
    Gheith, Mervat
    Sayed, Ayman F.
    [J]. EGYPTIAN INFORMATICS JOURNAL, 2011, 12 (01) : 37 - 43
  • [3] Abdalkafor A. S., 2018, P 1 INT C COMP APPL, P1
  • [4] A large vocabulary system for Arabic online handwriting recognition
    Abdelaziz, Ibrahim
    Abdou, Sherif
    Al-Barhamtoshy, Hassanin
    [J]. PATTERN ANALYSIS AND APPLICATIONS, 2016, 19 (04) : 1129 - 1141
  • [5] Arabic character recognition using a Haar cascade classifier approach (HCC)
    AbdelRaouf, Ashraf
    Higgins, Colin A.
    Pridmore, Tony
    Khalil, Mahmoud I.
    [J]. PATTERN ANALYSIS AND APPLICATIONS, 2016, 19 (02) : 411 - 426
  • [6] ONLINE RECOGNITION SYSTEM FOR HANDWRITTEN ARABIC MATHEMATICAL SYMBOLS
    Abuzaraida, Mustafa Ali
    Zeki, Akram M.
    Zeki, Ahmed M.
    [J]. 2013 International Conference on Advanced Computer Science Applications and Technologies (ACSAT), 2014, : 223 - 227
  • [7] Employing fisher discriminant analysis for Arabic: text classification
    AbuZeina, Dia
    Al-Anzi, Fawaz S.
    [J]. COMPUTERS & ELECTRICAL ENGINEERING, 2018, 66 : 474 - 486
  • [8] Ligature based Urdu Nastaleeq sentence recognition using gated bidirectional long short term memory
    Ahmad, Ibrar
    Wang, Xiaojie
    Mao, Yuz Hao
    Liu, Guang
    Ahmad, Haseeb
    Ullah, Rahat
    [J]. CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2018, 21 (01): : 703 - 714
  • [9] Line and Ligature Segmentation of Urdu Nastaleeq Text
    Ahmad, Ibrar
    Wang, Xiaojie
    Li, Ruifan
    Ahmed, Manzoor
    Ullah, Rahat
    [J]. IEEE ACCESS, 2017, 5 : 10924 - 10940
  • [10] Ahmad R, 2017, 2017 1ST INTERNATIONAL WORKSHOP ON ARABIC SCRIPT ANALYSIS AND RECOGNITION (ASAR), P168, DOI 10.1109/ASAR.2017.8067781