ADOCRNet: A Deep Learning OCR for Arabic Documents Recognition

Cited by: 3
Authors
Mosbah, Lamia [1 ]
Moalla, Ikram [1 ,2 ]
Hamdani, Tarek M. [1 ,3 ]
Neji, Bilel [4 ]
Beyrouthy, Taha [4 ]
Alimi, Adel M. [1 ,5 ]
Affiliations
[1] Univ Sfax, Natl Engn Sch Sfax ENIS, ReGIM Lab, REs Grp Intelligent Machines, Sfax 3038, Tunisia
[2] Al Baha Univ, Coll Comp Sci & Informat Technol, Al Bahah 65511, Saudi Arabia
[3] Univ Monastir, Higher Inst Comp Sci Mahdia ISIMa, Monastir 5000, Tunisia
[4] Amer Univ Middle East, Coll Engn & Technol, Egaila 54200, Kuwait
[5] Univ Johannesburg, Fac Engn & Built Environm, Dept Elect & Elect Engn Sci, Johannesburg 3038, South Africa
Keywords
Arabic; document recognition; CNNs; CTC; deep learning; BLSTM; OCR; NEURAL-NETWORKS; CHARACTER-RECOGNITION;
DOI
10.1109/ACCESS.2024.3379530
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
In recent years, optical character recognition (OCR) has seen a resurgence of interest, especially for contemporary Arabic data. OCR for printed and handwritten Arabic script nevertheless remains a challenging task because of the specific characteristics of the script. In this work, we address these challenges with ADOCRNet, a deep learning OCR for Arabic document recognition. It is a novel deep learning framework whose architecture is built from Convolutional Neural Network (CNN) layers and Bidirectional Long Short-Term Memory (BLSTM) layers, trained using the Connectionist Temporal Classification (CTC) algorithm. To assess its performance, the proposed system is evaluated on two printed text datasets, P-KHATT (text line images) and APTI (word images), and on a handwritten Arabic text dataset, IFN/ENIT (word images). In these experiments, the model achieves strong recognition rates on all three datasets. ADOCRNet reaches a Character Error Rate (CER) of 0.01% on the P-KHATT dataset and 0.03% on the APTI dataset, and a Word Error Rate (WER) of 1.09% on the IFN/ENIT dataset, significantly outperforming current systems.
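The abstract reports results as CER and WER and names CTC as the training criterion. As a minimal sketch of what those terms usually mean, the Python below computes CER and WER from Levenshtein edit distance and performs greedy (best-path) CTC decoding. This is an illustration under assumptions, not the authors' evaluation code: the function names, the choice of index 0 for the CTC blank, and normalising by reference length are not taken from the paper.

```python
from typing import Hashable, List, Sequence


def edit_distance(ref: Sequence[Hashable], hyp: Sequence[Hashable]) -> int:
    """Levenshtein distance: minimum substitutions, insertions and deletions."""
    prev = list(range(len(hyp) + 1))  # distance from empty ref prefix
    for i, r in enumerate(ref, start=1):
        curr = [i]  # distance from ref[:i] to empty hyp prefix
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return prev[-1]


def cer(ref: str, hyp: str) -> float:
    """Character Error Rate in percent (assumed: normalised by reference length)."""
    if not ref:
        raise ValueError("reference must not be empty")
    return 100.0 * edit_distance(ref, hyp) / len(ref)


def wer(ref: str, hyp: str) -> float:
    """Word Error Rate in percent, with words split on whitespace."""
    ref_words = ref.split()
    if not ref_words:
        raise ValueError("reference must contain at least one word")
    return 100.0 * edit_distance(ref_words, hyp.split()) / len(ref_words)


def ctc_greedy_decode(frame_labels: Sequence[int], blank: int = 0) -> List[int]:
    """Best-path CTC decoding: collapse repeated labels, then drop blanks.

    `frame_labels` holds the arg-max label index per time step; using
    index 0 as the blank symbol is an assumption of this sketch.
    """
    decoded: List[int] = []
    prev = None
    for label in frame_labels:
        if label != prev and label != blank:
            decoded.append(label)
        prev = label
    return decoded
```

For example, `cer("abcd", "abed")` gives 25.0 (one substitution over four reference characters), and `ctc_greedy_decode([0, 1, 1, 0, 1, 2, 2, 0])` gives `[1, 1, 2]`, since the blank between the two runs of label 1 keeps them as separate characters.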
Pages: 55620-55631
Number of pages: 12