ADOCRNet: A Deep Learning OCR for Arabic Documents Recognition

被引：3

作者：

Mosbah, Lamia ^{[1
]}

Moalla, Ikram ^{[1
,2
]}

Hamdani, Tarek M. ^{[1
,3
]}

Neji, Bilel ^{[4
]}

Beyrouthy, Taha ^{[4
]}

Alimi, Adel M. ^{[1
,5
]}

机构：

[1] Univ Sfax, Natl Engn Sch Sfax ENIS, ReGIM Lab, REs Grp Intelligent Machines, Sfax 3038, Tunisia

[2] Al Baha Univ, Coll Comp Sci & Informat Technol, Al Bahah 65511, Saudi Arabia

[3] Univ Monastir, Higher Inst Comp Sci Mahdia ISIMa, Monastir 5000, Tunisia

[4] Amer Univ Middle East, Coll Engn & Technol, Egaila 54200, Kuwait

[5] Univ Johannesburg, Fac Engn & Built Environm, Dept Elect & Elect Engn Sci, Johannesburg 3038, South Africa

来源：

IEEE ACCESS | 2024年 / 12卷

关键词：

Arabic; document recognition; CNNs; CTC; deep learning; BLSTM; OCR; NEURAL-NETWORKS; CHARACTER-RECOGNITION;

D O I：

10.1109/ACCESS.2024.3379530

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In recent years, Optical character recognition (OCR) has experienced a resurgence of interest especially for contemporary Arabic data. In fact, OCR development for printed and handwritten Arabic script is still a challenging task. These challenges are due to the specific characteristics of the Arabic script. In this work, we attempt to address these challenges by creating a deep learning OCR for Arabic document recognition called ADOCRNet. It is a novel deep learning framework whose architecture is built of layers of Convolutional Neural Networks (CNNs) and Bidirectional Long Short-Term Memory (BLSTM) trained using Connectionist Temporal Classification (CTC) algorithm. In order to assess the performance of our OCR, the proposed system is performed on two printed text datasets which are P-KHATT (text line images) and APTI (word images). It's also evaluated on a handwritten Arabic text dataset IFN/ENIT (word images). According to the practical tests, the conceived model achieves strength recognition rates on the three datasets. ADOCRNet reaches a Character Error Rate (CER) of 0.01% on the P-KHATT dataset, 0.03% on the APTI dataset and a Word Error Rate (WER) of 1.09% on the IFN/ENIT dataset, which significantly outperforms the outcomes of the current systems.

引用

页码：55620 / 55631

页数：12

共 50 条

[1] A Deep OCR for Degraded Bangla Documents
Chaudhury, Ayan
Mukherjee, Partha Sarathi
Das, Sudip
Biswas, Chandan
Bhattacharya, Ujjwal
ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2022, 21 (05)
[2] Arabic Speech Recognition with Deep Learning: A Review
Algihab, Wajdan
Alawwad, Noura
Aldawish, Anfal
AlHumoud, Sarah
SOCIAL COMPUTING AND SOCIAL MEDIA: DESIGN, HUMAN BEHAVIOR AND ANALYTICS, SCSM 2019, PT I, 2019, 11578 : 15 - 31
[3] Wild OCR: Deep Learning Architecture for Text Recognition in Images
Amudha, J.
Thakur, Manmohan Singh
Shrivastava, Anupriya
Gupta, Shubham
Gupta, Deepa
Sharma, Kshitij
PROCEEDINGS OF INTERNATIONAL CONFERENCE ON COMPUTING AND COMMUNICATION NETWORKS (ICCCN 2021), 2022, 394 : 499 - 506
[4] Analysis of Recent Deep Learning Techniques for Arabic Handwritten-Text OCR and Post-OCR Correction
Najam, Rayyan
Faizullah, Safiullah
APPLIED SCIENCES-BASEL, 2023, 13 (13):
[5] Deep Learning Based Sinhala Optical Character Recognition (OCR)
Anuradha, Isuri
Liyanage, Chamila
Wijayawardhana, Harsha
Weerasinghe, Ruvan
2020 20TH INTERNATIONAL CONFERENCE ON ADVANCES IN ICT FOR EMERGING REGIONS (ICTER-2020), 2020, : 298 - 299
[6] Using Unsupervised Deep Learning for Automatic Summarization of Arabic Documents
Nabil Alami
Noureddine En-nahnahi
Said Alaoui Ouatik
Mohammed Meknassi
Arabian Journal for Science and Engineering, 2018, 43 : 7803 - 7815
[7] Semantic Annotation of Arabic Web Documents using Deep Learning
Albukhitan, Saeed
Alnazer, Ahmed
Helmy, Tarek
9TH INTERNATIONAL CONFERENCE ON AMBIENT SYSTEMS, NETWORKS AND TECHNOLOGIES (ANT 2018) / THE 8TH INTERNATIONAL CONFERENCE ON SUSTAINABLE ENERGY INFORMATION TECHNOLOGY (SEIT-2018) / AFFILIATED WORKSHOPS, 2018, 130 : 589 - 596
[8] Using Unsupervised Deep Learning for Automatic Summarization of Arabic Documents
Alami, Nabil
En-nahnahi, Noureddine
Ouatik, Said Alaoui
Meknassi, Mohammed
ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2018, 43 (12) : 7803 - 7815
[9] Table of Contents Recognition in OCR Documents using Image-based Machine Learning
Kosaraju, Sai
Tsaku, Nelson Zange
Patel, Pritesh
Bayramoglu, Tanju
Modgil, Girish
Kang, Mingon
PROCEEDINGS OF THE 2019 ANNUAL ACM SOUTHEAST CONFERENCE (ACMSE 2019), 2019, : 186 - 189
[10] A Deep Learning Approach for Handwritten Arabic Names Recognition
Mustafa, Mohamed Elhafiz
Elbashir, Murtada Khalafallah
INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2020, 11 (01) : 678 - 682

← 1 2 3 4 5 →