Improving Handwritten Arabic Text Recognition Using an Adaptive Data-Augmentation Algorithm

Cited by: 3
Authors
Eltay, Mohamed [1]
Zidouri, Abdelmalek [1]
Ahmad, Irfan [2]
Elarian, Yousef [3]
Affiliations
[1] King Fahd Univ Petr & Minerals, Interdisciplinary Res Ctr Intelligent Secure Syst, Elect Engn Dept, Dhahran, Saudi Arabia
[2] King Fahd Univ Petr & Minerals, Interdisciplinary Res Ctr Intelligent Secure Syst, Informat & Comp Sci Dept, Dhahran, Saudi Arabia
[3] Cambrian Coll, Sudbury, ON, Canada
Source
DOCUMENT ANALYSIS AND RECOGNITION, ICDAR 2021 WORKSHOPS, PT I | 2021 / Vol. 12916
Keywords
Handwriting recognition; Deep Learning Neural Network; Data augmentation; Recurrent Neural Network; Connectionist temporal classification;
DOI
10.1007/978-3-030-86198-8_23
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Deep learning has increased the performance of classification and object detection, but it generally requires large amounts of labeled data for training. In this paper, we introduce a new data augmentation algorithm that promotes diversity between classes, representing the characters of the Arabic script, and can balance samples between different classes. This algorithm gives each word in the lexicon a weight. The weight of a word is based on the occurrence probabilities of the characters constituting the word. Minority classes are given higher weight as compared to the classes frequently occurring in the text. The data augmentation technique was evaluated on a handwritten word recognition task using the publicly available IFN/ENIT and AHDB datasets. We see significant improvement in results by employing our data augmentation technique, and we achieve state-of-the-art results on both datasets.
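The word-weighting idea in the abstract can be illustrated with a minimal sketch. The abstract does not give the exact formula, so everything here is an assumption: the helper names `char_probabilities` and `word_weights` are hypothetical, the weight is taken to be the mean negative log-probability of a word's characters, and the toy lexicon uses Latin placeholder characters instead of Arabic ones. It is not the authors' implementation, only one way such a weighting could behave.

```python
from collections import Counter
import math


def char_probabilities(lexicon):
    """Estimate each character's occurrence probability over the lexicon."""
    counts = Counter(ch for word in lexicon for ch in word)
    total = sum(counts.values())
    return {ch: n / total for ch, n in counts.items()}


def word_weights(lexicon):
    """Assumed weighting: mean negative log-probability of a word's characters.

    Words made of rare characters (minority classes) receive larger weights.
    Weights are normalized to sum to 1 so they can act as sampling
    probabilities when choosing which words to augment.
    Assumes every word in the lexicon is non-empty.
    """
    probs = char_probabilities(lexicon)
    raw = {
        word: sum(-math.log(probs[ch]) for ch in word) / len(word)
        for word in lexicon
    }
    total = sum(raw.values())
    if total == 0:
        # Degenerate case: a single character class, so weight uniformly.
        return {word: 1 / len(raw) for word in raw}
    return {word: r / total for word, r in raw.items()}


# Toy lexicon: 'a' is frequent, 'b' is less frequent, 'c' is rare.
lexicon = ["aab", "aaa", "abc"]
weights = word_weights(lexicon)
for word, weight in sorted(weights.items(), key=lambda kv: -kv[1]):
    print(f"{word}: {weight:.3f}")
```

In this sketch the word containing the rarest character ranks highest, so sampling words for augmentation in proportion to these weights would push the character distribution toward balance.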
Pages: 322-335
Page count: 14
Related papers
50 in total
[21]   Improving speech recognition using data augmentation and acoustic model fusion [J].
Rebai, Ilyes ;
BenAyed, Yessine ;
Mahdi, Walid ;
Lorre, Jean-Pierre .
KNOWLEDGE-BASED AND INTELLIGENT INFORMATION & ENGINEERING SYSTEMS, 2017, 112 :316-322
[22]   A Text Data Augmentation Approach for Improving the Performance of CNN [J].
Abulaish, Muhammad ;
Sah, Amit Kumar .
2019 11TH INTERNATIONAL CONFERENCE ON COMMUNICATION SYSTEMS & NETWORKS (COMSNETS), 2019, :660-665
[23]   Data Augmentation and Text Recognition on Khmer Historical Manuscripts [J].
Valy, Dona ;
Verleysen, Michel ;
Chhun, Sophea .
2020 17TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR 2020), 2020, :73-78
[24]   Improving Automated Evaluation of Student Text Responses Using GPT-3.5 for Text Data Augmentation [J].
Cochran, Keith ;
Cohn, Clayton ;
Rouet, Jean Francois ;
Hastings, Peter .
ARTIFICIAL INTELLIGENCE IN EDUCATION, AIED 2023, 2023, 13916 :217-228
[25]   Improving Diacritical Arabic Speech Recognition: Transformer-Based Models with Transfer Learning and Hybrid Data Augmentation [J].
Alaqel, Haifa ;
El Hindi, Khalil .
INFORMATION, 2025, 16 (03)
[26]   Arabic Handwritten Word Recognition Using HMMs with Explicit State Duration [J].
A. Benouareth ;
A. Ennaji ;
M. Sellami .
EURASIP Journal on Advances in Signal Processing, 2008
[27]   A Novel Approach of Bangla Handwritten Text Recognition using HMM [J].
Roy, Partha Pratim ;
Dey, Prasenjit ;
Roy, Sangheeta ;
Pal, Umapada ;
Kimura, Fumitaka .
2014 14TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR), 2014, :661-666
[28]   Script-Level Word Sample Augmentation for Few-Shot Handwritten Text Recognition [J].
Chen, Wei ;
Su, Xiangdong ;
Zhang, Haoran .
FRONTIERS IN HANDWRITING RECOGNITION, ICFHR 2022, 2022, 13639 :316-330
[29]   Emotion Recognition in Bangla Text: An Ensemble Approach with Data Augmentation Using BanglaBERT and MultiBERT [J].
Halder, Nabarun ;
Alam, Armun ;
Setu, Jahanggir Hossain ;
Islam, Ashraful ;
Amin, M. Ashraful .
2025 17TH INTERNATIONAL CONFERENCE ON COMPUTER AND AUTOMATION ENGINEERING, ICCAE, 2025, :6-11
[30]   Impact of Data-Augmentation on Brain Tumor Detection Using Different YOLO Versions Models [J].
Ishtaiwi, Abdelraouf ;
Ali, Ali ;
Al-Qerem, Ahmed ;
Alsmadi, Yazan ;
Aldweesh, Amjad ;
Alauthman, Mohammed ;
Alzubi, Omar ;
Nashwan, Shadi ;
Ramadan, Awad ;
Al-Zghoul, Musab ;
Alangari, Someah .
INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2024, 21 (03) :466-482