Improving Handwritten Arabic Text Recognition Using an Adaptive Data-Augmentation Algorithm

Cited by: 3
Authors
Eltay, Mohamed [1]
Zidouri, Abdelmalek [1]
Ahmad, Irfan [2]
Elarian, Yousef [3]
Affiliations
[1] King Fahd Univ Petr & Minerals, Interdisciplinary Res Ctr Intelligent Secure Syst, Elect Engn Dept, Dhahran, Saudi Arabia
[2] King Fahd Univ Petr & Minerals, Interdisciplinary Res Ctr Intelligent Secure Syst, Informat & Comp Sci Dept, Dhahran, Saudi Arabia
[3] Cambrian Coll, Sudbury, ON, Canada
Source
DOCUMENT ANALYSIS AND RECOGNITION, ICDAR 2021 WORKSHOPS, PT I | 2021 / Vol. 12916
Keywords
Handwriting recognition; Deep Learning Neural Network; Data augmentation; Recurrent Neural Network; Connectionist temporal classification;
DOI
10.1007/978-3-030-86198-8_23
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Deep learning has increased the performance of classification and object detection, but it generally requires large amounts of labeled data for training. In this paper, we introduce a new data augmentation algorithm that promotes diversity between classes, representing the characters of the Arabic script, and can balance samples between different classes. This algorithm gives each word in the lexicon a weight. The weight of a word is based on the occurrence probabilities of the characters constituting the word. Minority classes are given higher weight as compared to the classes frequently occurring in the text. The data augmentation technique was evaluated on a handwritten word recognition task using the publicly available IFN/ENIT and AHDB datasets. We see significant improvement in results by employing our data augmentation technique, and we achieve state-of-the-art results on both datasets.
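The abstract describes the weighting idea only in words, so the following is a minimal sketch of one way it could work, not the authors' actual formula. It assumes a word's weight is the mean inverse occurrence probability of its characters, normalized over the lexicon; the function names `char_probabilities` and `word_weights` and the toy Latin-letter lexicon are hypothetical and used for illustration only.

```python
from collections import Counter


def char_probabilities(lexicon):
    """Estimate each character's occurrence probability over the lexicon."""
    counts = Counter(ch for word in lexicon for ch in word)
    total = sum(counts.values())
    return {ch: n / total for ch, n in counts.items()}


def word_weights(lexicon):
    """Assign each word a weight that grows as its characters get rarer.

    Assumption (not from the paper): weight = mean of 1 / p(char) over the
    word's characters, then normalized so all weights sum to 1.
    """
    probs = char_probabilities(lexicon)
    raw = {w: sum(1.0 / probs[ch] for ch in w) / len(w) for w in lexicon}
    norm = sum(raw.values())
    return {w: r / norm for w, r in raw.items()}


# Toy lexicon: 'a' is frequent, while 'x', 'y', 'z' are minority characters.
lexicon = ["aab", "aac", "aad", "xyz"]
weights = word_weights(lexicon)
for word, weight in sorted(weights.items(), key=lambda kv: -kv[1]):
    print(f"{word}: {weight:.3f}")
```

Under this assumed scheme, words built from rarely seen characters receive the largest weights, so an augmentation step that samples words in proportion to these weights would generate more examples of minority character classes.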
Pages: 322 - 335
Number of pages: 14
Related papers
50 in total
  • [21] A Text Data Augmentation Approach for Improving the Performance of CNN
    Abulaish, Muhammad
    Sah, Amit Kumar
    2019 11TH INTERNATIONAL CONFERENCE ON COMMUNICATION SYSTEMS & NETWORKS (COMSNETS), 2019, : 660 - 665
  • [22] Data Augmentation and Text Recognition on Khmer Historical Manuscripts
    Valy, Dona
    Verleysen, Michel
    Chhun, Sophea
    2020 17TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR 2020), 2020, : 73 - 78
  • [23] Improving Automated Evaluation of Student Text Responses Using GPT-3.5 for Text Data Augmentation
    Cochran, Keith
    Cohn, Clayton
    Rouet, Jean Francois
    Hastings, Peter
    ARTIFICIAL INTELLIGENCE IN EDUCATION, AIED 2023, 2023, 13916 : 217 - 228
  • [24] Improving Diacritical Arabic Speech Recognition: Transformer-Based Models with Transfer Learning and Hybrid Data Augmentation
    Alaqel, Haifa
    El Hindi, Khalil
    Information (Switzerland), 2025, 16 (03)
  • [25] Arabic Handwritten Word Recognition Using HMMs with Explicit State Duration
    A. Benouareth
    A. Ennaji
    M. Sellami
    EURASIP Journal on Advances in Signal Processing, 2008
  • [26] A Novel Approach of Bangla Handwritten Text Recognition using HMM
    Roy, Partha Pratim
    Dey, Prasenjit
    Roy, Sangheeta
    Pal, Umapada
    Kimura, Fumitaka
    2014 14TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR), 2014, : 661 - 666
  • [27] Script-Level Word Sample Augmentation for Few-Shot Handwritten Text Recognition
    Chen, Wei
    Su, Xiangdong
    Zhang, Haoran
    FRONTIERS IN HANDWRITING RECOGNITION, ICFHR 2022, 2022, 13639 : 316 - 330
  • [28] Improving Automated Evaluation of Formative Assessments with Text Data Augmentation
    Cochran, Keith
    Cohn, Clayton
    Hutchins, Nicole
    Biswas, Gautam
    Hastings, Peter
    ARTIFICIAL INTELLIGENCE IN EDUCATION, PT I, 2022, 13355 : 390 - 401
  • [29] Impact of Data-Augmentation on Brain Tumor Detection Using Different YOLO Versions Models
    Ishtaiwi, Abdelraouf
    Ali, Ali
    Al-Qerem, Ahmed
    Alsmadi, Yazan
    Aldweesh, Amjad
    Alauthman, Mohammed
    Alzubi, Omar
    Nashwan, Shadi
    Ramadan, Awad
    Al-Zghoul, Musab
    Alangari, Someah
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2024, 21 (03) : 466 - 482
  • [30] Improving Handwritten Mathematical Expression Recognition via Integrating Convolutional Neural Network With Transformer and Diffusion-Based Data Augmentation
    Zhang, Yibo
    Li, Gaoxu
    IEEE ACCESS, 2024, 12 : 67945 - 67956