Improving Handwritten Arabic Text Recognition Using an Adaptive Data-Augmentation Algorithm

被引:3
作者
Eltay, Mohamed [1 ]
Zidouri, Abdelmalek [1 ]
Ahmad, Irfan [2 ]
Elarian, Yousef [3 ]
机构
[1] King Fahd Univ Petr & Minerals, Interdisciplinary Res Ctr Intelligent Secure Syst, Elect Engn Dept, Dhahran, Saudi Arabia
[2] King Fahd Univ Petr & Minerals, Interdisciplinary Res Ctr Intelligent Secure Syst, Informat & Comp Sci Dept, Dhahran, Saudi Arabia
[3] Cambrian Coll, Sudbury, ON, Canada
来源
DOCUMENT ANALYSIS AND RECOGNITION, ICDAR 2021 WORKSHOPS, PT I | 2021年 / 12916卷
关键词
Handwriting recognition; Deep Learning Neural Network; Data augmentation; Recurrent Neural Network; Connectionist temporal classification;
D O I
10.1007/978-3-030-86198-8_23
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep learning has increased the performance of classification and object detection, but it generally requires large amounts of labeled data for training. In this paper, we introduce a new data augmentation algorithm that promotes diversity between classes, representing the characters of the Arabic script, and can balance samples between different classes. This algorithm gives each word in the lexicon a weight. The weight of a word is based on the occurrence probabilities of the characters constituting the word. Minority classes are given higher weight as compared to the classes frequently occurring in the text. The data augmentation technique was evaluated on a handwritten word recognition task using the publicly available IFN/ENIT and AHDB datasets. We see significant improvement in results by employing our data augmentation technique, and we achieve state-of-the-art results on both datasets.
引用
收藏
页码:322 / 335
页数:14
相关论文
共 50 条
  • [31] You Do Not Need More Data: Improving End-To-End Speech Recognition by Text-To-Speech Data Augmentation
    Laptev, Aleksandr
    Korostik, Roman
    Svischev, Aleksey
    Andrusenko, Andrei
    Medennikov, Ivan
    Rybin, Sergey
    [J]. 2020 13TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, BIOMEDICAL ENGINEERING AND INFORMATICS (CISP-BMEI 2020), 2020, : 439 - 444
  • [32] Improving Attention-Based Handwritten Mathematical Expression Recognition with Scale Augmentation and Drop Attention
    Li, Zhe
    Jin, Lianwen
    Lai, Songxuan
    Zhu, Yecheng
    [J]. 2020 17TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR 2020), 2020, : 175 - 180
  • [33] TA-DA! - Improving Activity Recognition Using Temporal Adapters and Data Augmentation
    Hopp, Maximilian
    Hartleb, Helge
    Burchard, Robin
    [J]. COMPANION OF THE 2024 ACM INTERNATIONAL JOINT CONFERENCE ON PERVASIVE AND UBIQUITOUS COMPUTING, UBICOMP COMPANION 2024, 2024, : 551 - 554
  • [34] An Explainable Deep Learning-Based Method for Schizophrenia Diagnosis Using Generative Data-Augmentation
    Saadatinia, Mehrshad
    Salimi-Badr, Armin
    [J]. IEEE ACCESS, 2024, 12 : 98379 - 98392
  • [35] Handwritten Urdu Characters and Digits Recognition Using Transfer Learning and Augmentation With AlexNet
    Rasheed, Aqsa
    Ali, Nouman
    Zafar, Bushra
    Shabbir, Amsa
    Sajid, Muhammad
    Mahmood, Muhammad Tariq
    [J]. IEEE ACCESS, 2022, 10 : 102629 - 102645
  • [36] Improving DRS-to-Text Generation Through Delexicalization and Data Augmentation
    Amin, Muhammad Saad
    Anselma, Luca
    Mazzei, Alessandro
    [J]. NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS, PT I, NLDB 2024, 2024, 14762 : 121 - 136
  • [37] Improving Named Entity Recognition for Social Media with Data Augmentation
    Liu, Wenzhong
    Cui, Xiaohui
    [J]. APPLIED SCIENCES-BASEL, 2023, 13 (09):
  • [38] Improving Speech Emotion Recognition With Adversarial Data Augmentation Network
    Yi, Lu
    Mak, Man-Wai
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (01) : 172 - 184
  • [39] A Data Augmentation Approach for Improving the Performance of Speech Emotion Recognition
    Paraskevopoulou, Georgia
    Spyrou, Evaggelos
    Perantonis, Stavros
    [J]. SIGMAP: PROCEEDINGS OF THE 19TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND MULTIMEDIA APPLICATIONS, 2022, : 61 - 69
  • [40] SIGN LANGUAGE RECOGNITION BASED ON ADAPTIVE HMMS WITH DATA AUGMENTATION
    Guo, Dan
    Zhou, Wengang
    Wang, Meng
    Lie, Houqiang
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2016, : 2876 - 2880