Improving Handwritten Arabic Text Recognition Using an Adaptive Data-Augmentation Algorithm

Cited by: 3
Authors
Eltay, Mohamed [1]
Zidouri, Abdelmalek [1]
Ahmad, Irfan [2]
Elarian, Yousef [3]
Affiliations
[1] King Fahd Univ Petr & Minerals, Interdisciplinary Res Ctr Intelligent Secure Syst, Elect Engn Dept, Dhahran, Saudi Arabia
[2] King Fahd Univ Petr & Minerals, Interdisciplinary Res Ctr Intelligent Secure Syst, Informat & Comp Sci Dept, Dhahran, Saudi Arabia
[3] Cambrian Coll, Sudbury, ON, Canada
Source
DOCUMENT ANALYSIS AND RECOGNITION, ICDAR 2021 WORKSHOPS, PT I | 2021 / Vol. 12916
Keywords
Handwriting recognition; Deep Learning Neural Network; Data augmentation; Recurrent Neural Network; Connectionist temporal classification;
DOI
10.1007/978-3-030-86198-8_23
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Deep learning has improved the performance of classification and object detection, but it generally requires large amounts of labeled training data. In this paper, we introduce a new data augmentation algorithm that promotes diversity among classes, which represent the characters of the Arabic script, and balances the number of samples across classes. The algorithm assigns each word in the lexicon a weight based on the occurrence probabilities of the characters that make up the word. Minority classes receive higher weights than classes that occur frequently in the text. We evaluated the data augmentation technique on a handwritten word recognition task using the publicly available IFN/ENIT and AHDB datasets. The technique yields a significant improvement in results, and we achieve state-of-the-art results on both datasets.
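The weighting idea in the abstract can be illustrated with a minimal sketch. The abstract does not give the exact formula, so everything here is an assumption made for illustration: the function names `char_probabilities` and `word_weights` are hypothetical, a word's raw weight is taken to be the mean inverse occurrence probability of its characters, and weights are normalized to sum to 1. The paper's actual weighting may differ.

```python
from collections import Counter


def char_probabilities(lexicon):
    """Estimate each character's occurrence probability over the lexicon."""
    counts = Counter(ch for word in lexicon for ch in word)
    total = sum(counts.values())
    return {ch: n / total for ch, n in counts.items()}


def word_weights(lexicon):
    """Assign each (non-empty) word a normalized weight.

    Assumption: the raw weight is the mean inverse probability of the
    word's characters, so words containing rare characters score higher.
    This is an illustrative choice, not the formula from the paper.
    """
    probs = char_probabilities(lexicon)
    raw = {
        word: sum(1.0 / probs[ch] for ch in word) / len(word)
        for word in lexicon
    }
    norm = sum(raw.values())
    return {word: value / norm for word, value in raw.items()}


# Toy lexicon with Latin letters standing in for Arabic characters.
lexicon = ["aab", "aaa", "abz"]
weights = word_weights(lexicon)
for word in sorted(weights, key=weights.get, reverse=True):
    print(word, round(weights[word], 3))
```

In this toy lexicon, the word containing the rarest character (`z`) receives the largest weight, so a sampler driven by these weights would generate more augmented samples for it, which is the balancing effect the abstract describes.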
Pages: 322-335
Page count: 14