New MDLSTM-based designs with data augmentation for offline Arabic handwriting recognition

被引:0
作者
Rania Maalej
Monji Kherallah
机构
[1] University of Sfax,National School of Engineers of Sfax
[2] University of Sfax,Faculty of Sciences
来源
Multimedia Tools and Applications | 2022年 / 81卷
关键词
Data augmentation; MDLSTM; Dropout; ReLU; Maxout; Offline arabic handwriting recognition;
D O I
暂无
中图分类号
学科分类号
摘要
Although deep learning techniques have achieved promising results in several fields including healthcare, monitoring, and smart cities, their application in handwriting recognition shows limited results, especially for the offline Arabic handwritten script. Therefore, there is a need to enhance existing deep learning architectures. Moreover, the Multi-Dimensional Long Short-Term Memory (MDLSTM) leverages the LSTM model by replacing the single recurring connection with as many connections as there are spatiotemporal dimensions in the data. These connections allow the network to create a flexible internal context representation that is robust to local distortions. In this context, MDLSTM based architecture has been explored and three new architectures with Dropout, ReLU, and Maxout are proposed for offline Arabic handwriting recognition. Moreover, data augmentation approach has been applied to validate the proposed models. Indeed, a new dataset is developed by some morphological operations applied to the existing dataset “IFN/ENIT”. The experimental results show that our models outperform existing ones and the best accuracy of 92.59% was recorded with the MDLSTM-CTC-Maxout model trained with the original IFN/ENIT dataset. Moreover, data augmentation improves the MDLSTM-CTC-Maxout proposed model’s accuracy to reach 93.46%.
引用
收藏
页码:10243 / 10260
页数:17
相关论文
共 50 条
  • [31] SNR-Selection-Based-Data Augmentation for Dysarthric Speech Recognition
    Nawroly, Sarkhell Sirwan
    Popescu, Decebal Gheorghe
    Antony, Mariya Celin Thekekara
    Philominal, Actlin Jeeva Muthu
    STUDIES IN INFORMATICS AND CONTROL, 2023, 32 (04): : 129 - 140
  • [32] Data Augmentation Based on Frequency Warping for Recognition of Cleft Palate Speech
    Fujiwara, Kento
    Takashima, Ryoichi
    Sugiyama, Chihiro
    Tanaka, Nobukazu
    Nohara, Kanji
    Nozaki, Kazunori
    Takiguchi, Tetsuya
    2021 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2021, : 471 - 476
  • [33] A GAN-Based Data Augmentation Method for Multimodal Emotion Recognition
    Luo, Yun
    Zhu, Li-Zhen
    Lu, Bao-Liang
    ADVANCES IN NEURAL NETWORKS - ISNN 2019, PT I, 2019, 11554 : 141 - 150
  • [34] Strong Generalized Speech Emotion Recognition Based on Effective Data Augmentation
    Tao, Huawei
    Shan, Shuai
    Hu, Ziyi
    Zhu, Chunhua
    Ge, Hongyi
    ENTROPY, 2023, 25 (01)
  • [35] Glyph-Based Data Augmentation for Accurate Kanji Character Recognition
    Ofusa, Kenichiro
    Miyazaki, Tomo
    Sugaya, Yoshihiro
    Omachi, Shinichiro
    2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1, 2017, : 597 - 602
  • [36] Generative Adversarial Network (GAN) based Data Augmentation for Palmprint Recognition
    Wang, Gengxing
    Kang, Wenxiong
    Wu, Qiuxia
    Wang, Zhiyong
    Gao, Junbin
    2018 INTERNATIONAL CONFERENCE ON DIGITAL IMAGE COMPUTING: TECHNIQUES AND APPLICATIONS (DICTA), 2018, : 156 - 162
  • [37] Recognition of Diabetic Retinopathy Grades Based on Data Augmentation and Attention Mechanisms
    Li, Xueri
    Wen, Li
    Du, Fanyu
    Yang, Lei
    Wu, Jianfang
    INTERNATIONAL JOURNAL OF IMAGING SYSTEMS AND TECHNOLOGY, 2024, 34 (06)
  • [38] ALDANER: Active Learning based Data Augmentation for Named Entity Recognition
    Moscato, Vincenzo
    Postiglione, Marco
    Sperli, Giancarlo
    Vignali, Andrea
    KNOWLEDGE-BASED SYSTEMS, 2024, 305
  • [39] CNN-based data augmentation for handwritten gurumukhi text recognition
    Sareen, Bhavna
    Ahuja, Rakesh
    Singh, Amitoj
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (28) : 71035 - 71053
  • [40] Underwater Acoustic Target Recognition Based on Data Augmentation and Residual CNN
    Yao, Qihai
    Wang, Yong
    Yang, Yixin
    ELECTRONICS, 2023, 12 (05)