Handwritten text recognition and information extraction from ancient manuscripts using deep convolutional and recurrent neural network

Cited by: 0
Authors
El Bahi, Hassan [1]
Affiliation
[1] L2IS, Laboratory of Computer and Systems Engineering, Cadi Ayyad University, B.P. 511, Marrakech
Keywords
Ancient manuscripts; Convolutional neural network; Handwritten text recognition; Named entity recognition; Recurrent neural network
DOI
10.1007/s00500-024-09930-6
Abstract
Digitizing ancient manuscripts and making them accessible to a broader audience is a crucial step in unlocking the wealth of information they hold. However, automatic recognition of handwritten text and extraction of relevant information such as named entities from these manuscripts are among the most difficult research topics, owing to factors such as poor manuscript quality, complex backgrounds, ink stains, and cursive handwriting. To meet these challenges, we propose two systems. The first performs handwritten text recognition (HTR) in ancient manuscripts. It starts with a preprocessing operation; a convolutional neural network (CNN) then extracts the features of each input image; finally, a recurrent neural network (RNN) with Long Short-Term Memory (LSTM) blocks and a Connectionist Temporal Classification (CTC) layer predicts the text contained in the image. The second system recognizes named entities and the relationships among words directly from images of old manuscripts, bypassing the need for an intermediate text transcription step. Like the first, it starts with a preprocessing step. Data augmentation is then used to enlarge the training dataset, after which a CNN model automatically extracts the most relevant features. Finally, a bidirectional LSTM recognizes the named entities and the relationships between word images. Extensive experiments on the ESPOSALLES dataset demonstrate that the proposed systems achieve state-of-the-art performance, exceeding existing systems. © The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature 2024.
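The abstract states that the first system ends in a CTC layer but does not say how its per-frame outputs are turned into text. As an illustration only, the sketch below shows best-path (greedy) CTC decoding, one common choice: take the top-scoring label at each time step, merge consecutive repeats, then drop the blank symbol. The function name `ctc_greedy_decode`, the toy label set, and the score values are hypothetical and are not taken from the paper.

```python
from itertools import groupby


def ctc_greedy_decode(frame_scores, labels, blank=0):
    """Best-path CTC decoding (illustrative; not the paper's decoder).

    frame_scores: one list of per-label scores for each time step.
    labels: label strings, where labels[blank] is the CTC blank.
    """
    # Pick the highest-scoring label index at every time step.
    best_path = [
        max(range(len(scores)), key=scores.__getitem__)
        for scores in frame_scores
    ]
    # Merge consecutive repeats, then remove the blank symbol.
    collapsed = [idx for idx, _ in groupby(best_path) if idx != blank]
    return "".join(labels[idx] for idx in collapsed)


# Toy example (assumed values): index 0 is the blank.
labels = ["-", "a", "n"]
frames = [
    [0.1, 0.8, 0.1],    # a
    [0.2, 0.7, 0.1],    # a (repeat, merged with the previous frame)
    [0.1, 0.1, 0.8],    # n
    [0.9, 0.05, 0.05],  # blank separates the two n's
    [0.1, 0.2, 0.7],    # n
    [0.1, 0.8, 0.1],    # a
]
decoded = ctc_greedy_decode(frames, labels)
print(decoded)
```

The blank frame between the two `n` frames is what lets a doubled letter survive the merge step; without it, adjacent identical predictions collapse into a single character.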
Pages: 12249-12268
Page count: 19
Related papers
50 in total
  • [21] End-to-end attention convolutional recurrent network for online handwritten Chinese text recognition
    Qu, Xiwen
    Wu, Zhihong
    Huang, Jun
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (23) : 62541 - 62558
  • [22] Recognition of Urdu Handwritten Alphabet Using Convolutional Neural Network (CNN)
    Ahmed, Gulzar
    Alyas, Tahir
    Iqbal, Muhammad Waseem
    Ashraf, Muhammad Usman
    Alghamdi, Ahmed Mohammed
    Bahaddad, Adel A.
    Almarhabi, Khalid Ali
    CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 73 (02): : 2967 - 2984
  • [23] Speech Emotion Recognition Using Deep Convolutional Neural Network and Simple Recurrent Unit
    Jiang, Pengxu
    Fu, Hongliang
    Tao, Huawei
    ENGINEERING LETTERS, 2019, 27 (04) : 901 - 906
  • [25] A Dynamic Emotion Recognition System Based on Convolutional Feature Extraction and Recurrent Neural Network
    Yin, Yida
    Ayoub, Misbah
    Abel, Andrew
    Zhang, Haiyang
    INTELLIGENT SYSTEMS AND APPLICATIONS, VOL 2, 2023, 543 : 134 - 154
  • [26] Pioneer dataset and automatic recognition of Urdu handwritten characters using a deep autoencoder and convolutional neural network
    Ali, Hazrat
    Ullah, Ahsan
    Iqbal, Talha
    Khattak, Shahid
    SN APPLIED SCIENCES, 2020, 2 (02):
  • [27] Handwritten Digit Recognition Based on Convolutional Neural Network
    Zhang, Chao
    Zhou, Zhiyao
    Lin, Lan
    2020 CHINESE AUTOMATION CONGRESS (CAC 2020), 2020, : 7384 - 7388
  • [28] HANDWRITTEN CHARACTER RECOGNITION WITH SEQUENTIAL CONVOLUTIONAL NEURAL NETWORK
    Liu, Caihua
    Liu, Jie
    Yu, Fang
    Huang, Yalou
    Chen, Jimeng
    PROCEEDINGS OF 2013 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS (ICMLC), VOLS 1-4, 2013, : 291 - 296
  • [29] Image Augmentation by Blocky Artifact in Deep Convolutional Neural Network for Handwritten Digit Recognition
    Shopon, Md
    Mohammed, Nabeel
    Abedin, Md Anowarul
    2017 IEEE INTERNATIONAL CONFERENCE ON IMAGING, VISION & PATTERN RECOGNITION (ICIVPR), 2017,
  • [30] Handwritten Text Recognition using Deep Learning
    Nikitha, A.
    Geetha, J.
    JayaLakshmi, D. S.
    2020 5TH IEEE INTERNATIONAL CONFERENCE ON RECENT TRENDS ON ELECTRONICS, INFORMATION, COMMUNICATION & TECHNOLOGY (RTEICT-2020), 2020, : 388 - 392