A Self-attention Based Model for Offline Handwritten Text Recognition

Cited by: 2
Authors
Nam Tuan Ly [1]
Trung Tan Ngo [1]
Nakagawa, Masaki [1]
Affiliation
[1] Tokyo Univ Agr & Technol, Tokyo, Japan
Source
PATTERN RECOGNITION, ACPR 2021, PT II | 2022 / Vol. 13189
Keywords
Self-attention; Multi-head; Handwritten text recognition; CNN; BLSTM; CTC; SEQUENCE;
DOI
10.1007/978-3-031-02444-3_27
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
Offline handwritten text recognition is an important part of document analysis and has received attention from many researchers for decades. In this paper, we present a self-attention-based model for offline handwritten text-line recognition. The proposed model consists of three main components: a CNN feature extractor; an encoder combining a BLSTM network and a self-attention module; and a CTC decoder. The self-attention module complements the RNN in the encoder and helps the encoder capture long-range and multi-level dependencies across an input sequence. In extensive experiments on two datasets, IAM Handwriting and Kuzushiji, the proposed model achieves better accuracy than the state-of-the-art models. Visualization of the self-attention maps shows that the self-attention mechanism helps the encoder capture long-range and multi-level dependencies across an input sequence.
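The abstract names two building blocks that are easy to illustrate in isolation: self-attention over a feature sequence and CTC decoding. The following is a minimal sketch in plain Python, not the authors' implementation. It assumes single-head scaled dot-product attention with identity query/key/value projections (the paper's keywords mention multi-head attention, and a trained model would use learned projections), and it uses greedy best-path decoding as one common way to read out a CTC output. The CNN and BLSTM stages are omitted, and the function names `self_attention` and `ctc_greedy_decode` are hypothetical.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def self_attention(features):
    """Single-head scaled dot-product self-attention.

    Assumption: queries, keys, and values are the input features themselves
    (identity projections). Returns (outputs, attention_weights).
    """
    dim = len(features[0])
    scale = math.sqrt(dim)
    weights = []
    for query in features:
        # Every time step scores every other time step, so distant
        # positions can influence each other directly.
        scores = [sum(q * k for q, k in zip(query, key)) / scale for key in features]
        weights.append(softmax(scores))
    outputs = [
        [sum(w * value[j] for w, value in zip(row, features)) for j in range(dim)]
        for row in weights
    ]
    return outputs, weights


def ctc_greedy_decode(frame_probs, labels, blank=0):
    """Best-path CTC decoding: pick the top label per frame,
    collapse consecutive repeats, then drop blanks."""
    best_path = [max(range(len(p)), key=p.__getitem__) for p in frame_probs]
    decoded = []
    previous = None
    for index in best_path:
        if index != previous and index != blank:
            decoded.append(labels[index])
        previous = index
    return "".join(decoded)
```

In this sketch `labels` is any indexable sequence whose entry at position `blank` is a placeholder, for example the string `"-ab"` with the blank at index 0. A blank frame between two identical labels is what allows CTC to emit a doubled character.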
Pages: 356-369
Page count: 14