A Self-attention Based Model for Offline Handwritten Text Recognition

被引:2
|
作者
Nam Tuan Ly [1 ]
Trung Tan Ngo [1 ]
Nakagawa, Masaki [1 ]
机构
[1] Tokyo Univ Agr & Technol, Tokyo, Japan
来源
PATTERN RECOGNITION, ACPR 2021, PT II | 2022年 / 13189卷
关键词
Self-attention; Multi-head; Handwritten text recognition; CNN; BLSTM; CTC; SEQUENCE;
D O I
10.1007/978-3-031-02444-3_27
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Offline handwritten text recognition is an important part of document analysis and it has been receiving a lot of attention from numerous researchers for decades. In this paper, we present a self-attention-based model for offline handwritten textline recognition. The proposed model consists of three main components: a feature extractor by CNN; an encoder by a BLSTM network and a self-attention module; and a decoder by CTC. The self-attention module is complementary to RNN in the encoder and helps the encoder to capture long-range and multi-level dependencies across an input sequence. According to the extensive experiments on the two datasets of IAM Handwriting and Kuzushiji, the proposed model achieves better accuracy than the state-of-the-art models. The self-attention map visualization shows that the self-attention mechanism helps the encoder capture long-range and multi-level dependencies across an input sequence.
引用
收藏
页码:356 / 369
页数:14
相关论文
共 50 条
  • [41] A Class Balanced Spatio-Temporal Self-Attention Model for Combat Intention Recognition
    Wang, Xuan
    Jin, Benzhou
    Jia, Mingyang
    Wu, Gang
    Zhang, Xiaofei
    IEEE ACCESS, 2024, 12 : 112074 - 112084
  • [42] Re-Transformer: A Self-Attention Based Model for Machine Translation
    Liu, Huey-Ing
    Chen, Wei-Lin
    AI IN COMPUTATIONAL LINGUISTICS, 2021, 189 : 3 - 10
  • [43] Solar irradiance prediction based on self-attention recursive model network
    Kang, Ting
    Wang, Huaizhi
    Wu, Ting
    Peng, Jianchun
    Jiang, Hui
    FRONTIERS IN ENERGY RESEARCH, 2022, 10
  • [44] High Performance Offline Handwritten Chinese Text Recognition with a New Data Preprocessing and Augmentation Pipeline
    Xie, Canyu
    Lai, Songxuan
    Liao, Qianying
    Jin, Lianwen
    DOCUMENT ANALYSIS SYSTEMS, 2020, 12116 : 45 - 59
  • [45] Best Practices for a Handwritten Text Recognition System
    Retsinas, George
    Sfikas, Giorgos
    Gatos, Basilis
    Nikou, Christophoros
    DOCUMENT ANALYSIS SYSTEMS, DAS 2022, 2022, 13237 : 247 - 259
  • [46] Research on Offline Handwritten Chinese Character Recognition Based on Deep Learning
    Hao, Qiuyun
    Wu, Xiaoming
    Zhang, Sen
    Zhang, Peng
    Ma, Xiaofeng
    Jiang, Jingsai
    2019 9TH INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND TECHNOLOGY (ICIST2019), 2019, : 470 - 474
  • [47] Discrete representation learning for handwritten text recognition
    Davoudi, Homa
    Traviglia, Arianna
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (21) : 15759 - 15773
  • [48] Handwritten Text Recognition for Bengali
    Andreu Sanchez, Joan
    Pal, Umapada
    PROCEEDINGS OF 2016 15TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR), 2016, : 542 - 547
  • [49] Masked face recognition with convolutional visual self-attention network
    Ge, Yiming
    Liu, Hui
    Du, Junzhao
    Li, Zehua
    Wei, Yuheng
    NEUROCOMPUTING, 2023, 518 : 496 - 506
  • [50] HAN: An efficient hierarchical self-attention network for skeleton-based gesture recognition
    Liu, Jianbo
    Wang, Ying
    Xiang, Shiming
    Pan, Chunhong
    PATTERN RECOGNITION, 2025, 162