A Self-attention Based Model for Offline Handwritten Text Recognition

被引:2
|
作者
Nam Tuan Ly [1 ]
Trung Tan Ngo [1 ]
Nakagawa, Masaki [1 ]
机构
[1] Tokyo Univ Agr & Technol, Tokyo, Japan
来源
关键词
Self-attention; Multi-head; Handwritten text recognition; CNN; BLSTM; CTC; SEQUENCE;
D O I
10.1007/978-3-031-02444-3_27
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Offline handwritten text recognition is an important part of document analysis and it has been receiving a lot of attention from numerous researchers for decades. In this paper, we present a self-attention-based model for offline handwritten textline recognition. The proposed model consists of three main components: a feature extractor by CNN; an encoder by a BLSTM network and a self-attention module; and a decoder by CTC. The self-attention module is complementary to RNN in the encoder and helps the encoder to capture long-range and multi-level dependencies across an input sequence. According to the extensive experiments on the two datasets of IAM Handwriting and Kuzushiji, the proposed model achieves better accuracy than the state-of-the-art models. The self-attention map visualization shows that the self-attention mechanism helps the encoder capture long-range and multi-level dependencies across an input sequence.
引用
收藏
页码:356 / 369
页数:14
相关论文
共 50 条
  • [21] Hidden Markov model-based ensemble methods for offline handwritten text line recognition
    Bertolami, Roman
    Bunke, Horst
    PATTERN RECOGNITION, 2008, 41 (11) : 3452 - 3460
  • [22] Deep Neural Network based Hidden Markov Model for Offline Handwritten Chinese Text Recognition
    Du, Jun
    Wang, Zi-Rui
    Zhai, Jian-Fang
    Hu, Jin-Shui
    2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 3428 - 3433
  • [23] Self-attention based Text Knowledge Mining for Text Detection
    Wan, Qi
    Ji, Haoqin
    Shen, Linlin
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 5979 - 5988
  • [24] Offline recognition of large vocabulary cursive handwritten text
    Vinciarelli, A
    Bengio, S
    Bunke, H
    SEVENTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS I AND II, PROCEEDINGS, 2003, : 1101 - 1105
  • [25] Rejection strategies for offline handwritten text line recognition
    Bertolami, Roman
    Zimmermann, Matthias
    Bunke, Horst
    PATTERN RECOGNITION LETTERS, 2006, 27 (16) : 2005 - 2012
  • [26] Parsimonious HMMs for Offline Handwritten Chinese Text Recognition
    Wang, Wenchao
    Du, Jun
    Wang, Zi-Rui
    PROCEEDINGS 2018 16TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR), 2018, : 145 - 150
  • [27] Offline Handwritten Quranic Text Recognition: A Research Perspective
    Iqbal, Arshad
    Zafar, Aasim
    PROCEEDINGS 2019 AMITY INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (AICAI), 2019, : 125 - 128
  • [28] Speech Emotion Recognition Based on Self-Attention Weight Correction for Acoustic and Text Features
    Santoso, Jennifer
    Yamada, Takeshi
    Ishizuka, Kenkichi
    Hashimoto, Taiichi
    Makino, Shoji
    IEEE ACCESS, 2022, 10 : 115732 - 115743
  • [29] Named entity recognition for Chinese marine text with knowledge-based self-attention
    He, Shufeng
    Sun, Dianqi
    Wang, Zhao
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (14) : 19135 - 19149
  • [30] Named entity recognition for Chinese marine text with knowledge-based self-attention
    Shufeng He
    Dianqi Sun
    Zhao Wang
    Multimedia Tools and Applications, 2022, 81 : 19135 - 19149