Training an End-to-End Model for Offline Handwritten Japanese Text Recognition by Generated Synthetic Patterns

被引:17
|
作者
Nam Tuan Ly [1 ]
Cuong Tuan Nguyen [1 ]
Nakagawa, Masaki [1 ]
机构
[1] Tokyo Univ Agr & Technol, Dept Comp & Informat Sci, 2-24-16 Naka Cho, Koganei, Tokyo 1848588, Japan
关键词
Handwritten Japanese Text Recognition; End-to-End Model; CNN; BLSTM; Synthetic Image Generation;
D O I
10.1109/ICFHR-2018.2018.00022
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents an end-to-end model of Deep Convolutional Recurrent Network (DCRN) for recognizing offline handwritten Japanese text lines. The end-to-end DCRN model has three parts: a convolutional feature extractor using Deep Convolutional Neural Network (DCNN) to extract a feature sequence from a text line image; recurrent layers employing a Deep Bidirectional LSTM to predict pre-frame from the feature sequence; and a transcription layer using Connectionist Temporal Classification (CTC) to convert the pre-frame predictions into the label sequence. Since our end-to-end model requires a large data for training, we synthesize handwritten text line images from sentences in corpora and handwritten character patterns in the Nakayosi and Kuchibue database with elastic distortions. In the experiment, we evaluate the performance of the end-to-end model and the effectiveness of the synthetic data generation method on the test set of the TUAT Kondate database. The results of the experiments show that our end-to-end model achieves higher than the state-of-the-art recognition accuracy on the test set of TUAT Kondate with 96.35% and 98.05% character level recognition accuracies without and with the generated synthetic data, respectively.
引用
收藏
页码:74 / 79
页数:6
相关论文
共 50 条
  • [1] Training an End-to-End System for Handwritten Mathematical Expression Recognition by Generated Patterns
    Anh Duc Le
    Nakagawa, Masaki
    2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1, 2017, : 1056 - 1061
  • [2] Joint Recognition of Handwritten Text and Named Entities with a Neural End-to-end Model
    Carbonell, Manuel
    Villegas, Mauricio
    Fornes, Alicia
    Llados, Josep
    2018 13TH IAPR INTERNATIONAL WORKSHOP ON DOCUMENT ANALYSIS SYSTEMS (DAS), 2018, : 399 - 404
  • [3] End-to-End page-Level assessment of handwritten text recognition
    Vidal, Enrique
    Toselli, Alejandro H.
    Rios-Vila, Antonio
    Calvo-Zaragoza, Jorge
    PATTERN RECOGNITION, 2023, 142
  • [4] Improvement of End-to-End Offline Handwritten Mathematical Expression Recognition by Weakly Supervised Learning
    Thanh-Nghia Truong
    Cuong Tuan Nguyen
    Khanh Minh Phan
    Nakagawa, Masaki
    2020 17TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR 2020), 2020, : 181 - 186
  • [5] End-to-End Historical Handwritten Ethiopic Text Recognition Using Deep Learning
    Malhotra, Ruchika
    Addis, Maru Tesfaye
    IEEE ACCESS, 2023, 11 : 99535 - 99545
  • [6] An end-to-end handwritten text recognition method using residual attention networks
    Wang Y.-T.
    Zheng H.
    Chang H.-Y.
    Li S.
    Kongzhi yu Juece/Control and Decision, 2023, 38 (07): : 1825 - 1834
  • [7] End-to-End Handwritten Paragraph Text Recognition Using a Vertical Attention Network
    Coquenet, Denis
    Chatelain, Clement
    Paquet, Thierry
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (01) : 508 - 524
  • [8] Multitask Training with Text Data for End-to-End Speech Recognition
    Wang, Peidong
    Sainath, Tara N.
    Weiss, Ron J.
    INTERSPEECH 2021, 2021, : 2566 - 2570
  • [9] End-to-end attention convolutional recurrent network for online handwritten Chinese text recognition
    Qu, Xiwen
    Wu, Zhihong
    Huang, Jun
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (23) : 62541 - 62558
  • [10] End-to-end Handwritten Chinese Paragraph Text Recognition Using Residual Attention Networks
    Wang, Yintong
    Yang, Yingjie
    Chen, Haiyan
    Zheng, Hao
    Chang, Heyou
    INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2022, 34 (01): : 371 - 388