Training an End-to-End Model for Offline Handwritten Japanese Text Recognition by Generated Synthetic Patterns

Cited by: 17
Authors
Nam Tuan Ly [1]
Cuong Tuan Nguyen [1]
Nakagawa, Masaki [1]
Affiliations
[1] Tokyo Univ Agr & Technol, Dept Comp & Informat Sci, 2-24-16 Naka Cho, Koganei, Tokyo 1848588, Japan
Source
PROCEEDINGS 2018 16TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR) | 2018
Keywords
Handwritten Japanese Text Recognition; End-to-End Model; CNN; BLSTM; Synthetic Image Generation;
DOI
10.1109/ICFHR-2018.2018.00022
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
This paper presents an end-to-end Deep Convolutional Recurrent Network (DCRN) model for recognizing offline handwritten Japanese text lines. The end-to-end DCRN model has three parts: a convolutional feature extractor that uses a Deep Convolutional Neural Network (DCNN) to extract a feature sequence from a text line image; recurrent layers that employ a Deep Bidirectional LSTM to make per-frame predictions from the feature sequence; and a transcription layer that uses Connectionist Temporal Classification (CTC) to convert the per-frame predictions into the label sequence. Since our end-to-end model requires a large amount of training data, we synthesize handwritten text line images from sentences in corpora and handwritten character patterns in the Nakayosi and Kuchibue databases, applying elastic distortions. In the experiments, we evaluate the performance of the end-to-end model and the effectiveness of the synthetic data generation method on the test set of the TUAT Kondate database. The results show that our end-to-end model surpasses the state-of-the-art recognition accuracy on the TUAT Kondate test set, achieving character-level recognition accuracies of 96.35% without and 98.05% with the generated synthetic data.
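The transcription step described in the abstract, turning per-frame predictions into a label sequence, can be illustrated with a minimal sketch of CTC best-path (greedy) decoding. This is an assumption for illustration only: the abstract does not state which CTC decoding strategy the authors use, and the blank index `BLANK = 0`, the function name `ctc_greedy_decode`, and the toy score matrix are all hypothetical.

```python
from itertools import groupby

BLANK = 0  # assumed index of the CTC blank label (not specified in the abstract)


def ctc_greedy_decode(frame_scores, blank=BLANK):
    """Best-path CTC decoding: a sketch, not the authors' implementation.

    frame_scores: one list of per-label scores for each frame.
    Returns the decoded label sequence as a list of label indices.
    """
    # 1. Pick the highest-scoring label independently for every frame.
    best_path = [
        max(range(len(scores)), key=scores.__getitem__)
        for scores in frame_scores
    ]
    # 2. Collapse runs of identical consecutive labels into one label.
    collapsed = [label for label, _ in groupby(best_path)]
    # 3. Remove the blank label, which only separates repeated characters.
    return [label for label in collapsed if label != blank]


# Toy example with three labels: 0 = blank, 1 and 2 = characters.
toy_scores = [
    [0.1, 0.8, 0.1],    # frame 1: label 1
    [0.1, 0.7, 0.2],    # frame 2: label 1 (repeat, collapsed)
    [0.9, 0.05, 0.05],  # frame 3: blank
    [0.2, 0.7, 0.1],    # frame 4: label 1 (kept, separated by blank)
    [0.1, 0.2, 0.7],    # frame 5: label 2
]
decoded = ctc_greedy_decode(toy_scores)
```

In this toy case the blank between the second and fourth frames is what allows the same character to appear twice in a row in the output; without it the two occurrences would be merged by the collapse step.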
Pages: 74-79
Number of pages: 6