Improving CNN-RNN Hybrid Networks for Handwriting Recognition

被引:85
作者
Dutta, Kartik [1 ]
Krishnan, Praveen [1 ]
Mathew, Minesh [1 ]
Jawahar, C. V. [1 ]
机构
[1] IIIT Hyderabad, CVIT, Hyderabad, Telangana, India
来源
PROCEEDINGS 2018 16TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR) | 2018年
关键词
Handwriting recognition; CNN-RNN network; Data augmentation; Image pre-processing;
D O I
10.1109/ICFHR-2018.2018.00023
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The success of deep learning based models have centered around recent architectures and the availability of large scale annotated data. In this work, we explore these two factors systematically for improving handwritten recognition for scanned off-line document images. We propose a modified CNN-RNN hybrid architecture with a major focus on effective training using: (i) efficient initialization of network using synthetic data for pre-training, (ii) image normalization for slant correction and (iii) domain specific data transformation and distortion for learning important invariances. We perform a detailed ablation study to analyze the contribution of individual modules and present state of art results for the task of unconstrained line and word recognition on popular datasets such as IAM, RIMES and GW.
引用
收藏
页码:80 / 85
页数:6
相关论文
共 36 条
  • [1] Word Spotting and Recognition with Embedded Attributes
    Almazan, Jon
    Gordo, Albert
    Fornes, Alicia
    Valveny, Ernest
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2014, 36 (12) : 2552 - 2566
  • [2] [Anonymous], IJPRAI
  • [3] [Anonymous], ICFHR
  • [4] [Anonymous], THESIS
  • [5] [Anonymous], ARXIV160804224
  • [6] [Anonymous], CBMS
  • [7] [Anonymous], 2016, EUR C COMP VIS
  • [8] [Anonymous], 2016, INT C FRONT HANDWR R
  • [9] [Anonymous], IJDAR
  • [10] Balakrishnan N., 2006, TCDL