Content and Style Aware Generation of Text-Line Images for Handwriting Recognition

被引:18
|
作者
Kang, Lei [1 ]
Riba, Pau [2 ]
Rusinol, Marcal [3 ]
Fornes, Alicia [4 ]
Villegas, Mauricio [5 ]
机构
[1] Shantou Univ, Comp Sci Dept, Shantou 515063, Peoples R China
[2] Helsing AI, D-80331 Munich, Germany
[3] AllRead MLT, Barcelona 08039, Spain
[4] Univ Autonoma Barcelona, Comp Vis Ctr, Comp Sci Dept, Bellaterra 08193, Spain
[5] Omni Us, D-10559 Berlin, Germany
关键词
Visualization; Text recognition; Writing; Training; Handwriting recognition; Image recognition; Vocabulary; Handwritten text recognition; transformers; generative adversarial networks; synthetic data generation;
D O I
10.1109/TPAMI.2021.3122572
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Handwritten Text Recognition has achieved an impressive performance in public benchmarks. However, due to the high inter- and intra-class variability between handwriting styles, such recognizers need to be trained using huge volumes of manually labeled training data. To alleviate this labor-consuming problem, synthetic data produced with TrueType fonts has been often used in the training loop to gain volume and augment the handwriting style variability. However, there is a significant style bias between synthetic and real data which hinders the improvement of recognition performance. To deal with such limitations, we propose a generative method for handwritten text-line images, which is conditioned on both visual appearance and textual content. Our method is able to produce long text-line samples with diverse handwriting styles. Once properly trained, our method can also be adapted to new target data by only accessing unlabeled text-line images to mimic handwritten styles and produce images with any textual content. Extensive experiments have been done on making use of the generated samples to boost Handwritten Text Recognition performance. Both qualitative and quantitative results demonstrate that the proposed approach outperforms the current state of the art.
引用
收藏
页码:8846 / 8860
页数:15
相关论文
共 8 条
  • [1] Handwriting Text-line Detection and Recognition in Answer Sheet Composition with Few Labeled Data
    Wu, Kunnan
    Fu, Huiyuan
    Li, Wensheng
    PROCEEDINGS OF 2020 IEEE 11TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE (ICSESS 2020), 2020, : 129 - 132
  • [2] Robust text-line and word segmentation for handwritten documents images
    Stafylakis, Themos
    Papavassiliou, Vassilis
    Katsouros, Vassilis
    Carayannis, George
    2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 3393 - 3396
  • [3] EXPERIMENTS WITH HANDWRITING RECOGNITION USING HOLOGRAPHIC REPRESENTATION OF LINE IMAGES
    GORSKY, ND
    PATTERN RECOGNITION LETTERS, 1994, 15 (09) : 853 - 859
  • [4] Text line segmentation and word recognition in a system for general writer independent handwriting recognition
    Marti, UV
    Bunke, H
    SIXTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, PROCEEDINGS, 2001, : 159 - 163
  • [5] Pay attention to what you read: Non-recurrent handwritten text-Line recognition
    Kang, Lei
    Riba, Pau
    Rusinol, Marcal
    Fornes, Alicia
    Villegas, Mauricio
    PATTERN RECOGNITION, 2022, 129
  • [6] Neural Text Line Segmentation of Multilingual Print and Handwriting with Recognition-Based Evaluation
    Schone, Patrick
    Hargraves, Christian
    Morrey, Jon
    Day, Rachael
    Jacox, Mindy
    PROCEEDINGS 2018 16TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR), 2018, : 265 - 272
  • [7] Two-Step CNN Framework for Text Line Recognition in Camera-Captured Images
    Chernyshova, Yulia S.
    Sheshkus, Alexander V.
    Arlazarov, Vladimir V.
    IEEE ACCESS, 2020, 8 : 32587 - 32600
  • [8] A search method for on-line handwritten text employing writing-box-free handwriting recognition
    Oda, H
    Kitadai, A
    Onuma, M
    Nakagawa, M
    NINTH INTERNATIONAL WORKSHOP ON FRONTIERS IN HANDWRITING RECOGNITION, PROCEEDINGS, 2004, : 545 - 550