Generating Text Sequence Images for Recognition

被引:5
|
作者
Gong, Yanxiang [1 ]
Deng, Linjie [1 ]
Ma, Zheng [1 ]
Xie, Mei [1 ]
机构
[1] Univ Elect Sci & Technol China, Sch Informat & Commun Engn, 2006 Xiyuan Ave, Chengdu 611731, Sichuan, Peoples R China
关键词
Image generation; Text sequence images; Training data; Text recognition;
D O I
10.1007/s11063-019-10166-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recently, methods based on deep learning have dominated the field of text recognition. With a large number of training data, most of them can achieve the state-of-the-art performances. However, it is hard to harvest and label sufficient text sequence images from the real scenes. To mitigate this issue, several methods to synthesize text sequence images were proposed, yet they usually need complicated preceding or follow-up steps. In this work, we present a method which is able to generate infinite training data without any auxiliary pre/post-process. We tackle the generation task as an image-to-image translation one and utilize conditional adversarial networks to produce realistic text sequence images in the light of the semantic ones. Some evaluation metrics are involved to assess our method and the results demonstrate that the caliber of the data is satisfactory. The code and dataset will be publicly available soon.
引用
收藏
页码:1677 / 1688
页数:12
相关论文
共 50 条
  • [41] DPF-S2S: A novel dual-pathway-fusion-based sequence-to-sequence text recognition model
    Zhang, Yuqing
    Wu, Peishu
    Li, Han
    Liu, Yurong
    Alsaadi, Fuad E.
    Zeng, Nianyin
    NEUROCOMPUTING, 2023, 523 : 182 - 190
  • [42] Automatic text segmentation and text recognition for video indexing
    Lienhart, R
    Effelsberg, W
    MULTIMEDIA SYSTEMS, 2000, 8 (01) : 69 - 81
  • [43] Generating Face Images With Attributes for Free
    Liu, Yaoyao
    Sun, Qianru
    He, Xiangnan
    Liu, An-An
    Su, Yuting
    Chua, Tat-Seng
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 32 (06) : 2733 - 2743
  • [44] Gujarati Text Recognition: A Review
    Kathiriya, Khushali B.
    Goswami, Mukesh M.
    2019 INNOVATIONS IN POWER AND ADVANCED COMPUTING TECHNOLOGIES (I-PACT), 2019,
  • [45] Unconstrained Scene Text and Video Text Recognition for Arabic Script
    Jain, Mohit
    Mathew, Minesh
    Jawahar, C. V.
    2017 1ST INTERNATIONAL WORKSHOP ON ARABIC SCRIPT ANALYSIS AND RECOGNITION (ASAR), 2017, : 26 - 30
  • [46] SRR-GAN: Super-Resolution based Recognition with GAN for Low-Resolved Text Images
    Xu, Ming-Chao
    Yin, Fei
    Liu, Cheng-Lin
    2020 17TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR 2020), 2020, : 1 - 6
  • [47] A text reading algorithm for natural images
    Gonzalez, Alvaro
    Miguel Bergasa, Luis
    IMAGE AND VISION COMPUTING, 2013, 31 (03) : 255 - 274
  • [48] LEARNING TO REMOVE REFLECTIONS FOR TEXT IMAGES
    Wang, Ce
    Wan, Renjie
    Gao, Feng
    Shi, Boxin
    Duan, Ling-Yu
    2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2019, : 1276 - 1281
  • [49] Text Extraction from Web Images
    Liu, Changsong
    Yang, Cheng
    Ding, Xiaoqing
    Fan, Jian
    IMAGING AND PRINTING IN A WEB 2.0 WORLD II, 2011, 7879
  • [50] Switching Text-Based Image Encoders for Captioning Images With Text
    Ueda, Arisa
    Yang, Wei
    Sugiura, Komei
    IEEE ACCESS, 2023, 11 : 55706 - 55715