Generating Text Sequence Images for Recognition

被引:5
|
作者
Gong, Yanxiang [1 ]
Deng, Linjie [1 ]
Ma, Zheng [1 ]
Xie, Mei [1 ]
机构
[1] Univ Elect Sci & Technol China, Sch Informat & Commun Engn, 2006 Xiyuan Ave, Chengdu 611731, Sichuan, Peoples R China
关键词
Image generation; Text sequence images; Training data; Text recognition;
D O I
10.1007/s11063-019-10166-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recently, methods based on deep learning have dominated the field of text recognition. With a large number of training data, most of them can achieve the state-of-the-art performances. However, it is hard to harvest and label sufficient text sequence images from the real scenes. To mitigate this issue, several methods to synthesize text sequence images were proposed, yet they usually need complicated preceding or follow-up steps. In this work, we present a method which is able to generate infinite training data without any auxiliary pre/post-process. We tackle the generation task as an image-to-image translation one and utilize conditional adversarial networks to produce realistic text sequence images in the light of the semantic ones. Some evaluation metrics are involved to assess our method and the results demonstrate that the caliber of the data is satisfactory. The code and dataset will be publicly available soon.
引用
收藏
页码:1677 / 1688
页数:12
相关论文
共 50 条
  • [21] A Framework of Text Detection and Recognition from Natural Images for Mobile Device
    Selmi, Zied
    Ben Halima, Mohamed
    Wali, Ali
    Alimi, Adel M.
    NINTH INTERNATIONAL CONFERENCE ON MACHINE VISION (ICMV 2016), 2017, 10341
  • [22] Text detection, recognition, and script identification in natural scene images: a Review
    Veronica Naosekpam
    Nilkanta Sahu
    International Journal of Multimedia Information Retrieval, 2022, 11 : 291 - 314
  • [23] Text detection, recognition, and script identification in natural scene images: a Review
    Naosekpam, Veronica
    Sahu, Nilkanta
    INTERNATIONAL JOURNAL OF MULTIMEDIA INFORMATION RETRIEVAL, 2022, 11 (03) : 291 - 314
  • [24] An Algorithm for Natural Images Text Recognition Using Four Direction Features
    Zhang, Min
    Yan, Yujin
    Wang, Hai
    Zhao, Wei
    ELECTRONICS, 2019, 8 (09)
  • [25] Extraction and Recognition of Multi-oriented Text from Trademark Images
    Tripathi, Priyanka
    Indoria, Ajay Kumar
    2015 INTERNATIONAL CONFERENCE ON COGNITIVE COMPUTING AND INFORMATION PROCESSING (CCIP), 2015,
  • [26] Character Sequence Prediction Method for Training Data Creation in the Task of Text Recognition
    Zlobin, Pavel K.
    Chernyshova, Yulia S.
    Sheshkus, Alexander, V
    Arlazarov, Vladimir V.
    FOURTEENTH INTERNATIONAL CONFERENCE ON MACHINE VISION (ICMV 2021), 2022, 12084
  • [27] Content and Style Aware Generation of Text-Line Images for Handwriting Recognition
    Kang, Lei
    Riba, Pau
    Rusinol, Marcal
    Fornes, Alicia
    Villegas, Mauricio
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (12) : 8846 - 8860
  • [28] A Plant Disease Recognition Method Based on Fusion of Images and Graph Structure Text
    Wang, Chunshan
    Zhou, Ji
    Zhang, Yan
    Wu, Huarui
    Zhao, Chunjiang
    Teng, Guifa
    Li, Jiuxi
    FRONTIERS IN PLANT SCIENCE, 2022, 12
  • [29] Text Detection and Recognition for Images of Medical Laboratory Reports With a Deep Learning Approach
    Xue, Wenyuan
    Li, Qingyong
    Xue, Qiyuan
    IEEE ACCESS, 2020, 8 (08): : 407 - 416
  • [30] End-to-End Analysis for Text Detection and Recognition in Natural Scene Images
    Alnefaie, Ahlam
    Gupta, Deepak
    Bhuyan, Monowar H.
    Razzak, Imran
    Gupta, Prashant
    Prasad, Mukesh
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,