Generating Text Sequence Images for Recognition

被引:5
|
作者
Gong, Yanxiang [1 ]
Deng, Linjie [1 ]
Ma, Zheng [1 ]
Xie, Mei [1 ]
机构
[1] Univ Elect Sci & Technol China, Sch Informat & Commun Engn, 2006 Xiyuan Ave, Chengdu 611731, Sichuan, Peoples R China
关键词
Image generation; Text sequence images; Training data; Text recognition;
D O I
10.1007/s11063-019-10166-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recently, methods based on deep learning have dominated the field of text recognition. With a large number of training data, most of them can achieve the state-of-the-art performances. However, it is hard to harvest and label sufficient text sequence images from the real scenes. To mitigate this issue, several methods to synthesize text sequence images were proposed, yet they usually need complicated preceding or follow-up steps. In this work, we present a method which is able to generate infinite training data without any auxiliary pre/post-process. We tackle the generation task as an image-to-image translation one and utilize conditional adversarial networks to produce realistic text sequence images in the light of the semantic ones. Some evaluation metrics are involved to assess our method and the results demonstrate that the caliber of the data is satisfactory. The code and dataset will be publicly available soon.
引用
收藏
页码:1677 / 1688
页数:12
相关论文
共 50 条
  • [31] A Parallel Text Recognition in Electrical Equipment Nameplate Images Based on Apache Flink
    Liu, Zhen
    Li, Lin
    Zhang, Da
    Liu, Liangshuai
    Deng, Ze
    JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2023, 32 (07)
  • [32] Urdu-Text Detection and Recognition in Natural Scene Images Using Deep Learning
    Arafat, Syed Yasser
    Iqbal, Muhammad Javed
    IEEE ACCESS, 2020, 8 : 96787 - 96803
  • [33] Text Select-Backdoor: Selective Backdoor Attack for Text Recognition Systems
    Kwon, Hyun
    Baek, Jang-Woon
    IEEE ACCESS, 2024, 12 : 170688 - 170698
  • [34] Convolution Neural Network Based Deep Features for Text Recognition in Multi-Type Images
    Raghunandan, K. S.
    Kumara, Chethana B. M.
    Kumar, G. Hemantha
    Sunil, C.
    2018 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2018, : 502 - 507
  • [35] A new multi-modal approach to bib number/text detection and recognition in Marathon images
    Shivakumara, Palaiahnakote
    Raghavendra, R.
    Qin, Longfei
    Raja, Kiran B.
    Lu, Tong
    Pal, Umapada
    PATTERN RECOGNITION, 2017, 61 : 479 - 491
  • [36] Two-Step CNN Framework for Text Line Recognition in Camera-Captured Images
    Chernyshova, Yulia S.
    Sheshkus, Alexander V.
    Arlazarov, Vladimir V.
    IEEE ACCESS, 2020, 8 : 32587 - 32600
  • [37] Text recognition in document images obtained by a smartphone based on deep convolutional and recurrent neural network
    El Bahi, Hassan
    Zatni, Abdelkarim
    MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (18) : 26453 - 26481
  • [38] Text recognition in document images obtained by a smartphone based on deep convolutional and recurrent neural network
    Hassan El Bahi
    Abdelkarim Zatni
    Multimedia Tools and Applications, 2019, 78 : 26453 - 26481
  • [39] Rethinking text rectification for scene text recognition
    Ke, Wenjun
    Wei, Jianguo
    Hou, Qingzhi
    Feng, Hui
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 219
  • [40] Cursive Text Recognition in Natural Scene Images Using Deep Convolutional Recurrent Neural Network
    Chandio, Asghar Ali
    Asikuzzaman, MD.
    Pickering, Mark R.
    Leghari, Mehwish
    IEEE ACCESS, 2022, 10 : 10062 - 10078