Generating Text Sequence Images for Recognition

被引：5

作者：

Gong, Yanxiang ^{[1
]}

Deng, Linjie ^{[1
]}

Ma, Zheng ^{[1
]}

Xie, Mei ^{[1
]}

机构：

[1] Univ Elect Sci & Technol China, Sch Informat & Commun Engn, 2006 Xiyuan Ave, Chengdu 611731, Sichuan, Peoples R China

来源：

NEURAL PROCESSING LETTERS | 2020年 / 51卷 / 02期

关键词：

Image generation; Text sequence images; Training data; Text recognition;

D O I：

10.1007/s11063-019-10166-x

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Recently, methods based on deep learning have dominated the field of text recognition. With a large number of training data, most of them can achieve the state-of-the-art performances. However, it is hard to harvest and label sufficient text sequence images from the real scenes. To mitigate this issue, several methods to synthesize text sequence images were proposed, yet they usually need complicated preceding or follow-up steps. In this work, we present a method which is able to generate infinite training data without any auxiliary pre/post-process. We tackle the generation task as an image-to-image translation one and utilize conditional adversarial networks to produce realistic text sequence images in the light of the semantic ones. Some evaluation metrics are involved to assess our method and the results demonstrate that the caliber of the data is satisfactory. The code and dataset will be publicly available soon.

引用

页码：1677 / 1688

页数：12

共 50 条

[41] DPF-S2S: A novel dual-pathway-fusion-based sequence-to-sequence text recognition model
Zhang, Yuqing
Wu, Peishu
Li, Han
Liu, Yurong
Alsaadi, Fuad E.
Zeng, Nianyin
NEUROCOMPUTING, 2023, 523 : 182 - 190
[42] Automatic text segmentation and text recognition for video indexing
Lienhart, R
Effelsberg, W
MULTIMEDIA SYSTEMS, 2000, 8 (01) : 69 - 81
[43] Generating Face Images With Attributes for Free
Liu, Yaoyao
Sun, Qianru
He, Xiangnan
Liu, An-An
Su, Yuting
Chua, Tat-Seng
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 32 (06) : 2733 - 2743
[44] Gujarati Text Recognition: A Review
Kathiriya, Khushali B.
Goswami, Mukesh M.
2019 INNOVATIONS IN POWER AND ADVANCED COMPUTING TECHNOLOGIES (I-PACT), 2019,
[45] Unconstrained Scene Text and Video Text Recognition for Arabic Script
Jain, Mohit
Mathew, Minesh
Jawahar, C. V.
2017 1ST INTERNATIONAL WORKSHOP ON ARABIC SCRIPT ANALYSIS AND RECOGNITION (ASAR), 2017, : 26 - 30
[46] SRR-GAN: Super-Resolution based Recognition with GAN for Low-Resolved Text Images
Xu, Ming-Chao
Yin, Fei
Liu, Cheng-Lin
2020 17TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR 2020), 2020, : 1 - 6
[47] A text reading algorithm for natural images
Gonzalez, Alvaro
Miguel Bergasa, Luis
IMAGE AND VISION COMPUTING, 2013, 31 (03) : 255 - 274
[48] LEARNING TO REMOVE REFLECTIONS FOR TEXT IMAGES
Wang, Ce
Wan, Renjie
Gao, Feng
Shi, Boxin
Duan, Ling-Yu
2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2019, : 1276 - 1281
[49] Text Extraction from Web Images
Liu, Changsong
Yang, Cheng
Ding, Xiaoqing
Fan, Jian
IMAGING AND PRINTING IN A WEB 2.0 WORLD II, 2011, 7879
[50] Switching Text-Based Image Encoders for Captioning Images With Text
Ueda, Arisa
Yang, Wei
Sugiura, Komei
IEEE ACCESS, 2023, 11 : 55706 - 55715

← 1 2 3 4 5 →