Generating Text Sequence Images for Recognition

被引：5

作者：

Gong, Yanxiang ^{[1
]}

Deng, Linjie ^{[1
]}

Ma, Zheng ^{[1
]}

Xie, Mei ^{[1
]}

机构：

[1] Univ Elect Sci & Technol China, Sch Informat & Commun Engn, 2006 Xiyuan Ave, Chengdu 611731, Sichuan, Peoples R China

来源：

NEURAL PROCESSING LETTERS | 2020年 / 51卷 / 02期

关键词：

Image generation; Text sequence images; Training data; Text recognition;

D O I：

10.1007/s11063-019-10166-x

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Recently, methods based on deep learning have dominated the field of text recognition. With a large number of training data, most of them can achieve the state-of-the-art performances. However, it is hard to harvest and label sufficient text sequence images from the real scenes. To mitigate this issue, several methods to synthesize text sequence images were proposed, yet they usually need complicated preceding or follow-up steps. In this work, we present a method which is able to generate infinite training data without any auxiliary pre/post-process. We tackle the generation task as an image-to-image translation one and utilize conditional adversarial networks to produce realistic text sequence images in the light of the semantic ones. Some evaluation metrics are involved to assess our method and the results demonstrate that the caliber of the data is satisfactory. The code and dataset will be publicly available soon.

引用

页码：1677 / 1688

页数：12

共 50 条

[21] A Framework of Text Detection and Recognition from Natural Images for Mobile Device
Selmi, Zied
Ben Halima, Mohamed
Wali, Ali
Alimi, Adel M.
NINTH INTERNATIONAL CONFERENCE ON MACHINE VISION (ICMV 2016), 2017, 10341
[22] Text detection, recognition, and script identification in natural scene images: a Review
Veronica Naosekpam
Nilkanta Sahu
International Journal of Multimedia Information Retrieval, 2022, 11 : 291 - 314
[23] Text detection, recognition, and script identification in natural scene images: a Review
Naosekpam, Veronica
Sahu, Nilkanta
INTERNATIONAL JOURNAL OF MULTIMEDIA INFORMATION RETRIEVAL, 2022, 11 (03) : 291 - 314
[24] An Algorithm for Natural Images Text Recognition Using Four Direction Features
Zhang, Min
Yan, Yujin
Wang, Hai
Zhao, Wei
ELECTRONICS, 2019, 8 (09)
[25] Extraction and Recognition of Multi-oriented Text from Trademark Images
Tripathi, Priyanka
Indoria, Ajay Kumar
2015 INTERNATIONAL CONFERENCE ON COGNITIVE COMPUTING AND INFORMATION PROCESSING (CCIP), 2015,
[26] Character Sequence Prediction Method for Training Data Creation in the Task of Text Recognition
Zlobin, Pavel K.
Chernyshova, Yulia S.
Sheshkus, Alexander, V
Arlazarov, Vladimir V.
FOURTEENTH INTERNATIONAL CONFERENCE ON MACHINE VISION (ICMV 2021), 2022, 12084
[27] Content and Style Aware Generation of Text-Line Images for Handwriting Recognition
Kang, Lei
Riba, Pau
Rusinol, Marcal
Fornes, Alicia
Villegas, Mauricio
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (12) : 8846 - 8860
[28] A Plant Disease Recognition Method Based on Fusion of Images and Graph Structure Text
Wang, Chunshan
Zhou, Ji
Zhang, Yan
Wu, Huarui
Zhao, Chunjiang
Teng, Guifa
Li, Jiuxi
FRONTIERS IN PLANT SCIENCE, 2022, 12
[29] Text Detection and Recognition for Images of Medical Laboratory Reports With a Deep Learning Approach
Xue, Wenyuan
Li, Qingyong
Xue, Qiyuan
IEEE ACCESS, 2020, 8 (08): : 407 - 416
[30] End-to-End Analysis for Text Detection and Recognition in Natural Scene Images
Alnefaie, Ahlam
Gupta, Deepak
Bhuyan, Monowar H.
Razzak, Imran
Gupta, Prashant
Prasad, Mukesh
2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,

← 1 2 3 4 5 →