Generating Text Sequence Images for Recognition

Cited by: 5

Authors:

Gong, Yanxiang [1]

Deng, Linjie [1]

Ma, Zheng [1]

Xie, Mei [1]

Affiliation:

[1] University of Electronic Science and Technology of China, School of Information and Communication Engineering, 2006 Xiyuan Ave, Chengdu 611731, Sichuan, People's Republic of China

Source:

Neural Processing Letters | 2020, Vol. 51, No. 2

Keywords:

Image generation; Text sequence images; Training data; Text recognition

DOI:

10.1007/s11063-019-10166-x

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

Recently, deep learning methods have dominated the field of text recognition. Given a large amount of training data, most of them achieve state-of-the-art performance. However, it is hard to collect and label enough text sequence images from real scenes. To mitigate this issue, several methods for synthesizing text sequence images have been proposed, yet they usually require complicated pre-processing or post-processing steps. In this work, we present a method that can generate an unlimited amount of training data without any auxiliary pre- or post-processing. We treat the generation task as image-to-image translation and use conditional adversarial networks to produce realistic text sequence images from semantic ones. Several evaluation metrics are used to assess the method, and the results demonstrate that the quality of the generated data is satisfactory. The code and dataset will be made publicly available soon.

Pages: 1677-1688

Page count: 12
