Text Prior Guided Scene Text Image Super-Resolution

被引：39

作者：

Ma, Jianqi ^{[1
]}

Guo, Shi ^{[1
]}

Zhang, Lei ^{[1
]}

机构：

[1] Hong Kong Polytech Univ, Dept Comp, Hong Kong, Peoples R China

来源：

IEEE TRANSACTIONS ON IMAGE PROCESSING | 2023年 / 32卷

关键词：

Scene text image super-resolution; super-resolution; text prior; NETWORK; RECOGNITION;

D O I：

10.1109/TIP.2023.3237002

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Scene text image super-resolution (STISR) aims to improve the resolution and visual quality of low-resolution (LR) scene text images, while simultaneously boost the performance of text recognition. However, most of the existing STISR methods regard text images as natural scene images, ignoring the categorical information of text. In this paper, we make an inspiring attempt to embed text recognition prior into STISR model. Specifically, we adopt the predicted character recognition probability sequence as the text prior, which can be obtained conveniently from a text recognition model. The text prior provides categorical guidance to recover high-resolution (HR) text images. On the other hand, the reconstructed HR image can refine the text prior in return. Finally, we present a multi-stage text prior guided super-resolution (TPGSR) framework for STISR. Our experiments on the benchmark TextZoom dataset show that TPGSR can not only effectively improve the visual quality of scene text images, but also significantly improve the text recognition accuracy over existing STISR methods. Our model trained on TextZoom also demonstrates certain generalization capability to the LR images in other datasets. The source code of our work is available

引用

页码：1341 / 1353

页数：13

共 50 条

[41] ADVERSARIAL TEXT IMAGE SUPER-RESOLUTION USING SINKHORN DISTANCE [J].

Geng, Cong ;

Chen, Li ;

Zhang, Xiaoyun ;

Gao, Zhiyong .

2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, :2663-2667

[42] Scene Text Image Super-Resolution Reconstruction Based on Perceiving Multi-Domain Character Distance [J].

Huang, Jun-Yang ;

Chen, Hong-Hui ;

Wang, Jia-Bao ;

Chen, Ping-Ping ;

Lin, Zhi-Jian .

Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2024, 52 (07) :2262-2270

[43] Rectification and Super-Resolution Enhancements for Forensic Text Recognition [J].

Blanco-Medina, Pablo ;

Fidalgo, Eduardo ;

Alegre, Enrique ;

Alaiz-Rodriguez, Rocio ;

Janez-Martino, Francisco ;

Bonnici, Alexandra .

SENSORS, 2020, 20 (20) :1-17

[44] Super-Resolution of Text Image Based on Conditional Generative Adversarial Network [J].

Wang, Yuyang ;

Ding, Wenjun ;

Su, Feng .

ADVANCES IN MULTIMEDIA INFORMATION PROCESSING, PT III, 2018, 11166 :270-281

[45] Pixel-Level Degradation for Text Image Super-Resolution and Recognition [J].

Qian, Xiaohong ;

Xie, Lifeng ;

Ye, Ning ;

Le, Renlong ;

Yang, Shengying .

ELECTRONICS, 2023, 12 (21)

[46] CNN-Based Text Image Super-Resolution Tailored for OCR [J].

Zhang, Haochen ;

Liu, Dong ;

Xiong, Zhiwei .

2017 IEEE VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2017,

[47] Coarse-to-fine text injecting for realistic image super-resolution [J].

Chen, Xiaoyu ;

Bai, Chao ;

Wu, Zhenyao ;

Wu, Xinyi ;

Zou, Qi ;

Xia, Yong ;

Wang, Song .

NEUROCOMPUTING, 2025, 626

[48] Scene text image super-resolution using multi-scale convolutional neural network with skip connections [J].

Walha, Rim ;

Aouini, Amal .

APPLIED INTELLIGENCE, 2024, :5931-5943

[49] Single-Character-Based Embedding Feature Aggregation Using Cross-Attention for Scene Text Super-Resolution [J].

Wang, Meng ;

Li, Qianqian ;

Liu, Haipeng .

SENSORS, 2025, 25 (07)

[50] PlugNet: Degradation Aware Scene Text Recognition Supervised by a Pluggable Super-Resolution Unit [J].

Mou, Yongqiang ;

Tan, Lei ;

Yang, Hui ;

Chen, Jingying ;

Liu, Leyuan ;

Yan, Rui ;

Huang, Yaohong .

COMPUTER VISION - ECCV 2020, PT XV, 2020, 12360 :158-174

← 1 2 3 4 5 →