Text Prior Guided Scene Text Image Super-Resolution

被引：31

作者：

Ma, Jianqi ^{[1
]}

Guo, Shi ^{[1
]}

Zhang, Lei ^{[1
]}

机构：

[1] Hong Kong Polytech Univ, Dept Comp, Hong Kong, Peoples R China

来源：

IEEE TRANSACTIONS ON IMAGE PROCESSING | 2023年 / 32卷

关键词：

Scene text image super-resolution; super-resolution; text prior; NETWORK; RECOGNITION;

D O I：

10.1109/TIP.2023.3237002

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Scene text image super-resolution (STISR) aims to improve the resolution and visual quality of low-resolution (LR) scene text images, while simultaneously boost the performance of text recognition. However, most of the existing STISR methods regard text images as natural scene images, ignoring the categorical information of text. In this paper, we make an inspiring attempt to embed text recognition prior into STISR model. Specifically, we adopt the predicted character recognition probability sequence as the text prior, which can be obtained conveniently from a text recognition model. The text prior provides categorical guidance to recover high-resolution (HR) text images. On the other hand, the reconstructed HR image can refine the text prior in return. Finally, we present a multi-stage text prior guided super-resolution (TPGSR) framework for STISR. Our experiments on the benchmark TextZoom dataset show that TPGSR can not only effectively improve the visual quality of scene text images, but also significantly improve the text recognition accuracy over existing STISR methods. Our model trained on TextZoom also demonstrates certain generalization capability to the LR images in other datasets. The source code of our work is available

引用

页码：1341 / 1353

页数：13

共 50 条

[1] More and Less: Enhancing Abundance and Refining Redundancy for Text-Prior-Guided Scene Text Image Super-Resolution
Yang, Wei
Luo, Yihong
Ibrayim, Mayire
Hamdulla, Askar
[J]. DOCUMENT ANALYSIS AND RECOGNITION-ICDAR 2024, PT V, 2024, 14808 : 129 - 146
[2] GARDEN: Generative Prior Guided Network for Scene Text Image Super-Resolution
Kong, Yuxin
Ma, Weihong
Jin, Lianwen
Xue, Yang
[J]. DOCUMENT ANALYSIS AND RECOGNITION-ICDAR 2024, PT V, 2024, 14808 : 196 - 214
[3] Perceiving Multiple Representations for scene text image super-resolution guided by text recognizer
Shi, Qin
Zhu, Yu
Liu, Yatong
Ye, Jiongyao
Yang, Dawei
[J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 124
[4] Text Image Super-Resolution Guided by Text Structure and Embedding Priors
Huang, Cong
Peng, Xiulian
Liu, Dong
Lu, Yan
[J]. ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 19 (06)
[5] Scene Text Telescope: Text-Focused Scene Image Super-Resolution
Chen, Jingye
Li, Bin
Xue, Xiangyang
[J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 12021 - 12030
[6] Text Gestalt: Stroke-Aware Scene Text Image Super-resolution
Chen, Jingye
Yu, Haiyang
Ma, Jianqi
Li, Bin
Xue, Xiangyang
[J]. THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 285 - 293
[7] Batch-transformer for scene text image super-resolution
Sun, Yaqi
Xie, Xiaolan
Li, Zhi
Yang, Kai
[J]. VISUAL COMPUTER, 2024, 40 (10) : 7399 - 7409
[8] HiREN: Towards higher supervision quality for better scene text image super-resolution
Zhao, Minyi
Xu, Yi
Li, Bingjia
Wang, Jie
Guan, Jihong
Zhou, Shuigeng
[J]. NEUROCOMPUTING, 2025, 623
[9] Scene Text Image Super-Resolution Via Semantic Distillation and Text Perceptual Loss
Zhao, Cairong
Shu, Rui
Feng, Shuyang
Zhu, Liang
Wang, Xuekuan
[J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2025, 27 : 1153 - 1164
[10] Advancing scene text image super-resolution via edge enhancement priors
Li, Hongjun
Li, Shangfeng
[J]. SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (11) : 8241 - 8250

← 1 2 3 4 5 →