Text Prior Guided Scene Text Image Super-Resolution

Cited by: 39
Authors
Ma, Jianqi [1]
Guo, Shi [1]
Zhang, Lei [1]
Affiliation
[1] Hong Kong Polytech Univ, Dept Comp, Hong Kong, Peoples R China
Keywords
Scene text image super-resolution; super-resolution; text prior; NETWORK; RECOGNITION;
DOI
10.1109/TIP.2023.3237002
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Scene text image super-resolution (STISR) aims to improve the resolution and visual quality of low-resolution (LR) scene text images while simultaneously boosting the performance of text recognition. However, most existing STISR methods treat text images as natural scene images and ignore the categorical information of text. In this paper, we attempt to embed a text recognition prior into the STISR model. Specifically, we adopt the predicted character recognition probability sequence as the text prior, which can be obtained conveniently from a text recognition model. The text prior provides categorical guidance for recovering high-resolution (HR) text images. In turn, the reconstructed HR image can refine the text prior. Building on this mutual refinement, we present a multi-stage text prior guided super-resolution (TPGSR) framework for STISR. Our experiments on the benchmark TextZoom dataset show that TPGSR not only improves the visual quality of scene text images but also significantly improves text recognition accuracy over existing STISR methods. Our model trained on TextZoom also generalizes to some extent to LR images in other datasets. The source code of our work is available.
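The alternation described in the abstract (recognizer output guides super-resolution, and the super-resolved image refines the recognizer output) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: `toy_recognizer` and `toy_guided_sr` are hypothetical stand-ins for the trained networks, and the alphabet size (37), prior length (26), feature height (16), and the choice to re-run super-resolution from the LR input at every stage are all assumptions made only for illustration.

```python
import numpy as np

NUM_CLASSES = 37  # assumption: 26 letters + 10 digits + 1 blank
SEQ_LEN = 26      # assumption: number of steps in the prior sequence
FEAT_H = 16       # assumption: fixed feature height used by the toy recognizer


def toy_recognizer(image, weights):
    """Hypothetical recognizer: returns a (SEQ_LEN, NUM_CLASSES) probability sequence."""
    h, w = image.shape
    # Sample a fixed FEAT_H x SEQ_LEN grid so any image size is accepted.
    rows = np.linspace(0, h - 1, FEAT_H).astype(int)
    cols = np.linspace(0, w - 1, SEQ_LEN).astype(int)
    feats = image[np.ix_(rows, cols)].T            # (SEQ_LEN, FEAT_H)
    logits = feats @ weights                       # (SEQ_LEN, NUM_CLASSES)
    logits = logits - logits.max(axis=1, keepdims=True)
    probs = np.exp(logits)
    return probs / probs.sum(axis=1, keepdims=True)  # softmax per step


def toy_guided_sr(lr, prior, scale=2):
    """Hypothetical SR step: nearest-neighbour upsampling, with contrast
    boosted per column according to the prior's confidence."""
    up = np.repeat(np.repeat(lr, scale, axis=0), scale, axis=1)
    confidence = prior.max(axis=1)                 # (SEQ_LEN,)
    col_pos = np.linspace(0, SEQ_LEN - 1, up.shape[1])
    gain = 1.0 + np.interp(col_pos, np.arange(SEQ_LEN), confidence)
    mean = up.mean()
    return np.clip(mean + (up - mean) * gain[None, :], 0.0, 1.0)


def multi_stage_sr(lr, weights, stages=3, scale=2):
    """Alternate between estimating the text prior and guided SR."""
    current = lr
    priors = []
    for _ in range(stages):
        prior = toy_recognizer(current, weights)   # prior from latest estimate
        priors.append(prior)
        current = toy_guided_sr(lr, prior, scale)  # SR guided by that prior
    return current, priors


rng = np.random.default_rng(0)
lr_image = rng.random((16, 64))                    # toy LR text image in [0, 1]
recognizer_weights = rng.normal(size=(FEAT_H, NUM_CLASSES))
sr_image, priors = multi_stage_sr(lr_image, recognizer_weights, stages=3)
```

In this sketch the first stage derives the prior from the LR image itself, and each later stage derives it from the previous stage's output, which is one plausible reading of "the reconstructed HR image can refine the text prior in return."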
Pages: 1341-1353
Page count: 13