TLWSR: Weakly supervised real-world scene text image super-resolution using text label

Cited by: 1

Authors:

Shi, Qin [1]

Zhu, Yu [1,3]

Fang, Chuantao [1]

Yang, Dawei [1,2]

Affiliations:

[1] East China Univ Sci & Technol, Sch Informat Sci & Engn, Shanghai 200237, Peoples R China

[2] Fudan Univ, Zhongshan Hosp, Dept Pulm & Crit Care Med, Shanghai, Peoples R China

[3] Shanghai Engn Res Ctr Internet Things Resp Med, Shanghai, Peoples R China

Source:

IET IMAGE PROCESSING | 2023 / Vol. 17 / Issue 09

Keywords:

image processing; image resolution; unsupervised learning; NETWORK;

DOI:

10.1049/ipr2.12827

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Scene text image super-resolution (STISR) has recently received considerable attention. Existing STISR methods assume that all low-resolution/high-resolution (LR-HR) pairs are available. In real-world scenarios, however, collecting ground-truth HR labels and aligning them with LR images is difficult and expensive, so a weakly supervised approach is essential. We investigate the STISR problem when only a subset of HR labels is available and design TLWSR, a weak supervision framework that uses coarse-grained text labels and combines incomplete supervision with inexact supervision. Specifically, a lightweight text recognition network and connectionist temporal classification loss guide the super-resolution of text images during training. Extensive experiments on the TextZoom benchmark demonstrate that TLWSR generates distinguishable text images and exceeds the fully supervised baseline TSRN in boosting text recognition accuracy with only 50% of HR labels available. TLWSR can also be applied to different super-resolution backbones and significantly improves their performance. Furthermore, TLWSR generalizes well to low-quality images on scene text recognition benchmarks, which verifies the effectiveness of the framework. To the authors' knowledge, this is the first work exploring STISR in weakly supervised scenarios.
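
As a rough illustration of how incomplete supervision (HR targets for only some samples) and inexact supervision (text labels scored with a connectionist temporal classification loss) might be combined in one training objective, here is a minimal pure-Python sketch. It is not the authors' implementation: the function names, the mean-squared pixel loss, the weight `lam`, and the per-sample dictionary layout are all assumptions made for illustration, and the abstract does not specify any of them.

```python
import math


def ctc_nll(probs, target, blank=0):
    """CTC negative log-likelihood via the standard forward algorithm.

    probs:  list of T per-frame probability distributions over classes
            (assumed here to come from a text recognizer run on the SR image).
    target: list of class indices for the text label (no blanks).
    """
    # Interleave blanks around the label: blank, c1, blank, c2, ..., blank
    ext = [blank]
    for c in target:
        ext += [c, blank]
    size = len(ext)

    alpha = [0.0] * size
    alpha[0] = probs[0][ext[0]]
    if size > 1:
        alpha[1] = probs[0][ext[1]]

    for t in range(1, len(probs)):
        new = [0.0] * size
        for s in range(size):
            total = alpha[s]
            if s >= 1:
                total += alpha[s - 1]
            # Skipping the blank is allowed only between different characters
            if s >= 2 and ext[s] != blank and ext[s] != ext[s - 2]:
                total += alpha[s - 2]
            new[s] = total * probs[t][ext[s]]
        alpha = new

    likelihood = alpha[-1] + (alpha[-2] if size > 1 else 0.0)
    return math.inf if likelihood == 0.0 else -math.log(likelihood)


def pixel_mse(sr, hr):
    """Mean squared error between flattened SR and HR pixel lists."""
    return sum((a - b) ** 2 for a, b in zip(sr, hr)) / len(sr)


def weakly_supervised_loss(batch, lam=1.0):
    """Hypothetical combined loss over a batch of samples.

    Each sample is a dict with keys:
      "sr"    flattened SR output pixels
      "hr"    flattened HR target pixels, or None when no HR label exists
      "probs" recognizer output on the SR image (T x classes)
      "text"  text label as class indices
    The text term applies to every sample; the pixel term applies only
    where an HR label is available.
    """
    total = 0.0
    for sample in batch:
        total += lam * ctc_nll(sample["probs"], sample["text"])
        if sample["hr"] is not None:
            total += pixel_mse(sample["sr"], sample["hr"])
    return total / len(batch)
```

In this sketch a sample without an HR image still contributes a training signal through its text label, which is the general idea the abstract describes; how TLWSR actually weights or schedules the two terms is not stated in this record.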

Pages: 2780-2790

Number of pages: 11

