Parametric loss-based super-resolution for scene text recognition

被引：0

作者：

Viriyavisuthisakul, Supatta ^{[1
,2
]}

Sanguansat, Parinya ^{[3
]}

Racharak, Teeradaj ^{[2
]}

Le Nguyen, Minh ^{[2
]}

Kaothanthong, Natsuda ^{[1
]}

Haruechaiyasak, Choochart ^{[4
]}

Yamasaki, Toshihiko ^{[5
]}

机构：

[1] Thammasat Univ, Sirindhorn Int Inst Technol, Sch Management Technol, Khlong Luang 12000, Pathum Thani, Thailand

[2] Japan Adv Inst Informat Technol, Sch Informat Sci, Nomi, Ishikawa 9231211, Japan

[3] Panyapiwat Inst Management, Fac Engn & Technol, Nonthaburi 11120, Thailand

[4] Natl Elect & Comp Technol Ctr, Khlong Luang 10400, Pathum Thani, Thailand

[5] Univ Tokyo, Dept Informat & Commun Engn, Tokyo 1138656, Japan

来源：

MACHINE VISION AND APPLICATIONS | 2023年 / 34卷 / 04期

关键词：

Scene text image; Super-resolution; Parametric; Regularization; Loss function; CONVOLUTIONAL NETWORK; IMAGE;

D O I：

10.1007/s00138-023-01416-z

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Scene text image super-resolution (STISR) is regarded as the process of improving the image quality of low-resolution scene text images to improve text recognition accuracy. Recently, a text attention network was introduced to reconstruct high-resolution scene text images; the backbone method involved the convolutional neural network-based and transformer-based architecture. Although it can deal with rotated and curved-shaped texts, it still cannot properly handle images containing improper-shaped texts and blurred text regions. This can lead to incorrect text predictions during the text recognition step. In this study, we propose the application of multiple parametric regularizations and parametric weight parameters to the loss function of the STISR method to improve scene text image quality and text recognition accuracy. We design and extend it into three types of methods: adding multiple parametric regularizations, modifying parametric weight parameters, and combining parametric weights and multiple parametric regularizations. Experiments were conducted and compared with state-of-the-art models. The results showed a significant improvement for every proposed method. Moreover, our methods generated clearer and sharper edges than the baseline with a better-quality image score.

引用

页数：14

共 45 条

[1]

Amor, 2020, 34 C NEURAL INFORM P, P1

[2] Single Image Super-Resolution via a Holistic Attention Network [J].

Niu, Ben ;

Wen, Weilei ;

Ren, Wenqi ;

Zhang, Xiangde ;

Yang, Lianping ;

Wang, Shuzhen ;

Zhang, Kaihao ;

Cao, Xiaochun ;

Shen, Haifeng .

COMPUTER VISION - ECCV 2020, PT XII, 2020, 12357 :191-207

[3] Toward Real-World Single Image Super-Resolution: A New Benchmark and A New Model [J].

Cai, Jianrui ;

Zeng, Hui ;

Yong, Hongwei ;

Cao, Zisheng ;

Zhang, Lei .

2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :3086-3095

[4] Pre-Trained Image Processing Transformer [J].

Chen, Hanting ;

Wang, Yunhe ;

Guo, Tianyu ;

Xu, Chang ;

Deng, Yiping ;

Liu, Zhenhua ;

Ma, Siwei ;

Xu, Chunjing ;

Xu, Chao ;

Gao, Wen .

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :12294-12305

[5] Real-world single image super-resolution: A brief review [J].

Chen, Honggang ;

He, Xiaohai ;

Qing, Linbo ;

Wu, Yuanyuan ;

Ren, Chao ;

Sheriff, Ray E. ;

Zhu, Ce .

INFORMATION FUSION, 2022, 79 :124-145

[6] Scene Text Telescope: Text-Focused Scene Image Super-Resolution [J].

Chen, Jingye ;

Li, Bin ;

Xue, Xiangyang .

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :12021-12030

[7] RSRGAN: computationally efficient real-world single image super-resolution using generative adversarial network [J].

Chudasama, Vishal ;

Upla, Kishor .

MACHINE VISION AND APPLICATIONS, 2020, 32 (01)

[8] Image Super-Resolution Using Deep Convolutional Networks [J].

Dong, Chao ;

Loy, Chen Change ;

He, Kaiming ;

Tang, Xiaoou .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2016, 38 (02) :295-307

[9] Learning a Deep Convolutional Network for Image Super-Resolution [J].

Dong, Chao ;

Loy, Chen Change ;

He, Kaiming ;

Tang, Xiaoou .

COMPUTER VISION - ECCV 2014, PT IV, 2014, 8692 :184-199

[10] Deep Residual Learning for Image Recognition [J].

He, Kaiming ;

Zhang, Xiangyu ;

Ren, Shaoqing ;

Sun, Jian .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778

← 1 2 3 4 5 →