Gradient-Based Graph Attention for Scene Text Image Super-resolution

被引：0

作者：

Zhu, Xiangyuan ^{[1
]}

Guo, Kehua ^{[1
]}

Fang, Hui ^{[2
]}

Ding, Rui ^{[1
]}

Wu, Zheng ^{[1
]}

Schaefer, Gerald ^{[2
]}

机构：

[1] Cent South Univ, Sch Comp Sci & Engn, Changsha, Peoples R China

[2] Loughborough Univ, Dept Comp Sci, Loughborough, Leics, England

来源：

THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 3 | 2023年

基金：

美国国家科学基金会;

关键词：

NETWORK;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Scene text image super-resolution (STISR) in the wild has been shown to be beneficial to support improved vision-based text recognition from low-resolution imagery. An intuitive way to enhance STISR performance is to explore the well-structured and repetitive layout characteristics of text and exploit these as prior knowledge to guide model convergence. In this paper, we propose a novel gradient-based graph attention method to embed patch-wise text layout contexts into image feature representations for high-resolution text image reconstruction in an implicit and elegant manner. We introduce a non-local group-wise attention module to extract text features which are then enhanced by a cascaded channel attention module and a novel gradient-based graph attention module in order to obtain more effective representations by exploring correlations of regional and local patch-wise text layout properties. Extensive experiments on the benchmark TextZoom dataset convincingly demonstrate that our method supports excellent text recognition and outperforms the current state-of-the-art in STISR. The source code is available at https://github.com/xyzhu1/TSAN.

引用

页码：3861 / 3869

页数：9

共 33 条

[1] Fast, Accurate, and Lightweight Super-Resolution with Cascading Residual Network [J].

Ahn, Namhyuk ;

Kang, Byungkon ;

Sohn, Kyung-Ah .

COMPUTER VISION - ECCV 2018, PT X, 2018, 11214 :256-272

[2] Single Image Super-Resolution via a Holistic Attention Network [J].

Niu, Ben ;

Wen, Weilei ;

Ren, Wenqi ;

Zhang, Xiangde ;

Yang, Lianping ;

Wang, Shuzhen ;

Zhang, Kaihao ;

Cao, Xiaochun ;

Shen, Haifeng .

COMPUTER VISION - ECCV 2020, PT XII, 2020, 12357 :191-207

[3] Toward Real-World Single Image Super-Resolution: A New Benchmark and A New Model [J].

Cai, Jianrui ;

Zeng, Hui ;

Yong, Hongwei ;

Cao, Zisheng ;

Zhang, Lei .

2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :3086-3095

[4]

Chen JY, 2022, AAAI CONF ARTIF INTE, P285

[5] Scene Text Telescope: Text-Focused Scene Image Super-Resolution [J].

Chen, Jingye ;

Li, Bin ;

Xue, Xiangyang .

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :12021-12030

[6] AON: Towards Arbitrarily-Oriented Text Recognition [J].

Cheng, Zhanzhan ;

Xu, Yangliu ;

Bai, Fan ;

Niu, Yi ;

Pu, Shiliang ;

Zhou, Shuigeng .

2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :5571-5579

[7] Second-order Attention Network for Single Image Super-Resolution [J].

Dai, Tao ;

Cai, Jianrui ;

Zhang, Yongbing ;

Xia, Shu-Tao ;

Zhang, Lei .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :11057-11066

[8]

Dong C., 2015, arXiv

[9] Image Super-Resolution Using Deep Convolutional Networks [J].

Dong, Chao ;

Loy, Chen Change ;

He, Kaiming ;

Tang, Xiaoou .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2016, 38 (02) :295-307

[10] Synthetic Data for Text Localisation in Natural Images [J].

Gupta, Ankush ;

Vedaldi, Andrea ;

Zisserman, Andrew .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :2315-2324

← 1 2 3 4 →