Densely Connected Transformer With Linear Self-Attention for Lightweight Image Super-Resolution

被引：11

作者：

Zeng, Kun ^{[1
]}

Lin, Hanjiang ^{[2
]}

Yan, Zhiqiang ^{[3
]}

Fang, Jinsheng ^{[2
]}

机构：

[1] Minjiang Univ, Coll Comp & Control Engn, Fujian Prov Key Lab Informat Proc & Intelligent Co, Fuzhou 350108, Peoples R China

[2] Minnan Normal Univ, Sch Comp Sci, Key Lab Data Sci & Intelligence Applicat, Zhangzhou 363000, Peoples R China

[3] Guilin Univ Elect Technol, Dept Comp, Guilin 541004, Peoples R China

来源：

IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT | 2023年 / 72卷

基金：

中国国家自然科学基金;

关键词：

Transformers; Computational modeling; Superresolution; Feature extraction; Task analysis; Image restoration; Computational efficiency; Convolutional neural network (CNN); densely connected network; lightweight network; linear self-attention (LSA); single image super-resolution; transformer;

D O I：

10.1109/TIM.2023.3304672

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Image super-resolution (SR) is the process of restoring high-resolution (HR) images from low-resolution (LR) ones. Recent Transformer-based SR methods have achieved impressive results by utilizing the self-attention (SA) mechanism, which allows modeling long-range dependencies among input features in spatial dimensions. However, the computational complexity of SA increases quadratically with respect to the feature size, which makes Transformer-based methods inefficient. Additionally, despite the success of dense connections in convolutional neural network (CNN)-based methods, they have not been fully explored in Transformer-based methods. In this article, we propose a novel approach for lightweight SR, called densely connected transformer with linear SA (DCTLSA) network. Our method addresses the efficiency issue of SA by designing a new linear SA (LSA), which calculates the similarities in spatial dimension with linear complexity. Moreover, we leverage dense connections to integrate multiple levels of features and provide rich information for SR. Our experimental results demonstrate that DCTLSA outperforms state-of-the-art lightweight SR methods in terms of SR performance, model complexity, and inference speed. The code of the proposed method is available at https://github.com/zengkun301/DCTLSA.

引用

页数：12

共 47 条

[1] Fast, Accurate, and Lightweight Super-Resolution with Cascading Residual Network [J].

Ahn, Namhyuk ;

Kang, Byungkon ;

Sohn, Kyung-Ah .

COMPUTER VISION - ECCV 2018, PT X, 2018, 11214 :256-272

[2] Low-Complexity Single-Image Super-Resolution based on Nonnegative Neighbor Embedding [J].

Bevilacqua, Marco ;

Roumy, Aline ;

Guillemot, Christine ;

Morel, Marie-Line Alberi .

PROCEEDINGS OF THE BRITISH MACHINE VISION CONFERENCE 2012, 2012,

[3] Super-resolution musculoskeletal MRI using deep learning [J].

Chaudhari, Akshay S. ;

Fang, Zhongnan ;

Kogan, Feliks ;

Wood, Jeff ;

Stevens, Kathryn J. ;

Gibbons, Eric K. ;

Lee, Jin Hyung ;

Gold, Garry E. ;

Hargreaves, Brian A. .

MAGNETIC RESONANCE IN MEDICINE, 2018, 80 (05) :2139-2154

[4] Pre-Trained Image Processing Transformer [J].

Chen, Hanting ;

Wang, Yunhe ;

Guo, Tianyu ;

Xu, Chang ;

Deng, Yiping ;

Liu, Zhenhua ;

Ma, Siwei ;

Xu, Chunjing ;

Xu, Chao ;

Gao, Wen .

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :12294-12305

[5] Activating More Pixels in Image Super-Resolution Transformer [J].

Chen, Xiangyu ;

Wang, Xintao ;

Zhou, Jiantao ;

Qiao, Yu ;

Dong, Chao .

2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, :22367-22377

[6] An Efficient mmW Frequency-Domain Imaging Algorithm for Near-Field Scanning 1-D SIMO/MIMO Array [J].

Chen, Xu ;

Wang, Hongqiang ;

Yang, Qi ;

Zeng, Yang ;

Deng, Bin .

IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2022, 71

[7] Second-order Attention Network for Single Image Super-Resolution [J].

Dai, Tao ;

Cai, Jianrui ;

Zhang, Yongbing ;

Xia, Shu-Tao ;

Zhang, Lei .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :11057-11066

[8]

Danielsson P-E, 1990, Machine Vision for Three-Dimensional Scenes, P347, DOI DOI 10.1016/B978-0-12-266722-0.50016-6

[9] Learning a Deep Convolutional Network for Image Super-Resolution [J].

Dong, Chao ;

Loy, Chen Change ;

He, Kaiming ;

Tang, Xiaoou .

COMPUTER VISION - ECCV 2014, PT IV, 2014, 8692 :184-199

[10] Interpreting Super-Resolution Networks with Local Attribution Maps [J].

Gu, Jinjin ;

Dong, Chao .

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :9195-9204

← 1 2 3 4 5 →