Multi-scale feature fusion distilled attention network for efficient image super-resolution

被引:0
作者
Tang, Yinggan [1 ,2 ,3 ]
Su, Mengjie [1 ]
Zhang, Xiuli [4 ]
机构
[1] Yanshan Univ, Sch Elect Engn, Qinhuangdao 066004, Hebei, Peoples R China
[2] Yanshan Univ, Key Lab Intelligent Rehabil & Neromodulat Hebei Pr, Qinhuangdao 066004, Hebei, Peoples R China
[3] Yanshan Univ, Key Lab Intelligent Control & Neural Informat Proc, Minist Educ, Qinhuangdao 066004, Hebei, Peoples R China
[4] Sch Hebei Vocat & Tech Coll Bldg Mat, Qinhuangdao 066004, Peoples R China
关键词
Efficient super-resolution; Information distillation; Multi-scale feature;
D O I
10.1016/j.asoc.2025.113382
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Efficient single image super-resolution (SISR) seeks to enhance reconstruction quality while minimizing computational complexity and model parameters for practical deployment on resource-constrained edge devices. A critical limitation of existing approaches lies in the restricted receptive fields of conventional convolutions, which hinder attention mechanisms and backbone networks from effectively capturing non-local features, ultimately degrading super-resolution performance. To solve this challenge, we propose a novel Multi-scale Feature Fusion Distilled Attention Network (MFFDAN) that synergizes multi-scale feature extraction with lightweight attention mechanisms. Our framework introduces three key innovations. First, we proposed a Multi-scale Convolution (MConv) module that dynamically extracts hierarchical features with multi-scale receptive fields while maintaining parameter efficiency, subsequently extended into a Multi-scale Feature Fusion Distillation Block (MFFDB). Second, we proposed a Distilled Spatial Attention (DSA) mechanism that enhances spatial feature interactions through multi-scale downsampling and distillation-based information refinement. Third, a lightweight Depth-wise Channel Attention (DCA) module optimized via depth-wise convolutions for efficient channel feature selection is proposed. These components are systematically integrated through residual connections to form the Multi-scale Feature Fusion Distilled Attention Block (MFFDAB), the core building block of our architecture. Extensive experiments demonstrate that MFFDAN achieves state-of-theart performance while maintaining superior computational efficiency, achieving an optimal balance between reconstruction quality (PSNR/SSIM), model complexity (parameters/Multi-Adds), and inference speed across benchmark datasets. The proposed innovations provide a practical solution for deploying high-quality SISR models on edge computing platforms.
引用
收藏
页数:11
相关论文
共 56 条
[1]   NTIRE 2017 Challenge on Single Image Super-Resolution: Dataset and Study [J].
Agustsson, Eirikur ;
Timofte, Radu .
2017 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2017, :1122-1131
[2]   Fast, Accurate, and Lightweight Super-Resolution with Cascading Residual Network [J].
Ahn, Namhyuk ;
Kang, Byungkon ;
Sohn, Kyung-Ah .
COMPUTER VISION - ECCV 2018, PT X, 2018, 11214 :256-272
[3]   Single image super-resolution based on directional variance attention network [J].
Behjati, Parichehr ;
Rodriguez, Pau ;
Fernandez, Carles ;
Hupont, Isabelle ;
Mehri, Armin ;
Gonzalez, Jordi .
PATTERN RECOGNITION, 2023, 133
[4]   Low-Complexity Single-Image Super-Resolution based on Nonnegative Neighbor Embedding [J].
Bevilacqua, Marco ;
Roumy, Aline ;
Guillemot, Christine ;
Morel, Marie-Line Alberi .
PROCEEDINGS OF THE BRITISH MACHINE VISION CONFERENCE 2012, 2012,
[5]   Run, Don't Walk: Chasing Higher FLOPS for Faster Neural Networks [J].
Chen, Jierun ;
Kao, Shiu-Hong ;
He, Hao ;
Zhuo, Weipeng ;
Wen, Song ;
Lee, Chul-Ho ;
Chan, S. -H. Gary .
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, :12021-12031
[6]   Rethinking Coarse-to-Fine Approach in Single Image Deblurring [J].
Cho, Sung-Jin ;
Ji, Seo-Won ;
Hong, Jun-Pyo ;
Jung, Seung-Won ;
Ko, Sung-Jea .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :4621-4630
[7]  
Deng Weijian, 2023, 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), P1712, DOI 10.1109/CVPRW59228.2023.00172
[8]   RepVGG: Making VGG-style ConvNets Great Again [J].
Ding, Xiaohan ;
Zhang, Xiangyu ;
Ma, Ningning ;
Han, Jungong ;
Ding, Guiguang ;
Sun, Jian .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :13728-13737
[9]   Accelerating the Super-Resolution Convolutional Neural Network [J].
Dong, Chao ;
Loy, Chen Change ;
Tang, Xiaoou .
COMPUTER VISION - ECCV 2016, PT II, 2016, 9906 :391-407
[10]   Learning a Deep Convolutional Network for Image Super-Resolution [J].
Dong, Chao ;
Loy, Chen Change ;
He, Kaiming ;
Tang, Xiaoou .
COMPUTER VISION - ECCV 2014, PT IV, 2014, 8692 :184-199