MSANet: Multi-scale attention networks for image classification

Cited by: 0
Authors
Ping Cao
Fangxin Xie
Shichao Zhang
Zuping Zhang
Jianfeng Zhang
Affiliations
[1] Central South University,School of Computer Science and Engineering
[2] Beijing Jiaotong University,School of Computer and Information Technology
[3] National University of Defense Technology,College of Computer Science
Source
Keywords
Image classification; Convolutional neural network; Multi-scale feature; Channel attention; Spatial attention
DOI
Not available
Chinese Library Classification number
Subject classification number
Abstract
Classifying images according to the principles of human vision is a major task in computer vision. Multi-scale information and attention mechanisms are both commonly used to improve classification performance. Multi-scale methods obtain more accurate feature descriptions by fusing information from different levels, while attention-based methods make deep learning models focus on the more valuable information in an image. However, current methods usually treat the acquisition of multi-scale feature maps and the acquisition of attention weights as two separate, sequential steps. Because human eyes usually apply both at the same time when observing objects, we propose a multi-scale attention (MSA) module. The MSA module extracts attention information at different scales directly from a feature map, so that multi-scale extraction and attention are completed in a single step. Within the module, we obtain channel and spatial attention at different scales by controlling the size of the convolution kernel used for cross-channel and cross-space information interaction. The module can be easily integrated into different convolutional neural networks to form multi-scale attention network (MSANet) architectures. We evaluate MSANet on the CIFAR-10 and CIFAR-100 datasets. In particular, our ResNet-110-based model reaches 94.39% accuracy on CIFAR-10, and compared with the baseline convolutional model, the proposed module increases accuracy on CIFAR-100 by roughly 3%. These experimental results show that the proposed multi-scale attention module performs well in image classification.
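The idea of obtaining channel attention at several scales by varying the convolution kernel size can be illustrated with a small sketch. This is not the authors' implementation: the abstract does not specify the layer layout, so the code below assumes a global-average-pooled channel descriptor, a 1D convolution across channels at each kernel size, a sigmoid, and a plain mean to fuse the scales. Uniform kernels stand in for learned weights, and all function names are hypothetical. Spatial attention would follow the same pattern with 2D kernels of different sizes.

```python
import math


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def global_avg_pool(feature_map):
    """Reduce a C x H x W nested list to one mean value per channel."""
    return [
        sum(sum(row) for row in channel) / (len(channel) * len(channel[0]))
        for channel in feature_map
    ]


def conv1d_same(values, kernel):
    """1D cross-correlation with zero padding; expects an odd kernel length."""
    pad = len(kernel) // 2
    padded = [0.0] * pad + list(values) + [0.0] * pad
    return [
        sum(w * padded[i + j] for j, w in enumerate(kernel))
        for i in range(len(values))
    ]


def multi_scale_channel_attention(feature_map, kernel_sizes=(3, 5, 7)):
    """Return one attention weight per channel, averaged over kernel sizes.

    Assumptions (not from the paper): uniform kernels replace learned
    weights, and the per-scale weights are fused by a simple mean.
    """
    pooled = global_avg_pool(feature_map)
    per_scale = []
    for k in kernel_sizes:
        kernel = [1.0 / k] * k  # larger k mixes more neighbouring channels
        per_scale.append([sigmoid(v) for v in conv1d_same(pooled, kernel)])
    n = len(per_scale)
    return [sum(scale[c] for scale in per_scale) / n for c in range(len(pooled))]


def apply_channel_attention(feature_map, weights):
    """Rescale every value in each channel by that channel's weight."""
    return [
        [[w * v for v in row] for row in channel]
        for channel, w in zip(feature_map, weights)
    ]


if __name__ == "__main__":
    # Four 2x2 channels filled with the constants 0, 1, 2, 3.
    fmap = [[[float(c)] * 2 for _ in range(2)] for c in range(4)]
    weights = multi_scale_channel_attention(fmap)
    reweighted = apply_channel_attention(fmap, weights)
    print(weights)
    print(reweighted[3])
```

In a trained network the kernels would be learned and the fusion step might differ; the sketch only shows how the kernel size controls the range of cross-channel interaction at each scale.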
Pages: 34325-34344
Page count: 19
Related papers
50 in total
  • [21] MSANet: Multi scale prohibited item detection with tripe attention
    Wu, Zhengping
    Zhu, Peng
    Lei, Bangjun
    PROCEEDINGS OF 2024 3RD INTERNATIONAL CONFERENCE ON FRONTIERS OF ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING, FAIML 2024, 2024, : 27 - 31
  • [22] A multi-scale semantic attention representation for multi-label image recognition with graph networks
    Liang, Jun
    Xu, Feiteng
    Yu, Songsen
    Neurocomputing, 2022, 491 : 14 - 23
  • [24] Multi-Scale Spatial-Spectral Residual Attention Network for Hyperspectral Image Classification
    Wu, Qinggang
    He, Mengkun
    Liu, Zhongchi
    Liu, Yanyan
    ELECTRONICS, 2024, 13 (02)
  • [25] Multi-scale high and low feature fusion attention network for intestinal image classification
    Li, Sheng
    Zhu, Beibei
    Guo, Xinran
    Ye, Shufang
    Ye, Jietong
    Zhuang, Yongwei
    He, Xiongxiong
    SIGNAL IMAGE AND VIDEO PROCESSING, 2023, 17 (06) : 2877 - 2886
  • [26] Enhancing Medical Image Classification With Context Modulated Attention and Multi-Scale Feature Fusion
    Zhang, Renhan
    Luo, Xuegang
    Lv, Junrui
    Cao, Junyang
    Zhu, Yangping
    Wang, Juan
    Zheng, Bochuan
    IEEE ACCESS, 2025, 13 : 15226 - 15243
  • [27] Research on image classification based on residual group multi-scale enhanced attention network
    Wang, Chunzhi
    Deng, Xizhi
    Sun, Yun
    Yan, Lingyu
    COMPUTERS & ELECTRICAL ENGINEERING, 2024, 118
  • [29] Multi-scale receptive fields: Graph attention neural network for hyperspectral image classification
    Ding, Yao
    Zhang, Zhili
    Zhao, Xiaofeng
    Hong, Danfeng
    Cai, Wei
    Yang, Nengjun
    Wang, Bei
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 223
  • [30] Dual attention guided multi-scale CNN for fine-grained image classification
    Liu, Xiaozhang
    Zhang, Lifeng
    Li, Tao
    Wang, Dejian
    Wang, Zhaojie
    INFORMATION SCIENCES, 2021, 573 : 37 - 45