MSANet: Multi-scale attention networks for image classification

被引:0
|
作者
Ping Cao
Fangxin Xie
Shichao Zhang
Zuping Zhang
Jianfeng Zhang
机构
[1] Central South University,School of Computer Science and Engineering
[2] Beijing Jiaotong University,School of Computer and Information Technology
[3] National University of Defense Technology,College of Computer Science
来源
关键词
Image classification; Convolutional neural network; Multi-scale feature; Channel attention; Spatial attention;
D O I
暂无
中图分类号
学科分类号
摘要
The classification of images based on the principles of human vision is a major task in the field of computer vision. It is a common method to use multi-scale information and attention mechanism to obtain better classification performance. The methods based on multi-scale can obtain more accurate feature description by fusing different levels of information, and the methods based on attention can make the deep learning models focus on more valuable information in the image. However, the current methods usually treat the acquisition of multi-scale feature maps and the acquisition of attention weights as two separate steps in sequence. Since human eyes usually use these two methods at the same time when observing objects, we propose a multi-scale attention (MSA) module. The proposed MSA module directly extracts the attention information of different scales from a feature map, that is, the multi-scale and attention methods are simultaneously completed in one step. In the MSA module, we obtain different scales of channel and spatial attention by controlling the size of the convolution kernel for cross-channel and cross-space information interaction. Our module can be easily integrated into different convolutional neural networks to form Multi-scale attention networks (MSANet) architectures. We demonstrate the performance of MSANet on CIFAR-10 and CIFAR-100 data sets. In particular, the accuracy of our ResNet-110 based model on CIFAR-10 is 94.39%. Compared with the benchmark convolution model, our proposed multi-scale attention module can bring a roughly 3% increase in accuracy rate on CIFAR-100. Experimental results show that the proposed multi-scale attention module is superior in image classification.
引用
收藏
页码:34325 / 34344
页数:19
相关论文
共 50 条
  • [31] Attention-based Multi-scale Transfer ResNet for Skull Fracture Image Classification
    Ning, Dunbo
    Liu, Gang
    Jiang, Rifeng
    Wang, Chuyi
    FOURTH INTERNATIONAL WORKSHOP ON PATTERN RECOGNITION, 2019, 11198
  • [32] MSRA-G: Combination of multi-scale residual attention network and generative adversarial networks for hyperspectral image classification
    Zhao, Jinling
    Hu, Lei
    Huang, Linsheng
    Wang, Chuanjian
    Liang, Dong
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 121
  • [33] CONTEXTUAL MULTI-SCALE IMAGE CLASSIFICATION ON QUADTREE
    Hedhli, Ihsen
    Moser, Gabriele
    Serpico, Sebastiano B.
    Zerubia, Josiane
    2016 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2016, : 1349 - 1353
  • [34] IMAGE CLASSIFICATION METHOD WITH MULTI-SCALE FEATURES
    Lu, Peng
    Zou, Peiqi
    Zou, Guoliang
    Zheng, Zongsheng
    Zou, Peiqi
    JOURNAL OF NONLINEAR AND CONVEX ANALYSIS, 2019, 20 (06) : 1183 - 1191
  • [35] Combining Multi-Scale Dissimilarities for Image Classification
    Li, Yan
    Duin, Robert P. W.
    Loog, Marco
    2012 21ST INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR 2012), 2012, : 1639 - 1642
  • [36] MULTI-SCALE RESIDUAL NETWORK FOR IMAGE CLASSIFICATION
    Zhong, Xian
    Gong, Oubo
    Huang, Wenxin
    Yuan, Jingling
    Ma, Bo
    Li, Ryan Wen
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 2023 - 2027
  • [37] Multi-Scale Self-Attention for Text Classification
    Guo, Qipeng
    Qiu, Xipeng
    Liu, Pengfei
    Xue, Xiangyang
    Zhang, Zheng
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 7847 - 7854
  • [38] Multi-Scale Attention Network for Diabetic Retinopathy Classification
    Al-Antary, Mohammad T.
    Arafa, Yasmine
    IEEE ACCESS, 2021, 9 : 54190 - 54200
  • [39] Multi-Scale and Attention based ResNet for Heartbeat Classification
    Zhang, Haojie
    Yang, Gongping
    Huang, Yuwen
    Yuan, Feng
    Yin, Yilong
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 1529 - 1535
  • [40] Multi-scale convolution networks for seismic event classification with windowed self-attention
    Huang, Yongming
    Xie, Yi
    Liu, Wei
    Ma, Yongsheng
    Miao, Fajun
    Zhang, Guobao
    JOURNAL OF SEISMOLOGY, 2025, 29 (01) : 257 - 268