Research on Multi-Scale CNN and Transformer-Based Multi-Level Multi-Classification Method for Images

被引:1
|
作者
Gou, Quandeng [1 ]
Ren, Yuheng [2 ,3 ]
机构
[1] Neijiang Normal Univ, Informatizat Construct & Serv Ctr, Neijiang 641000, Peoples R China
[2] Xiamen Kunlu IoT Informat Technol Co Ltd, Xiamen 361021, Fujian, Peoples R China
[3] European Union Univ, Sch Business Econ, CH-1820 Montreux, Switzerland
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Feature extraction; Task analysis; Convolution; Image classification; Convolutional neural networks; Vectors; Transformer; hierarchical characteristics of the model; multi-scale convolution; multi-level and multi-classification of images;
D O I
10.1109/ACCESS.2024.3433374
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the vigorous development of digital creativity, the image data generated by it has exploded. To effectively manage massive image data, multi-level and multi-classification management of images has become very necessary. However, the existing hierarchical classification models of deep learning images are all based on convolutional neural networks, which have limitations in capturing the underlying global features. Different from this, Transformer, as a new neural network, captures the global context information through the attention mechanism, so it performs excellently in various visual recognition tasks. However, the existing work based on Transformer does not use the hierarchical structure information in the model, making it challenging to apply the model to multi-level and multi-classification tasks of images. Therefore, this paper proposes a new image multi-level and multi-classification model, which uses multi-scale CNN to effectively capture feature information at different scales and combines it with the Transformer's ability to extract global features. At the same time, the model makes full use of the hierarchical structure information in Transformer to better understand the complex relationship of images. We have done a lot of experiments on three data sets, CIFAR-10, CIFAR-100, and CUB-200-2011, and compared the performance with the existing multi-level and multi-classification model of images. The results show that our model has higher classification accuracy and better robustness.
引用
收藏
页码:103049 / 103059
页数:11
相关论文
共 50 条
  • [31] Local climate zone classification using a multi-scale, multi-level attention network
    Kim, Minho
    Jeong, Doyoung
    Kim, Yongil
    ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2021, 181 (181) : 345 - 366
  • [32] MLMS-Net: A Point Cloud Classification Network with Multi-Level and Multi-Scale
    Xue D.
    Cheng Y.
    Wen P.
    Yu W.
    Qin X.
    Cheng, Yinglei, 1600, Xi'an Jiaotong University (54): : 70 - 78
  • [33] Multi-Classification of Skin Lesion Images Including Mpox Disease Using Transformer-Based Deep Learning Architectures
    Vuran, Seyfettin
    Ucan, Murat
    Akin, Mehmet
    Kaya, Mehmet
    DIAGNOSTICS, 2025, 15 (03)
  • [34] Transformer-Based Multi-Scale Data-Driven Wellbore Risk Prediction Method
    Zhang, Hongyuan
    Liu, Yupei
    Zhang, Xingquan
    Yin, Zhiming
    2024 7TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND BIG DATA, ICAIBD 2024, 2024, : 53 - 58
  • [35] MSTD: A Multi-Scale Transformer-Based Method to Diagnose Benign and Malignant Lung Nodules
    Zhao, Xiaoyu
    Li, Jiao
    Qi, Man
    Chen, Xuxin
    Chen, Wei
    Li, Yongqun
    Liu, Qi
    Tang, Jiajia
    Han, Zhihai
    Zhang, Chunyang
    IEEE ACCESS, 2025, 13 : 16182 - 16195
  • [36] TransVPR: Transformer-Based Place Recognition with Multi-Level Attention Aggregation
    Wang, Ruotong
    Shen, Yanqing
    Zuo, Weiliang
    Zhou, Sanping
    Zheng, Nanning
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 13638 - 13647
  • [37] Research on Rain Removal Method for Single Image Based on Multi-channel and Multi-scale CNN
    Liu C.
    Wang Q.
    Bi X.
    Dianzi Yu Xinxi Xuebao/Journal of Electronics and Information Technology, 2020, 42 (09): : 2285 - 2292
  • [38] Research on Rain Removal Method for Single Image Based on Multi-channel and Multi-scale CNN
    Liu Changyuan
    Wang Qi
    Bi Xiaojun
    JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2020, 42 (09) : 2285 - 2292
  • [39] Surgical instrument segmentation based on multi-scale and multi-level feature network
    Wang, Yiming
    Qiu, Zhongxi
    Hu, Yan
    Chen, Hao
    Ye, Fangfu
    Liu, Jiang
    2021 43RD ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE & BIOLOGY SOCIETY (EMBC), 2021, : 2672 - 2675
  • [40] Multi-task sentiment classification model based on DistilBert and multi-scale CNN
    Xiong, Guanghao
    Yan, Ke
    2021 IEEE INTL CONF ON DEPENDABLE, AUTONOMIC AND SECURE COMPUTING, INTL CONF ON PERVASIVE INTELLIGENCE AND COMPUTING, INTL CONF ON CLOUD AND BIG DATA COMPUTING, INTL CONF ON CYBER SCIENCE AND TECHNOLOGY CONGRESS DASC/PICOM/CBDCOM/CYBERSCITECH 2021, 2021, : 700 - 707