Research on Multi-Scale CNN and Transformer-Based Multi-Level Multi-Classification Method for Images

Cited by: 1
Authors
Gou, Quandeng [1]
Ren, Yuheng [2, 3]
Affiliations
[1] Neijiang Normal Univ, Informatizat Construct & Serv Ctr, Neijiang 641000, Peoples R China
[2] Xiamen Kunlu IoT Informat Technol Co Ltd, Xiamen 361021, Fujian, Peoples R China
[3] European Union Univ, Sch Business Econ, CH-1820 Montreux, Switzerland
Source
IEEE ACCESS | 2024 / Vol. 12
Keywords
Feature extraction; Task analysis; Convolution; Image classification; Convolutional neural networks; Vectors; Transformer; hierarchical characteristics of the model; multi-scale convolution; multi-level and multi-classification of images;
DOI
10.1109/ACCESS.2024.3433374
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
With the rapid growth of digital creative work, the volume of image data it generates has exploded. Managing such massive image collections effectively requires multi-level, multi-class classification of images. However, existing deep-learning models for hierarchical image classification are all based on convolutional neural networks, which are limited in capturing the underlying global features. In contrast, the Transformer, a newer neural network architecture, captures global context through the attention mechanism and therefore performs very well on a range of visual recognition tasks. Existing Transformer-based work, however, does not exploit the hierarchical structure information in the model, which makes it difficult to apply to multi-level, multi-class image classification. This paper therefore proposes a new multi-level, multi-class image classification model that uses a multi-scale CNN to capture feature information at different scales and combines it with the Transformer's ability to extract global features. The model also makes full use of the hierarchical structure information in the Transformer to better capture the complex relationships within images. Extensive experiments on three datasets, CIFAR-10, CIFAR-100, and CUB-200-2011, compare its performance with existing multi-level, multi-class image classification models. The results show that the proposed model achieves higher classification accuracy and better robustness.
Pages: 103049 - 103059
Page count: 11
Related papers
50 in total
  • [21] A Multi-Classification Method of Liver Pathology Images Based on Sparse Multi-Scale Local Binary Pattern-Local Directional Pattern
    Liu, H. L.
    Jiang, H. Y.
    Zhang, G. X.
    Wang, Z. G.
    JOURNAL OF MEDICAL IMAGING AND HEALTH INFORMATICS, 2015, 5 (08) : 1973 - 1976
  • [22] Transformer-based Multi-scale Underwater Image Enhancement Network
    Yang, Ai-Ping
    Fang, Si-Jie
    Shao, Ming-Fu
    Zhang, Teng-Fei
    Dongbei Daxue Xuebao/Journal of Northeastern University, 2024, 45 (12): : 1696 - 1705
  • [23] Multi-scale and Multi-level Attention Based on External Knowledge in EHRs
    Le, Duc
    Le, Bac
    RECENT CHALLENGES IN INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2024, PT I, 2024, 2144 : 113 - 125
  • [24] MULTI-SCALE TRANSFORMER-BASED FEATURE COMBINATION FOR IMAGE RETRIEVAL
    Roig Mari, Carlos
    Varas Gonzalez, David
    Bou-Balust, Elisenda
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 3166 - 3170
  • [25] MUSTER: A Multi-Scale Transformer-Based Decoder for Semantic Segmentation
    Xu, Jing
    Shi, Wentao
    Gao, Pan
    Li, Qizhu
    Wang, Zhengwei
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2025, 9 (01): : 202 - 212
  • [26] Classification of liver lesions in CT images based on LivlesioNet, modified Multi-Scale CNN with bridge Scale method
    Gedeon, Kashala Kabe
    Liu, Zhe
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (03) : 8911 - 8929
  • [27] Crowd counting by using multi-level density-based spatial information: A Multi-scale CNN framework
    Dong, Li
    Zhang, Haijun
    Ji, Yuzhu
    Ding, Yuxin
    INFORMATION SCIENCES, 2020, 528 (528) : 79 - 91
  • [28] Classification of liver lesions in CT images based on LivlesioNet, modified Multi-Scale CNN with bridge Scale method
    Kashala Kabe Gedeon
    Zhe Liu
    Multimedia Tools and Applications, 2024, 83 : 8911 - 8929
  • [29] SPECTRAL-SPATIAL CLASSIFICATION OF HYPERSPECTRAL IMAGES WITH MULTI-LEVEL CNN
    Chhapariya, Koushikey
    Buddhiraju, Krishna Mohan
    Kumar, Anil
    2022 12TH WORKSHOP ON HYPERSPECTRAL IMAGING AND SIGNAL PROCESSING: EVOLUTION IN REMOTE SENSING (WHISPERS), 2022,
  • [30] The classification of airborne LiDAR building point clouds based on multi-scale and multi-level cloth simulation
    Liu, Rufei
    Wang, Minye
    Hou, Guangqiang
    Wu, Wei
    Zhao, Changwei
    Ge, Qingjie
    PHOTOGRAMMETRIC RECORD, 2023, 38 (182): : 118 - 136