ConvNeXt-ST-AFF: A Novel Skin Disease Classification Model Based on Fusion of ConvNeXt and Swin Transformer

被引:5
作者
Hao, Shengnan [1 ]
Zhang, Liguo [1 ]
Jiang, Yanyan [2 ]
Wang, Jingkun [3 ]
Ji, Zhanlin [1 ,4 ]
Zhao, Li [3 ]
Ganchev, Ivan [4 ,5 ,6 ]
机构
[1] North China Univ Sci & Technol, Dept Artificial Intelligence, Tangshan 063009, Peoples R China
[2] Hebei Agr Univ, Coll Urban & Rural Construct, Baoding 071000, Peoples R China
[3] Tsinghua Univ, Inst Precis Med, Beijing Natl Res Ctr Informat Sci & Technol, Beijing 100084, Peoples R China
[4] Univ Limerick, Telecommun Res Ctr TRC, Limerick V94 T9PX, Ireland
[5] Univ Plovdiv Paisii Hilendarski, Dept Comp Syst, Plovdiv 4000, Bulgaria
[6] Bulgarian Acad Sci, Inst Math & Informat, Sofia 1040, Bulgaria
关键词
Skin disease classification; image denoising; model fusion; attention; ConvNeXt; swin transformer; NEURAL-NETWORK; LOCALIZATION; FRAMEWORK; FEATURES; SYSTEM;
D O I
10.1109/ACCESS.2023.3324042
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Automatic classification of dermatological images is an important technology that assists doctors in performing faster and more accurate classification of skin diseases. Recently, convolutional neural networks (CNNs) and Transformer networks have been employed in learning respectively the local and global features of lesion images. However, existing works mainly focus on utilizing a single neural network for feature extraction, which limits the model classification performance. In order to tackle this problem, a novel fusion model, named ConvNeXt-ST-AFF, is proposed in this paper, by combining the strengths of ConvNeXt and Swin Transformer (ConvNeXt-ST in the model's name). In the proposed model, the pretrained ConvNeXt and Swin Transformer networks extract local and global features from images, which are then fused using Attentional Feature Fusion (AFF) submodules (AFF in the model's name). Additionally, in order to enhance the model's attention on the regions of skin lesions during training, an Efficient Channel Attention (ECA) module is incorporated into the ConvNeXt network. Moreover, the proposed model employs a denoising module to reduce the influence of artifacts and improve the image contrast. The results, obtained by experiments conducted on two datasets, demonstrate that the proposed ConvNeXt-ST-AFF model has higher classification ability, according to multiple evaluation metrics, compared to the original ConvNeXt and Swin Transformer, and other state-of-the-art classification models.
引用
收藏
页码:117460 / 117473
页数:14
相关论文
共 57 条
  • [1] Malignant skin melanoma detection using image augmentation by oversampling in nonlinear lower-dimensional embedding manifold
    Abayomi-Alli, Olusola Oluwakemi
    Damasevicius, Robertas
    Misra, Sanjay
    Maskeliunas, Rytis
    Abayomi-Alli, Adebayo
    [J]. TURKISH JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCES, 2021, 29 : 2600 - 2614
  • [2] Automatic skin tumour border detection for digital dermoscopy using a new digital image analysis scheme
    Abbas, Q.
    Garcia, I. F.
    Rashid, M.
    [J]. BRITISH JOURNAL OF BIOMEDICAL SCIENCE, 2010, 67 (04) : 177 - 183
  • [3] [Anonymous], 2016, arXiv
  • [4] A two-stream deep neural network-based intelligent system for complex skin cancer types classification
    Attique Khan, Muhammad
    Sharif, Muhammad
    Akram, Tallha
    Kadry, Seifedine
    Hsu, Ching-Hsien
    [J]. INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2022, 37 (12) : 10621 - 10649
  • [5] Multiclass skin lesion classification in dermoscopic images using swin transformer model
    Ayas, Selen
    [J]. NEURAL COMPUTING & APPLICATIONS, 2023, 35 (09) : 6713 - 6722
  • [6] A multimodal transformer to fuse images and metadata for skin disease classification
    Cai, Gan
    Zhu, Yu
    Wu, Yue
    Jiang, Xiaoben
    Ye, Jiongyao
    Yang, Dawei
    [J]. VISUAL COMPUTER, 2023, 39 (07) : 2781 - 2793
  • [7] Dermoscopy Image Analysis: Overview and Future Directions
    Celebi, M. Emre
    Codella, Noel
    Halpern, Allan
    [J]. IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2019, 23 (02) : 474 - 478
  • [8] Run, Don't Walk: Chasing Higher FLOPS for Faster Neural Networks
    Chen, Jierun
    Kao, Shiu-Hong
    He, Hao
    Zhuo, Weipeng
    Wen, Song
    Lee, Chul-Ho
    Chan, S. -H. Gary
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 12021 - 12031
  • [9] Randaugment: Practical automated data augmentation with a reduced search space
    Cubuk, Ekin D.
    Zoph, Barret
    Shlens, Jonathon
    Le, Quoc, V
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2020), 2020, : 3008 - 3017
  • [10] Attentional Feature Fusion
    Dai, Yimian
    Gieseke, Fabian
    Oehmcke, Stefan
    Wu, Yiquan
    Barnard, Kobus
    [J]. 2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WACV 2021, 2021, : 3559 - 3568