ConvNeXt-ST-AFF: A Novel Skin Disease Classification Model Based on Fusion of ConvNeXt and Swin Transformer

被引：5

作者：

Hao, Shengnan ^{[1
]}

Zhang, Liguo ^{[1
]}

Jiang, Yanyan ^{[2
]}

Wang, Jingkun ^{[3
]}

Ji, Zhanlin ^{[1
,4
]}

Zhao, Li ^{[3
]}

Ganchev, Ivan ^{[4
,5
,6
]}

机构：

[1] North China Univ Sci & Technol, Dept Artificial Intelligence, Tangshan 063009, Peoples R China

[2] Hebei Agr Univ, Coll Urban & Rural Construct, Baoding 071000, Peoples R China

[3] Tsinghua Univ, Inst Precis Med, Beijing Natl Res Ctr Informat Sci & Technol, Beijing 100084, Peoples R China

[4] Univ Limerick, Telecommun Res Ctr TRC, Limerick V94 T9PX, Ireland

[5] Univ Plovdiv Paisii Hilendarski, Dept Comp Syst, Plovdiv 4000, Bulgaria

[6] Bulgarian Acad Sci, Inst Math & Informat, Sofia 1040, Bulgaria

来源：

IEEE ACCESS | 2023年 / 11卷

关键词：

Skin disease classification; image denoising; model fusion; attention; ConvNeXt; swin transformer; NEURAL-NETWORK; LOCALIZATION; FRAMEWORK; FEATURES; SYSTEM;

D O I：

10.1109/ACCESS.2023.3324042

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Automatic classification of dermatological images is an important technology that assists doctors in performing faster and more accurate classification of skin diseases. Recently, convolutional neural networks (CNNs) and Transformer networks have been employed in learning respectively the local and global features of lesion images. However, existing works mainly focus on utilizing a single neural network for feature extraction, which limits the model classification performance. In order to tackle this problem, a novel fusion model, named ConvNeXt-ST-AFF, is proposed in this paper, by combining the strengths of ConvNeXt and Swin Transformer (ConvNeXt-ST in the model's name). In the proposed model, the pretrained ConvNeXt and Swin Transformer networks extract local and global features from images, which are then fused using Attentional Feature Fusion (AFF) submodules (AFF in the model's name). Additionally, in order to enhance the model's attention on the regions of skin lesions during training, an Efficient Channel Attention (ECA) module is incorporated into the ConvNeXt network. Moreover, the proposed model employs a denoising module to reduce the influence of artifacts and improve the image contrast. The results, obtained by experiments conducted on two datasets, demonstrate that the proposed ConvNeXt-ST-AFF model has higher classification ability, according to multiple evaluation metrics, compared to the original ConvNeXt and Swin Transformer, and other state-of-the-art classification models.

引用

页码：117460 / 117473

页数：14

共 57 条

[1] Malignant skin melanoma detection using image augmentation by oversampling in nonlinear lower-dimensional embedding manifold
Abayomi-Alli, Olusola Oluwakemi
Damasevicius, Robertas
Misra, Sanjay
Maskeliunas, Rytis
Abayomi-Alli, Adebayo
[J]. TURKISH JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCES, 2021, 29 : 2600 - 2614
[2] Automatic skin tumour border detection for digital dermoscopy using a new digital image analysis scheme
Abbas, Q.
Garcia, I. F.
Rashid, M.
[J]. BRITISH JOURNAL OF BIOMEDICAL SCIENCE, 2010, 67 (04) : 177 - 183
[3] [Anonymous], 2016, arXiv
[4] A two-stream deep neural network-based intelligent system for complex skin cancer types classification
Attique Khan, Muhammad
Sharif, Muhammad
Akram, Tallha
Kadry, Seifedine
Hsu, Ching-Hsien
[J]. INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2022, 37 (12) : 10621 - 10649
[5] Multiclass skin lesion classification in dermoscopic images using swin transformer model
Ayas, Selen
[J]. NEURAL COMPUTING & APPLICATIONS, 2023, 35 (09) : 6713 - 6722
[6] A multimodal transformer to fuse images and metadata for skin disease classification
Cai, Gan
Zhu, Yu
Wu, Yue
Jiang, Xiaoben
Ye, Jiongyao
Yang, Dawei
[J]. VISUAL COMPUTER, 2023, 39 (07) : 2781 - 2793
[7] Dermoscopy Image Analysis: Overview and Future Directions
Celebi, M. Emre
Codella, Noel
Halpern, Allan
[J]. IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2019, 23 (02) : 474 - 478
[8] Run, Don't Walk: Chasing Higher FLOPS for Faster Neural Networks
Chen, Jierun
Kao, Shiu-Hong
He, Hao
Zhuo, Weipeng
Wen, Song
Lee, Chul-Ho
Chan, S. -H. Gary
[J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 12021 - 12031
[9] Randaugment: Practical automated data augmentation with a reduced search space
Cubuk, Ekin D.
Zoph, Barret
Shlens, Jonathon
Le, Quoc, V
[J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2020), 2020, : 3008 - 3017
[10] Attentional Feature Fusion
Dai, Yimian
Gieseke, Fabian
Oehmcke, Stefan
Wu, Yiquan
Barnard, Kobus
[J]. 2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WACV 2021, 2021, : 3559 - 3568

← 1 2 3 4 5 6 →