Enhancing skin lesion segmentation with a fusion of convolutional neural networks and transformer models

被引:3
作者
Xu, Zhijian [1 ]
Guo, Xingyue [2 ]
Wang, Juan [2 ]
机构
[1] China West Normal Univ, Sch Elect Informat Engn, 1 Shida Rd, Nanchong 637009, Sichuan, Peoples R China
[2] China West Normal Univ, Sch Comp Sci, 1 Shida Rd, Nanchong 637009, Sichuan, Peoples R China
基金
中国国家自然科学基金;
关键词
Deep learning; Convolutional neural networks; Transformer; Skin lesion segmentation; Feature fusion; Dual branch encoder;
D O I
10.1016/j.heliyon.2024.e31395
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Accurate segmentation is crucial in diagnosing and analyzing skin lesions. However, automatic segmentation of skin lesions is extremely challenging because of their variable sizes, uneven color distributions, irregular shapes, hair occlusions, and blurred boundaries. Owing to the limited range of convolutional networks receptive fields, shallow convolution cannot extract the global features of images and thus has limited segmentation performance. Because medical image datasets are small in scale, the use of excessively deep networks could cause overfitting and increase computational complexity. Although transformer networks can focus on extracting global information, they cannot extract sufficient local information and accurately segment detailed lesion features. In this study, we designed a dual -branch encoder that combines a convolution neural network (CNN) and a transformer. The CNN branch of the encoder comprises four layers, which learn the local features of images through layer -wise downsampling. The transformer branch also comprises four layers, enabling the learning of global image information through attention mechanisms. The feature fusion module in the network integrates local features and global information, emphasizes important channel features through the channel attention mechanism, and filters irrelevant feature expressions. The information exchange between the decoder and encoder is finally achieved through skip connections to supplement the information lost during the sampling process, thereby enhancing segmentation accuracy. The data used in this paper are from four public datasets, including images of melanoma, basal cell tumor, fibroma, and benign nevus. Because of the limited size of the image data, we enhanced them using methods such as random horizontal flipping, random vertical flipping, random brightness enhancement, random contrast enhancement, and rotation. The segmentation accuracy is evaluated through intersection over union and duration, integrity, commitment, and effort indicators, reaching 87.7 % and 93.21 %, 82.05 % and 89.19 %, 86.81 % and 92.72 %, and 92.79 % and 96.21 %, respectively, on the ISIC 2016, ISIC 2017, ISIC 2018, and PH2 datasets, respectively (code: https://github.com/hyjane/CCT-Net).
引用
收藏
页数:10
相关论文
共 40 条
[1]   A new approach for texture segmentation based on the Gray Level Co-occurrence Matrix [J].
Aouat, Saliha ;
Ait-hammi, Idir ;
Hamouchene, Izem .
MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (16) :24027-24052
[2]  
Baygin M., 2022, arXiv
[3]  
Codella N, 2019, Arxiv, DOI arXiv:1902.03368
[4]  
Codella NCF, 2018, I S BIOMED IMAGING, P168, DOI 10.1109/ISBI.2018.8363547
[5]   Transfer learning for segmentation with hybrid classification to Detect Melanoma Skin Cancer [J].
Dandu, Ravi ;
Murthy, M. Vinayaka ;
Kumar, Y. B. Ravi .
HELIYON, 2023, 9 (04)
[6]  
Dosovitskiy A, 2021, Arxiv, DOI [arXiv:2010.11929, DOI 10.48550/ARXIV.2010.11929]
[7]   A wavelet transform-based image segmentation method [J].
Gao, Jianqiang ;
Wang, Binbin ;
Wang, Ziyi ;
Wang, Yufeng ;
Kong, Fanzhi .
OPTIK, 2020, 208
[8]   Feature fusion and non-negative matrix factorization based active contours for texture segmentation [J].
Gao, Mingqi ;
Chen, Hengxin ;
Zheng, Shenhai ;
Fang, Bin .
SIGNAL PROCESSING, 2019, 159 :104-118
[9]   Pseudo-Zernike Moments Based Sparse Representations for SAR Image Classification [J].
Gishkori, Shahzad ;
Mulgrew, Bernard .
IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 2019, 55 (02) :1037-1044
[10]   A new computational approach for edge-preserving image decomposition [J].
Gupta, Bhupendra ;
Singh, Anuj Kumar .
MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (15) :19527-19546