MSDUNet: A Model Based on Feature Multi-Scale and Dual-Input Dynamic Enhancement for Skin Lesion Segmentation

Cited by: 0

Authors:

Li, Xiaosen ^{[1,2]}

Li, Linli ^{[3]}

Xing, Xinlong ^{[4,5]}

Liao, Huixian ^{[1]}

Wang, Wenji ^{[1]}

Dong, Qiutong ^{[4,5]}

Qin, Xiao ^{[1]}

Yuan, Chang'an ^{[6]}

Affiliations:

[1] Nanning Normal Univ, Guangxi Key Lab Human Machine Interact & Intellige, Nanning 530100, Peoples R China

[2] Guangxi Minzu Univ, Sch Artificial Intelligence, Nanning 530006, Peoples R China

[3] Chengdu Inst Biol Prod Co Ltd, Chengdu 610000, Peoples R China

[4] Wenzhou Med Univ, Postgrad Training Base Alliance, Wenzhou 325000, Peoples R China

[5] Univ Chinese Acad Sci, Wenzhou Inst, Wenzhou 325000, Peoples R China

[6] Guangxi Acad Sci, Nanning 530100, Peoples R China

Source:

IEEE TRANSACTIONS ON MEDICAL IMAGING | 2025 / Vol. 44 / No. 07

Keywords:

Image segmentation; Transformers; Decoding; Convolution; Lesions; Feature extraction; Skin; Semantics; Kernel; Adaptation models; Hybrid architecture; skin lesion segmentation; multi-scale method; deformable convolution; feature enhancement; NETWORKS;

DOI:

10.1109/TMI.2025.3549011

Chinese Library Classification (CLC) number:

TP39 [Computer applications];

Discipline classification code:

081203 ; 0835 ;

Abstract:

Melanoma is a malignant tumor originating from the lesions of skin cells. Segmentation of skin lesions in medical images plays a crucial role in quantitative analysis, yet achieving precise and efficient segmentation remains a significant challenge for medical practitioners. Hence, a skin lesion segmentation model named MSDUNet is proposed, which incorporates a multi-scale deformable block (MSD Block) and a dual-input dynamic enhancement module (D2M). First, the model employs a hybrid-architecture encoder that better integrates global and local features. Second, to better utilize macroscopic and microscopic multi-scale information, the skip connections and decoder blocks are improved by introducing the D2M and the MSD Block. The D2M applies large-kernel dilated convolution to the decoder features to extract an attention bias matrix, which supplements and enhances the semantic content of the lower-layer features passed to the decoder through the skip connections, thereby compensating for semantic gaps. The MSD Block uses a channel-wise split and deformable convolutions with varying receptive fields to better extract and integrate multi-scale information while controlling the model's size, enabling the decoder to focus more on task-relevant regions and edge details. MSDUNet attains Dice scores of 93.08% and 91.68% on the ISIC-2016 and ISIC-2018 datasets, respectively. Furthermore, experiments on the HAM10000 dataset yield a Dice score of 95.40%. External validation on the PH2 dataset, using the weights trained on ISIC-2016, ISIC-2018, and HAM10000, yields Dice scores of 92.67%, 92.31%, and 93.46%, respectively, indicating strong generalization capability. The code implementation is publicly available on GitHub.
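The results above are all reported as Dice scores. For readers unfamiliar with the metric, the following is a minimal sketch of the standard Dice coefficient for binary masks, Dice = 2|P ∩ T| / (|P| + |T|). It is not taken from the MSDUNet code; the function name `dice_score`, the nested-list mask representation, and the convention of returning 1.0 for two empty masks are assumptions made for illustration.

```python
def dice_score(pred, target):
    """Dice coefficient between two binary masks.

    Illustrative helper (hypothetical name, not from the paper's code).
    Masks are nested lists of 0/1 values with identical shapes.
    """
    # Flatten both masks and coerce every entry to 0 or 1.
    flat_pred = [int(bool(v)) for row in pred for v in row]
    flat_true = [int(bool(v)) for row in target for v in row]
    if len(flat_pred) != len(flat_true):
        raise ValueError("masks must contain the same number of pixels")

    # |P ∩ T|: pixels marked as lesion in both masks.
    intersection = sum(p & t for p, t in zip(flat_pred, flat_true))
    # |P| + |T|: total lesion pixels across the two masks.
    total = sum(flat_pred) + sum(flat_true)

    # Assumed convention: two empty masks agree perfectly.
    if total == 0:
        return 1.0
    return 2.0 * intersection / total


if __name__ == "__main__":
    predicted = [[0, 1, 1],
                 [0, 1, 0]]
    reference = [[0, 1, 0],
                 [0, 1, 1]]
    # Two overlapping pixels, three lesion pixels in each mask.
    print(f"Dice: {dice_score(predicted, reference):.4f}")
```

A reported score such as 93.08% corresponds to this ratio, expressed as a percentage and aggregated over a test set; how the paper aggregates it (per-image mean or pooled pixels) is not stated in the abstract.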


Pages: 2819-2830

Page count: 12

