CoTrFuse: a novel framework by fusing CNN and transformer for medical image segmentation

被引：21

作者：

Chen, Yuanbin ^{[1
,2
]}

Wang, Tao ^{[1
,2
]}

Tang, Hui ^{[1
,2
]}

Zhao, Longxuan ^{[1
,2
]}

Zhang, Xinlin ^{[1
,2
]}

Tan, Tao ^{[3
]}

Gao, Qinquan ^{[1
,2
]}

Du, Min ^{[1
,2
]}

Tong, Tong ^{[1
,2
]}

机构：

[1] Fuzhou Univ, Coll Phys & Informat Engn, Fuzhou 350116, Peoples R China

[2] Fuzhou Univ, Fujian Key Lab Med Instrumentat & Pharmaceut Techn, Fuzhou 350116, Peoples R China

[3] Macao Polytech Univ, Fac Appl Sci, Macau 999078, Peoples R China

来源：

PHYSICS IN MEDICINE AND BIOLOGY | 2023年 / 68卷 / 17期

基金：

中国国家自然科学基金;

关键词：

medical image segmentation; convolutional neural network; transformer; SKIN-LESION SEGMENTATION; NET; NETWORK;

D O I：

10.1088/1361-6560/acede8

中图分类号：

R318 [生物医学工程];

学科分类号：

0831 ;

摘要：

Medical image segmentation is a crucial and intricate process in medical image processing and analysis. With the advancements in artificial intelligence, deep learning techniques have been widely used in recent years for medical image segmentation. One such technique is the U-Net framework based on the U-shaped convolutional neural networks (CNN) and its variants. However, these methods have limitations in simultaneously capturing both the global and the remote semantic information due to the restricted receptive domain caused by the convolution operation's intrinsic features. Transformers are attention-based models with excellent global modeling capabilities, but their ability to acquire local information is limited. To address this, we propose a network that combines the strengths of bothCNNand Transformer, called CoTrFuse. The proposed CoTrFuse network uses EfficientNet and Swin Transformer as dual encoders. The Swin Transformer andCNN Fusion module are combined to fuse the features of both branches before the skip connection structure. Weevaluated the proposed network on two datasets: the ISIC-2017 challenge dataset and the COVID-QU-Ex dataset. Our experimental results demonstrate that the proposed CoTrFuse outperforms several state-of-the-art segmentation methods, indicating its superiority in medical image segmentation. The codes are available at https://github.com/BinYCn/CoTrFuse.

引用

页数：13

共 56 条

[1] Semantic Segmenation of Pathological Lung Tissue With Dilated Fully Convolutional Networks [J].

Anthimopoulos, Marios ;

Christodoulidis, Stergios ;

Ebner, Lukas ;

Geiser, Thomas ;

Christe, Andreas ;

Mougiakakou, Stavroula .

IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2019, 23 (02) :714-722

[2]

Cao Hu, 2023, Computer Vision - ECCV 2022 Workshops: Proceedings. Lecture Notes in Computer Science (13803), P205, DOI 10.1007/978-3-031-25066-8_9

[3] PCAT-UNet: UNet-like network fused convolution and transformer for retinal vessel segmentation [J].

Chen, Danny ;

Yang, Wenzhong ;

Wang, Liejun ;

Tan, Sixiang ;

Lin, Jiangzhaung ;

Bu, Wenxiu .

PLOS ONE, 2022, 17 (01)

[4]

Chen J, 2021, arXiv

[5]

Chen LC, 2017, Arxiv, DOI arXiv:1706.05587

[6]

Chen SH, 2019, Arxiv, DOI arXiv:1904.00625

[7] Can AI Help in Screening Viral and COVID-19 Pneumonia? [J].

Chowdhury, Muhammad E. H. ;

Rahman, Tawsifur ;

Khandakar, Amith ;

Mazhar, Rashid ;

Kadir, Muhammad Abdul ;

Bin Mahbub, Zaid ;

Islam, Khandakar Reajul ;

Khan, Muhammad Salman ;

Iqbal, Atif ;

Al Emadi, Nasser ;

Reaz, Mamun Bin Ibne ;

Islam, Mohammad Tariqul .

IEEE ACCESS, 2020, 8 :132665-132676

[8]

Cicek Ozgun, 2016, Medical Image Computing and Computer-Assisted Intervention - MICCAI 2016. 19th International Conference. Proceedings: LNCS 9901, P424, DOI 10.1007/978-3-319-46723-8_49

[9]

Codella NCF, 2018, I S BIOMED IMAGING, P168, DOI 10.1109/ISBI.2018.8363547

[10] Machine Learning and Deep Learning in Medical Imaging: Intelligent Imaging [J].

Currie, Geoff ;

Hawk, K. Elizabeth ;

Rohren, Eric ;

Vial, Alanna ;

Klein, Ran .

JOURNAL OF MEDICAL IMAGING AND RADIATION SCIENCES, 2019, 50 (04) :477-487

← 1 2 3 4 5 6 →