FTransCNN: Fusing Transformer and a CNN based on fuzzy logic for uncertain medical image segmentation

被引:54
作者
Ding, Weiping [1 ]
Wang, Haipeng [1 ]
Huang, Jiashuang [1 ]
Ju, Hengrong [1 ]
Geng, Yu [1 ]
Lin, Chin-Teng [2 ]
Pedrycz, Witold [3 ]
机构
[1] Nantong Univ, Sch Comp Sci & Technol, Nantong 226019, Peoples R China
[2] Univ Technol Sydney, Fac Engn & Informat Technol, Sch Comp Sci, Sydney, Australia
[3] Univ Alberta, Dept Elect & Comp Engn, Edmonton, AB, Canada
基金
中国博士后科学基金;
关键词
Choquet fuzzy integral; Convolutional neural network (CNN); Fuzzy fusion; Medical image segmentation; Transformer;
D O I
10.1016/j.inffus.2023.101880
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The accurate segmentation of medical images plays a crucial role in diagnosing and treating diseases. Although many methods now use multimodal joint segmentation, the joint use of segmentation features extracted by multiple models can lead to heterogeneity and uncertainty. Unreasonable fusion methods cannot exploit the advantages of multiple models and still lack good performance in segmentation. Therefore, this study proposes the FTransCNN model, which is composed of a convolutional neural network (CNN) and Transformer and is based on a fuzzy fusion strategy that jointly utilizes the features extracted by a CNN and Transformer through a new fuzzy fusion module. First, the CNN and Transformer act as the backbone network for parallel feature extraction. Second, channel attention is used to promote the global key information of Transformer to improve the feature representation ability, and spatial attention is used to enhance the local details of CNN features and suppress irrelevant regions. Third, the proposed model applies the Hadamard product to model fine-grained interactions between the two branches and uses the Choquet fuzzy integral to suppress heterogeneity and uncertainty in fused features. Fourth, FTransCNN employs fuzzy attention fusion module (FAFM) hierarchical upsampling to effectively capture both low-level spatial features and high-level semantic context. Finally, the new model obtains the final segmentation result by using the deconvolution and results in an improvement in segmentation. The experimental results on Chest X-ray and Kvasir-SEG dataset show that FTransCNN has better performance on segmentation tasks than the-state-of-the-art deep segmentation models.
引用
收藏
页数:13
相关论文
共 44 条
[1]   Dataset of breast ultrasound images [J].
Al-Dhabyani, Walid ;
Gomaa, Mohammed ;
Khaled, Hussien ;
Fahmy, Aly .
DATA IN BRIEF, 2020, 28
[2]   A smart healthcare monitoring system for heart disease prediction based on ensemble deep learning and feature fusion [J].
Ali, Farman ;
El-Sappagh, Shaker ;
Islam, S. M. Riazul ;
Kwak, Daehan ;
Ali, Amjad ;
Imran, Muhammad ;
Kwak, Kyung-Sup .
INFORMATION FUSION, 2020, 63 :208-222
[3]   Multimodal feature fusion for CNN-based gait recognition: an empirical comparison [J].
Castro, Francisco M. ;
Marin-Jimenez, Manuel J. ;
Guil, Nicolas ;
de la Blanca, Nicolas .
NEURAL COMPUTING & APPLICATIONS, 2020, 32 (17) :14173-14193
[4]  
Chen J., 2021, arXiv
[5]   Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation [J].
Chen, Liang-Chieh ;
Zhu, Yukun ;
Papandreou, George ;
Schroff, Florian ;
Adam, Hartwig .
COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 :833-851
[6]   Attentional Feature Fusion [J].
Dai, Yimian ;
Gieseke, Fabian ;
Oehmcke, Stefan ;
Wu, Yiquan ;
Barnard, Kobus .
2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WACV 2021, 2021, :3559-3568
[7]   A Hierarchical Fused Fuzzy Deep Neural Network for Data Classification [J].
Deng, Yue ;
Ren, Zhiquan ;
Kong, Youyong ;
Bao, Feng ;
Dai, Qionghai .
IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2017, 25 (04) :1006-1012
[8]   MT-nCov-Net: A Multitask Deep-Learning Framework for Efficient Diagnosis of COVID-19 Using Tomography Scans [J].
Ding, Weiping ;
Abdel-Basset, Mohamed ;
Hawash, Hossam ;
Elkomy, Osama M. .
IEEE TRANSACTIONS ON CYBERNETICS, 2023, 53 (02) :1285-1298
[9]   Memory augmented convolutional neural network and its application in bioimages [J].
Ding, Weiping ;
Ming, Yurui ;
Wang, Yu-Kai ;
Lin, Chin-Teng .
NEUROCOMPUTING, 2021, 466 :128-138
[10]  
Dosovitskiy A, 2021, Arxiv, DOI arXiv:2010.11929