UT-MT: A Semi-Supervised Model of Fusion Transformer for 3D Medical Image Segmentation

被引:1
作者
Liu, Xianchang [1 ]
Liu, Peishun [1 ]
Wang, Jinyu [1 ]
Wang, Qinshuo [1 ]
Guo, Qing [1 ]
Tang, Ruichun [1 ]
机构
[1] Ocean Univ China, Fac Informat Sci & Engn, Qingdao, Peoples R China
来源
2023 8TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND BIG DATA ANALYTICS, ICCCBDA | 2023年
关键词
semi-supervised learning; uncertainty estimation; transformer; 3D medical image segmentation;
D O I
10.1109/ICCCBDA56900.2023.10154641
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The training of 3D medical image segmentation model requires a large amount of labeled data, but the availability of labeled data is difficult, and the scarcity of labeled data makes the prediction quality of unlabeled data cannot be effectively guaranteed. To solve the above problems, the 3D medical image segmentation model UT-MT proposed in this paper combines ViT and CNN, and consists of a student model and a teacher model, and the student model learns the teacher model by minimizing the segmentation loss and consistency loss. By combining the feature learning advantages of CNN and ViT, this method enables the model to enhance the learning ability of taking into account both local and global aspects in feature extraction, and further improves the model accuracy and performance. The evaluation was performed on a public left atrial benchmark dataset, and the results at 10% of the labeled images show that the proposed method improves the Dice coefficient by 6.69% over the fully supervised method and has better segmentation of the boundaries, while our method outperforms five advanced semi-supervised segmentation methods.
引用
收藏
页码:190 / 196
页数:7
相关论文
共 33 条
[1]  
Cai ZW, 2022, Arxiv, DOI arXiv:2208.05688
[2]  
Cao H., 2021, arXiv, DOI DOI 10.48550/ARXIV.2105.05537
[3]  
Chen J., 2021, arXiv, DOI DOI 10.48550/ARXIV.2102.04306
[4]   Deformable Convolutional Networks [J].
Dai, Jifeng ;
Qi, Haozhi ;
Xiong, Yuwen ;
Li, Yi ;
Zhang, Guodong ;
Hu, Han ;
Wei, Yichen .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :764-773
[5]  
Dosovitskiy A, 2021, Arxiv, DOI arXiv:2010.11929
[6]   UTNet: A Hybrid Transformer Architecture for Medical Image Segmentation [J].
Gao, Yunhe ;
Zhou, Mu ;
Metaxas, Dimitris N. .
MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT III, 2021, 12903 :61-71
[7]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778
[8]   Rethinking Spatial Dimensions of Vision Transformers [J].
Heo, Byeongho ;
Yun, Sangdoo ;
Han, Dongyoon ;
Chun, Sanghyuk ;
Choe, Junsuk ;
Oh, Seong Joon .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :11916-11925
[9]   Densely Connected Convolutional Networks [J].
Huang, Gao ;
Liu, Zhuang ;
van der Maaten, Laurens ;
Weinberger, Kilian Q. .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :2261-2269
[10]  
Huang HM, 2020, INT CONF ACOUST SPEE, P1055, DOI [10.1109/icassp40776.2020.9053405, 10.1109/ICASSP40776.2020.9053405]