A lightweight multi-scale multi-angle dynamic interactive transformer-CNN fusion model for 3D medical image segmentation

Cited by: 1
Authors
Hua, Xin [1]
Du, Zhijiang [1]
Yu, Hongjian [1]
Ma, Jixin [1]
Zheng, Fanjun [2]
Zhang, Chen [2]
Lu, Qiaohui [2]
Zhao, Hui [2]
Affiliations
[1] Harbin Inst Technol, State Key Lab Robot Technol & Syst, Harbin 15000, Heilongjiang, Peoples R China
[2] Chinese Peoples Liberat Army PLA Gen Hosp, Beijing 100853, Peoples R China
Keywords
3D Medical image segmentation; Light-weight; Convolutional Neural Network; Transformer
DOI
10.1016/j.neucom.2024.128417
CLC number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Combining Convolutional Neural Networks (CNNs) and Transformers has become one of the mainstream approaches to three-dimensional (3D) medical image segmentation. However, the complexity and diversity of target forms in 3D medical images require models to capture complex feature information, which results in an excessive number of parameters and hinders training and deployment. We therefore developed a lightweight 3D multi-target semantic segmentation model. To strengthen contextual texture connections and the expression of detailed feature information, we designed a multi-scale, multi-angle feature interaction module that enhances feature representation by interacting multi-scale features from different perspectives. To address attention collapse in Transformers, which leads to the neglect of other detailed features, we used local features as dynamic parameters that interact with global features, dynamically grouping and learning critical features from the global features and thereby improving the model's ability to learn detailed features. While preserving segmentation capability, we kept the model lightweight, with a total of 9.63 M parameters. Extensive experiments were conducted on the public datasets ACDC and BraTS 2018, as well as on a private dataset, Temporal Bone CT. The results indicate that the proposed model is competitive with the latest techniques in 3D medical image segmentation.
Pages: 14