MDHT-Net: Multi-scale Deformable U-Net with Cos-spatial and Channel Hybrid Transformer for pancreas segmentation

被引:0
作者
Wang, HuiFang [1 ]
Yang, DaWei [2 ,3 ]
Zhu, Yu [1 ]
Liu, YaTong [1 ]
Lin, JiaJun [1 ]
机构
[1] East China Univ Sci & Technol, Sch Informat Sci & Engn, Shanghai 200237, Peoples R China
[2] Fudan Univ, Zhongshan Hosp, Dept Pulm & Crit Care Med, Shanghai 200032, Peoples R China
[3] Shanghai Engn Res Ctr Internet Things Resp Med, Shanghai, Peoples R China
关键词
Pancreas segmentation; Deformable convolution; Transformer; Multi-scale information; CONVOLUTIONAL NEURAL-NETWORKS; ATTENTION; LOCALIZATION;
D O I
10.1007/s10489-024-05780-9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Accurate pancreas segmentation is essential for the diagnosis of pancreas disease, while it is still challenging due to the variable structure and small size of the pancreas. In this paper, we propose a Multi-scale Deformable U-Net with Cos-spatial and Channel Hybrid Transformer (MDHT-Net) for pancreas segmentation. To mitigate the ambiguity between the codec stages, the Cos-spatial and Channel Hybrid Transformer (CCHT) module is designed as a novel skip connection, enhancing the network's ability to perceive spatial information and reveal the inter-channel relationships within different layers' features. Furthermore, the CCHT efficiently aggregates multi-stage contextual information by improving the self-attention mechanism in two different manners, overcoming the limitation of computational complexity. In addition, to comprehensively understand deep semantic information, the Multi-scale Feature Adaptive-extraction (MFA) module is proposed to dynamically enhance the network's receptive field by integrating the pancreas characteristics of scale variations. The experimental results present that our proposed MDHT-Net achieves superior performance compared to other existing state-of-the-art methods on two public pancreas datasets, with the mean Dice coefficient of 91.07 +/- 1.19\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$91.07\pm 1.19$$\end{document}% for NIH and 91.52 +/- 0.66\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$91.52\pm 0.66$$\end{document}% for MSD, respectively. Given the effectiveness and advantages of our proposed MDHT-Net, it is expected to be a potential tool to assist clinicians in detecting pancreas disease and making reasonable treatment plans.
引用
收藏
页码:12272 / 12292
页数:21
相关论文
共 52 条
  • [1] Convolutional Neural Network for Head Segmentation and Counting in Crowded Retail Environment Using Top-view Depth Images
    Abed, Almustafa
    Akrout, Belhassen
    Amous, Ikram
    [J]. ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2024, 49 (03) : 3735 - 3749
  • [2] Alom M.Z., 2018, arXiv
  • [3] Deep Frequency Re-calibration U-Net for Medical Image Segmentation
    Azad, Reza
    Bozorgpour, Afshin
    Asadi-Aghbolaghi, Maryam
    Merhof, Dorit
    Escalera, Sergio
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2021), 2021, : 3267 - 3276
  • [4] Cao Hu, 2023, Computer Vision - ECCV 2022 Workshops: Proceedings. Lecture Notes in Computer Science (13803), P205, DOI 10.1007/978-3-031-25066-8_9
  • [5] Pancreas segmentation by two-view feature learning and multi-scale supervision
    Chen, Haipeng
    Liu, Yunjie
    Shi, Zenan
    Lyu, Yingda
    [J]. BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2022, 74
  • [6] Chen Jieneng, 2021, arXiv
  • [7] CTUNet: automatic pancreas segmentation using a channel-wise transformer and 3D U-Net
    Chen, Lifang
    Wan, Li
    [J]. VISUAL COMPUTER, 2023, 39 (11) : 5229 - 5243
  • [8] Target-aware U-Net with fuzzy skip connections for refined pancreas segmentation
    Chen, Yufei
    Xu, Chang
    Ding, Weiping
    Sun, Shichen
    Yue, Xiaodong
    Fujita, Hamido
    [J]. APPLIED SOFT COMPUTING, 2022, 131
  • [9] TD-Net: Trans-Deformer network for automatic pancreas segmentation
    Dai, Shunbo
    Zhu, Yu
    Jiang, Xiaoben
    Yu, Fuli
    Lin, Jiajun
    Yang, Dawei
    [J]. NEUROCOMPUTING, 2023, 517 : 279 - 293
  • [10] A Two-Phase Approach using Mask R-CNN and 3D U-Net for High-Accuracy Automatic Segmentation of Pancreas in CT Imaging
    Dogan, Ramazan Ozgur
    Dogan, Hulya
    Bayrak, Coskun
    Kayikcioglu, Temel
    [J]. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2021, 207