Permutation invariant self-attention infused U-shaped transformer for medical image segmentation

被引:0
|
作者
Patil, Sanjeet S. [1 ]
Ramteke, Manojkumar [1 ,2 ]
Rathore, Anurag S. [1 ,2 ]
机构
[1] Indian Inst Technol Delhi, Dept Chem Engn, New Delhi 110016, India
[2] Indian Inst Technol Delhi, Yardi Sch Artificial Intelligence, New Delhi 110016, India
关键词
Permutation invariant attention; Transformers; Segmentation; Medical image analysis; Cardiac MRI; Abdominal CT;
D O I
10.1016/j.neucom.2025.129577
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The size and shape of organs in the human body vary according to factors like genetics, body size, proportions, health, lifestyle, gender, ethnicity, and race. Further, abnormalities due to cancer and chronic diseases also affect the size of organs and tumors. Moreover, the spatial location and area of these organs deviates along the transverse plane (Z plane) of the medical scans. Therefore, the generalizability and robustness of a computer vision framework over medical images can be improved if the framework is also encouraged to learn representations of the target areas regardless of their spatial location in input images. Hence, we propose a novel permutation invariant multi-headed self-attention (PISA) module to reduce a U-shaped transformer-based architecture Swin-UNet's sensitivity towards permutation. We have infused this module in the skip connection of our architecture. We have achieved a mean dice score of 79.25 on the segmentations of 8 abdominal organs, better than most state-of-the-art algorithms. Moreover, we have analyzed the generalizability of our architecture over publicly available multi-sequence cardiac MRI datasets. When tested over a sequence unseen by the model during training, 25.1 % and 9.0 % improvement in dice scores were observed in comparison to the pure-CNN- based algorithm and pure transformer-based architecture, respectively, thereby demonstrating its versatility. Replacing the Self Attention module in a U-shaped transformer architecture with our Permutation Invariant Self Attention module produced noteworthy segmentations over shuffled test images, even though the module was trained solely on normal images. The results demonstrate the enhanced efficiency of the proposed module in imparting attention to target organs irrespective of their spatial positions.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Collaborative transformer U-shaped network for medical image segmentation
    Gao, Yufei
    Zhang, Shichao
    Shi, Lei
    Zhao, Guohua
    Shi, Yucheng
    APPLIED SOFT COMPUTING, 2025, 173
  • [2] Optimization of U-shaped pure transformer medical image segmentation network
    Dan, Yongping
    Jin, Weishou
    Wang, Zhida
    Sun, Changhao
    PEERJ COMPUTER SCIENCE, 2023, 9
  • [3] CSAUNet: A cascade self-attention u-shaped network for precise fundus vessel segmentation
    Huang, Zheng
    Sun, Ming
    Liu, Yuxin
    Wu, Jiajun
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2022, 75
  • [4] U-Net Transformer: Self and Cross Attention for Medical Image Segmentation
    Petit, Olivier
    Thome, Nicolas
    Rambour, Clement
    Themyr, Loic
    Collins, Toby
    Soler, Luc
    MACHINE LEARNING IN MEDICAL IMAGING, MLMI 2021, 2021, 12966 : 267 - 276
  • [5] UDT: U-shaped deformable transformer for subarachnoid haemorrhage image segmentation
    Xie, Wei
    Jin, Lianghao
    Hua, Shiqi
    Sun, Hao
    Sun, Bo
    Tu, Zhigang
    Liu, Jun
    CAAI TRANSACTIONS ON INTELLIGENCE TECHNOLOGY, 2024, 9 (03) : 756 - 768
  • [6] D-former: a U-shaped Dilated Transformer for 3D medical image segmentation
    Wu, Yixuan
    Liao, Kuanlun
    Chen, Jintai
    Wang, Jinhong
    Chen, Danny Z.
    Gao, Honghao
    Wu, Jian
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (02): : 1931 - 1944
  • [7] MBUTransNet: multi-branch U-shaped network fusion transformer architecture for medical image segmentation
    JunBo Qiao
    Xing Wang
    Ji Chen
    MingTao Liu
    International Journal of Computer Assisted Radiology and Surgery, 2023, 18 : 1895 - 1902
  • [8] D-former: a U-shaped Dilated Transformer for 3D medical image segmentation
    Yixuan Wu
    Kuanlun Liao
    Jintai Chen
    Jinhong Wang
    Danny Z. Chen
    Honghao Gao
    Jian Wu
    Neural Computing and Applications, 2023, 35 : 1931 - 1944
  • [9] MBUTransNet: multi-branch U-shaped network fusion transformer architecture for medical image segmentation
    Qiao, JunBo
    Wang, Xing
    Chen, Ji
    Liu, MingTao
    INTERNATIONAL JOURNAL OF COMPUTER ASSISTED RADIOLOGY AND SURGERY, 2023, 18 (10) : 1895 - 1902
  • [10] Research of Self-Attention in Image Segmentation
    Cao, Fude
    Zheng, Chunguang
    Huang, Limin
    Wang, Aihua
    Zhang, Jiong
    Zhou, Feng
    Ju, Haoxue
    Guo, Haitao
    Du, Yuxia
    JOURNAL OF INFORMATION TECHNOLOGY RESEARCH, 2022, 15 (01)