Permutation invariant self-attention infused U-shaped transformer for medical image segmentation

被引:0
作者
Patil, Sanjeet S. [1 ]
Ramteke, Manojkumar [1 ,2 ]
Rathore, Anurag S. [1 ,2 ]
机构
[1] Indian Inst Technol Delhi, Dept Chem Engn, New Delhi 110016, India
[2] Indian Inst Technol Delhi, Yardi Sch Artificial Intelligence, New Delhi 110016, India
关键词
Permutation invariant attention; Transformers; Segmentation; Medical image analysis; Cardiac MRI; Abdominal CT;
D O I
10.1016/j.neucom.2025.129577
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The size and shape of organs in the human body vary according to factors like genetics, body size, proportions, health, lifestyle, gender, ethnicity, and race. Further, abnormalities due to cancer and chronic diseases also affect the size of organs and tumors. Moreover, the spatial location and area of these organs deviates along the transverse plane (Z plane) of the medical scans. Therefore, the generalizability and robustness of a computer vision framework over medical images can be improved if the framework is also encouraged to learn representations of the target areas regardless of their spatial location in input images. Hence, we propose a novel permutation invariant multi-headed self-attention (PISA) module to reduce a U-shaped transformer-based architecture Swin-UNet's sensitivity towards permutation. We have infused this module in the skip connection of our architecture. We have achieved a mean dice score of 79.25 on the segmentations of 8 abdominal organs, better than most state-of-the-art algorithms. Moreover, we have analyzed the generalizability of our architecture over publicly available multi-sequence cardiac MRI datasets. When tested over a sequence unseen by the model during training, 25.1 % and 9.0 % improvement in dice scores were observed in comparison to the pure-CNN- based algorithm and pure transformer-based architecture, respectively, thereby demonstrating its versatility. Replacing the Self Attention module in a U-shaped transformer architecture with our Permutation Invariant Self Attention module produced noteworthy segmentations over shuffled test images, even though the module was trained solely on normal images. The results demonstrate the enhanced efficiency of the proposed module in imparting attention to target organs irrespective of their spatial positions.
引用
收藏
页数:13
相关论文
共 47 条
  • [1] D-former: a U-shaped Dilated Transformer for 3D medical image segmentation
    Wu, Yixuan
    Liao, Kuanlun
    Chen, Jintai
    Wang, Jinhong
    Chen, Danny Z.
    Gao, Honghao
    Wu, Jian
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (02) : 1931 - 1944
  • [2] D-former: a U-shaped Dilated Transformer for 3D medical image segmentation
    Yixuan Wu
    Kuanlun Liao
    Jintai Chen
    Jinhong Wang
    Danny Z. Chen
    Honghao Gao
    Jian Wu
    Neural Computing and Applications, 2023, 35 : 1931 - 1944
  • [3] U-Net Transformer: Self and Cross Attention for Medical Image Segmentation
    Petit, Olivier
    Thome, Nicolas
    Rambour, Clement
    Themyr, Loic
    Collins, Toby
    Soler, Luc
    MACHINE LEARNING IN MEDICAL IMAGING, MLMI 2021, 2021, 12966 : 267 - 276
  • [4] MBUTransNet: multi-branch U-shaped network fusion transformer architecture for medical image segmentation
    Qiao, JunBo
    Wang, Xing
    Chen, Ji
    Liu, MingTao
    INTERNATIONAL JOURNAL OF COMPUTER ASSISTED RADIOLOGY AND SURGERY, 2023, 18 (10) : 1895 - 1902
  • [5] MBUTransNet: multi-branch U-shaped network fusion transformer architecture for medical image segmentation
    JunBo Qiao
    Xing Wang
    Ji Chen
    MingTao Liu
    International Journal of Computer Assisted Radiology and Surgery, 2023, 18 : 1895 - 1902
  • [6] A U-Shaped Convolution-Aided Transformer with Double Attention for Hyperspectral Image Classification
    Qin, Ruiru
    Wang, Chuanzhi
    Wu, Yongmei
    Du, Huafei
    Lv, Mingyun
    REMOTE SENSING, 2024, 16 (02)
  • [7] RockFormer: A U-Shaped Transformer Network for Martian Rock Segmentation
    Liu, Haiqiang
    Yao, Meibao
    Xiao, Xueming
    Xiong, Yonggang
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [8] Dual Channel-Spatial Self-Attention Transformer and CNN synergy network for 3D medical image segmentation
    Yang, Fan
    Wang, Bo
    APPLIED SOFT COMPUTING, 2024, 167
  • [9] Weakly supervised histopathology image segmentation with self-attention
    Li, Kailu
    Qian, Ziniu
    Han, Yingnan
    Chang, Eric I-Chao
    Wei, Bingzheng
    Lai, Maode
    Liao, Jing
    Fan, Yubo
    Xu, Yan
    MEDICAL IMAGE ANALYSIS, 2023, 86
  • [10] LUCF-Net: Lightweight U-Shaped Cascade Fusion Network for Medical Image Segmentation
    She, Qingshan
    Sun, Songkai
    Ma, Yuliang
    Li, Rihui
    Zhang, Yingchun
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2025, 29 (03) : 2088 - 2099