Permutation invariant self-attention infused U-shaped transformer for medical image segmentation

被引:0
|
作者
Patil, Sanjeet S. [1 ]
Ramteke, Manojkumar [1 ,2 ]
Rathore, Anurag S. [1 ,2 ]
机构
[1] Indian Inst Technol Delhi, Dept Chem Engn, New Delhi 110016, India
[2] Indian Inst Technol Delhi, Yardi Sch Artificial Intelligence, New Delhi 110016, India
关键词
Permutation invariant attention; Transformers; Segmentation; Medical image analysis; Cardiac MRI; Abdominal CT;
D O I
10.1016/j.neucom.2025.129577
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The size and shape of organs in the human body vary according to factors like genetics, body size, proportions, health, lifestyle, gender, ethnicity, and race. Further, abnormalities due to cancer and chronic diseases also affect the size of organs and tumors. Moreover, the spatial location and area of these organs deviates along the transverse plane (Z plane) of the medical scans. Therefore, the generalizability and robustness of a computer vision framework over medical images can be improved if the framework is also encouraged to learn representations of the target areas regardless of their spatial location in input images. Hence, we propose a novel permutation invariant multi-headed self-attention (PISA) module to reduce a U-shaped transformer-based architecture Swin-UNet's sensitivity towards permutation. We have infused this module in the skip connection of our architecture. We have achieved a mean dice score of 79.25 on the segmentations of 8 abdominal organs, better than most state-of-the-art algorithms. Moreover, we have analyzed the generalizability of our architecture over publicly available multi-sequence cardiac MRI datasets. When tested over a sequence unseen by the model during training, 25.1 % and 9.0 % improvement in dice scores were observed in comparison to the pure-CNN- based algorithm and pure transformer-based architecture, respectively, thereby demonstrating its versatility. Replacing the Self Attention module in a U-shaped transformer architecture with our Permutation Invariant Self Attention module produced noteworthy segmentations over shuffled test images, even though the module was trained solely on normal images. The results demonstrate the enhanced efficiency of the proposed module in imparting attention to target organs irrespective of their spatial positions.
引用
收藏
页数:13
相关论文
共 50 条
  • [41] DW-SCA Unet: medical image segmentation based on depth-wise separable convolutional attention U-shaped network
    Zhou, Yi
    Tian, Wei
    Zhang, Yichi
    Wang, Chuzheng
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (03) : 8893 - 8910
  • [42] DW-SCA Unet: medical image segmentation based on depth-wise separable convolutional attention U-shaped network
    Yi Zhou
    Wei Tian
    Yichi Zhang
    Chuzheng Wang
    Multimedia Tools and Applications, 2024, 83 : 8893 - 8910
  • [43] Decomformer: Decompose Self-Attention of Transformer for Efficient Image Restoration
    Lee, Eunho
    Hwang, Youngbae
    IEEE ACCESS, 2024, 12 : 38672 - 38684
  • [44] ENHANCING TONGUE REGION SEGMENTATION THROUGH SELF-ATTENTION AND TRANSFORMER BASED
    Song, Yihua
    Li, Can
    Zhang, Xia
    Liu, Zhen
    Song, Ningning
    Zhou, Zuojian
    JOURNAL OF MECHANICS IN MEDICINE AND BIOLOGY, 2024, 24 (02)
  • [45] U-Shaped Transformer With Frequency-Band Aware Attention for Speech Enhancement
    Li, Yi
    Sun, Yang
    Wang, Wenwu
    Naqvi, Syed Mohsen
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, 31 : 1511 - 1521
  • [46] A deep supervised transformer U-shaped full-resolution residual network for the segmentation of breast ultrasound image
    Zhou, Jiale
    Hou, Zuoxun
    Lu, Hongyan
    Wang, Wenhan
    Zhao, Wanchen
    Wang, Zenan
    Zheng, Dezhi
    Wang, Shuai
    Tang, Wenzhong
    Qu, Xiaolei
    MEDICAL PHYSICS, 2023, 50 (12) : 7513 - 7524
  • [47] Studying the Effects of Self-Attention for Medical Image Analysis
    Rao, Adrit
    Park, Jongchan
    Woo, Sanghyun
    Lee, Joon-Young
    Aalami, Oliver
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2021), 2021, : 3409 - 3418
  • [48] Skin lesion image segmentation based on improved U-shaped network
    Zhao, Yuhang
    Yan, Tianxing
    Yilihamu, Yaermaimaiti
    INTERNATIONAL JOURNAL OF INTELLIGENT ROBOTICS AND APPLICATIONS, 2024, 8 (03) : 609 - 618
  • [49] ISC-TRANSUNET: MEDICAL IMAGE SEGMENTATION NETWORK BASED ON THE INTEGRATION OF SELF-ATTENTION AND CONVOLUTION
    Li, Fang
    Pei, Siyu
    Zhang, Ziqun
    Yang, Fuming
    JOURNAL OF MECHANICS IN MEDICINE AND BIOLOGY, 2023, 23 (09)
  • [50] MCI Net: Mamba- Convolutional lightweight self-attention medical image segmentation network
    Zhang, Yelin
    Wang, Guanglei
    Ma, Pengchong
    Li, Yan
    BIOMEDICAL PHYSICS & ENGINEERING EXPRESS, 2025, 11 (01):