Permutation invariant self-attention infused U-shaped transformer for medical image segmentation

被引:0
作者
Patil, Sanjeet S. [1 ]
Ramteke, Manojkumar [1 ,2 ]
Rathore, Anurag S. [1 ,2 ]
机构
[1] Indian Inst Technol Delhi, Dept Chem Engn, New Delhi 110016, India
[2] Indian Inst Technol Delhi, Yardi Sch Artificial Intelligence, New Delhi 110016, India
关键词
Permutation invariant attention; Transformers; Segmentation; Medical image analysis; Cardiac MRI; Abdominal CT;
D O I
10.1016/j.neucom.2025.129577
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The size and shape of organs in the human body vary according to factors like genetics, body size, proportions, health, lifestyle, gender, ethnicity, and race. Further, abnormalities due to cancer and chronic diseases also affect the size of organs and tumors. Moreover, the spatial location and area of these organs deviates along the transverse plane (Z plane) of the medical scans. Therefore, the generalizability and robustness of a computer vision framework over medical images can be improved if the framework is also encouraged to learn representations of the target areas regardless of their spatial location in input images. Hence, we propose a novel permutation invariant multi-headed self-attention (PISA) module to reduce a U-shaped transformer-based architecture Swin-UNet's sensitivity towards permutation. We have infused this module in the skip connection of our architecture. We have achieved a mean dice score of 79.25 on the segmentations of 8 abdominal organs, better than most state-of-the-art algorithms. Moreover, we have analyzed the generalizability of our architecture over publicly available multi-sequence cardiac MRI datasets. When tested over a sequence unseen by the model during training, 25.1 % and 9.0 % improvement in dice scores were observed in comparison to the pure-CNN- based algorithm and pure transformer-based architecture, respectively, thereby demonstrating its versatility. Replacing the Self Attention module in a U-shaped transformer architecture with our Permutation Invariant Self Attention module produced noteworthy segmentations over shuffled test images, even though the module was trained solely on normal images. The results demonstrate the enhanced efficiency of the proposed module in imparting attention to target organs irrespective of their spatial positions.
引用
收藏
页数:13
相关论文
共 47 条
  • [41] MS-Former: Multi-Scale Self-Guided Transformer for Medical Image Segmentation
    Karimijafarbigloo, Sanaz
    Azad, Reza
    Kazerouni, Amirhossein
    Merhof, Dorit
    MEDICAL IMAGING WITH DEEP LEARNING, VOL 227, 2023, 227 : 680 - 694
  • [42] MDA-Unet: A Multi-Scale Dilated Attention U-Net for Medical Image Segmentation
    Amer, Alyaa
    Lambrou, Tryphon
    Ye, Xujiong
    APPLIED SCIENCES-BASEL, 2022, 12 (07):
  • [43] DUDA-Net: a double U-shaped dilated attention network for automatic infection area segmentation in COVID-19 lung CT images
    Xie, Feng
    Huang, Zheng
    Shi, Zhengjin
    Wang, Tianyu
    Song, Guoli
    Wang, Bolun
    Liu, Zihong
    INTERNATIONAL JOURNAL OF COMPUTER ASSISTED RADIOLOGY AND SURGERY, 2021, 16 (09) : 1425 - 1434
  • [44] Automatic measurement of fetal abdomen subcutaneous soft tissue thickness from ultrasound image based on a U-shaped attention network with morphological method
    Yuan, Zhenming
    Xu, Tianhao
    Yu, Cheng
    Ye, Xiaojun
    Zhang, Jian
    INTERNATIONAL JOURNAL OF IMAGING SYSTEMS AND TECHNOLOGY, 2024, 34 (01)
  • [45] DUDA-Net: a double U-shaped dilated attention network for automatic infection area segmentation in COVID-19 lung CT images
    Feng Xie
    Zheng Huang
    Zhengjin Shi
    Tianyu Wang
    Guoli Song
    Bolun Wang
    Zihong Liu
    International Journal of Computer Assisted Radiology and Surgery, 2021, 16 : 1425 - 1434
  • [46] AttResDU-Net: Medical Image Segmentation Using Attention-based Residual Double U-Net
    Khan, Akib Mohammed
    Ashrafee, Alif
    Khan, Fahim Shahriar
    Hasan, Md. Bakhtiar
    Kabir, Md. Hasanul
    2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [47] A multilevel self-attention based segmentation and classification technique using Directional Hexagonal Mixed Pattern algorithm for lung nodule detection in thoracic CT image
    Jeniba, J. Sahaya
    Milton, A.
    INTERNATIONAL JOURNAL OF IMAGING SYSTEMS AND TECHNOLOGY, 2022, 32 (05) : 1496 - 1510