Permutation invariant self-attention infused U-shaped transformer for medical image segmentation

被引：0

作者：

Patil, Sanjeet S. ^{[1
]}

Ramteke, Manojkumar ^{[1
,2
]}

Rathore, Anurag S. ^{[1
,2
]}

机构：

[1] Indian Inst Technol Delhi, Dept Chem Engn, New Delhi 110016, India

[2] Indian Inst Technol Delhi, Yardi Sch Artificial Intelligence, New Delhi 110016, India

来源：

NEUROCOMPUTING | 2025年 / 625卷

关键词：

Permutation invariant attention; Transformers; Segmentation; Medical image analysis; Cardiac MRI; Abdominal CT;

D O I：

10.1016/j.neucom.2025.129577

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The size and shape of organs in the human body vary according to factors like genetics, body size, proportions, health, lifestyle, gender, ethnicity, and race. Further, abnormalities due to cancer and chronic diseases also affect the size of organs and tumors. Moreover, the spatial location and area of these organs deviates along the transverse plane (Z plane) of the medical scans. Therefore, the generalizability and robustness of a computer vision framework over medical images can be improved if the framework is also encouraged to learn representations of the target areas regardless of their spatial location in input images. Hence, we propose a novel permutation invariant multi-headed self-attention (PISA) module to reduce a U-shaped transformer-based architecture Swin-UNet's sensitivity towards permutation. We have infused this module in the skip connection of our architecture. We have achieved a mean dice score of 79.25 on the segmentations of 8 abdominal organs, better than most state-of-the-art algorithms. Moreover, we have analyzed the generalizability of our architecture over publicly available multi-sequence cardiac MRI datasets. When tested over a sequence unseen by the model during training, 25.1 % and 9.0 % improvement in dice scores were observed in comparison to the pure-CNN- based algorithm and pure transformer-based architecture, respectively, thereby demonstrating its versatility. Replacing the Self Attention module in a U-shaped transformer architecture with our Permutation Invariant Self Attention module produced noteworthy segmentations over shuffled test images, even though the module was trained solely on normal images. The results demonstrate the enhanced efficiency of the proposed module in imparting attention to target organs irrespective of their spatial positions.

引用

页数：13

共 47 条

[41] MS-Former: Multi-Scale Self-Guided Transformer for Medical Image Segmentation
Karimijafarbigloo, Sanaz
Azad, Reza
Kazerouni, Amirhossein
Merhof, Dorit
MEDICAL IMAGING WITH DEEP LEARNING, VOL 227, 2023, 227 : 680 - 694
[42] MDA-Unet: A Multi-Scale Dilated Attention U-Net for Medical Image Segmentation
Amer, Alyaa
Lambrou, Tryphon
Ye, Xujiong
APPLIED SCIENCES-BASEL, 2022, 12 (07):
[43] DUDA-Net: a double U-shaped dilated attention network for automatic infection area segmentation in COVID-19 lung CT images
Xie, Feng
Huang, Zheng
Shi, Zhengjin
Wang, Tianyu
Song, Guoli
Wang, Bolun
Liu, Zihong
INTERNATIONAL JOURNAL OF COMPUTER ASSISTED RADIOLOGY AND SURGERY, 2021, 16 (09) : 1425 - 1434
[44] Automatic measurement of fetal abdomen subcutaneous soft tissue thickness from ultrasound image based on a U-shaped attention network with morphological method
Yuan, Zhenming
Xu, Tianhao
Yu, Cheng
Ye, Xiaojun
Zhang, Jian
INTERNATIONAL JOURNAL OF IMAGING SYSTEMS AND TECHNOLOGY, 2024, 34 (01)
[45] DUDA-Net: a double U-shaped dilated attention network for automatic infection area segmentation in COVID-19 lung CT images
Feng Xie
Zheng Huang
Zhengjin Shi
Tianyu Wang
Guoli Song
Bolun Wang
Zihong Liu
International Journal of Computer Assisted Radiology and Surgery, 2021, 16 : 1425 - 1434
[46] AttResDU-Net: Medical Image Segmentation Using Attention-based Residual Double U-Net
Khan, Akib Mohammed
Ashrafee, Alif
Khan, Fahim Shahriar
Hasan, Md. Bakhtiar
Kabir, Md. Hasanul
2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
[47] A multilevel self-attention based segmentation and classification technique using Directional Hexagonal Mixed Pattern algorithm for lung nodule detection in thoracic CT image
Jeniba, J. Sahaya
Milton, A.
INTERNATIONAL JOURNAL OF IMAGING SYSTEMS AND TECHNOLOGY, 2022, 32 (05) : 1496 - 1510

← 1 2 3 4 5 →