Permutation invariant self-attention infused U-shaped transformer for medical image segmentation

被引：0

作者：

Patil, Sanjeet S. ^{[1
]}

Ramteke, Manojkumar ^{[1
,2
]}

Rathore, Anurag S. ^{[1
,2
]}

机构：

[1] Indian Inst Technol Delhi, Dept Chem Engn, New Delhi 110016, India

[2] Indian Inst Technol Delhi, Yardi Sch Artificial Intelligence, New Delhi 110016, India

来源：

NEUROCOMPUTING | 2025年 / 625卷

关键词：

Permutation invariant attention; Transformers; Segmentation; Medical image analysis; Cardiac MRI; Abdominal CT;

D O I：

10.1016/j.neucom.2025.129577

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The size and shape of organs in the human body vary according to factors like genetics, body size, proportions, health, lifestyle, gender, ethnicity, and race. Further, abnormalities due to cancer and chronic diseases also affect the size of organs and tumors. Moreover, the spatial location and area of these organs deviates along the transverse plane (Z plane) of the medical scans. Therefore, the generalizability and robustness of a computer vision framework over medical images can be improved if the framework is also encouraged to learn representations of the target areas regardless of their spatial location in input images. Hence, we propose a novel permutation invariant multi-headed self-attention (PISA) module to reduce a U-shaped transformer-based architecture Swin-UNet's sensitivity towards permutation. We have infused this module in the skip connection of our architecture. We have achieved a mean dice score of 79.25 on the segmentations of 8 abdominal organs, better than most state-of-the-art algorithms. Moreover, we have analyzed the generalizability of our architecture over publicly available multi-sequence cardiac MRI datasets. When tested over a sequence unseen by the model during training, 25.1 % and 9.0 % improvement in dice scores were observed in comparison to the pure-CNN- based algorithm and pure transformer-based architecture, respectively, thereby demonstrating its versatility. Replacing the Self Attention module in a U-shaped transformer architecture with our Permutation Invariant Self Attention module produced noteworthy segmentations over shuffled test images, even though the module was trained solely on normal images. The results demonstrate the enhanced efficiency of the proposed module in imparting attention to target organs irrespective of their spatial positions.

引用

页数：13

共 47 条

[1] D-former: a U-shaped Dilated Transformer for 3D medical image segmentation
Wu, Yixuan
Liao, Kuanlun
Chen, Jintai
Wang, Jinhong
Chen, Danny Z.
Gao, Honghao
Wu, Jian
NEURAL COMPUTING & APPLICATIONS, 2023, 35 (02) : 1931 - 1944
[2] D-former: a U-shaped Dilated Transformer for 3D medical image segmentation
Yixuan Wu
Kuanlun Liao
Jintai Chen
Jinhong Wang
Danny Z. Chen
Honghao Gao
Jian Wu
Neural Computing and Applications, 2023, 35 : 1931 - 1944
[3] U-Net Transformer: Self and Cross Attention for Medical Image Segmentation
Petit, Olivier
Thome, Nicolas
Rambour, Clement
Themyr, Loic
Collins, Toby
Soler, Luc
MACHINE LEARNING IN MEDICAL IMAGING, MLMI 2021, 2021, 12966 : 267 - 276
[4] MBUTransNet: multi-branch U-shaped network fusion transformer architecture for medical image segmentation
Qiao, JunBo
Wang, Xing
Chen, Ji
Liu, MingTao
INTERNATIONAL JOURNAL OF COMPUTER ASSISTED RADIOLOGY AND SURGERY, 2023, 18 (10) : 1895 - 1902
[5] MBUTransNet: multi-branch U-shaped network fusion transformer architecture for medical image segmentation
JunBo Qiao
Xing Wang
Ji Chen
MingTao Liu
International Journal of Computer Assisted Radiology and Surgery, 2023, 18 : 1895 - 1902
[6] A U-Shaped Convolution-Aided Transformer with Double Attention for Hyperspectral Image Classification
Qin, Ruiru
Wang, Chuanzhi
Wu, Yongmei
Du, Huafei
Lv, Mingyun
REMOTE SENSING, 2024, 16 (02)
[7] RockFormer: A U-Shaped Transformer Network for Martian Rock Segmentation
Liu, Haiqiang
Yao, Meibao
Xiao, Xueming
Xiong, Yonggang
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
[8] Dual Channel-Spatial Self-Attention Transformer and CNN synergy network for 3D medical image segmentation
Yang, Fan
Wang, Bo
APPLIED SOFT COMPUTING, 2024, 167
[9] Weakly supervised histopathology image segmentation with self-attention
Li, Kailu
Qian, Ziniu
Han, Yingnan
Chang, Eric I-Chao
Wei, Bingzheng
Lai, Maode
Liao, Jing
Fan, Yubo
Xu, Yan
MEDICAL IMAGE ANALYSIS, 2023, 86
[10] LUCF-Net: Lightweight U-Shaped Cascade Fusion Network for Medical Image Segmentation
She, Qingshan
Sun, Songkai
Ma, Yuliang
Li, Rihui
Zhang, Yingchun
IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2025, 29 (03) : 2088 - 2099

← 1 2 3 4 5 →