Transformer-based heart organ segmentation using a novel axial attention and fusion mechanism

被引：0

作者：

Addo, Addae Emmanuel ^{[1
]}

Gedeon, Kashala Kabe ^{[1
,2
]}

Liu, Zhe ^{[1
]}

机构：

[1] Jiangsu Univ, Sch Comp Sci & Telecommun Engn, Zhenjiang, Peoples R China

[2] Jiangsu Univ, Sch Comp Sci & Telecommun Engn, Zhenjiang 212013, Peoples R China

来源：

IMAGING SCIENCE JOURNAL | 2024年 / 72卷 / 01期

基金：

中国博士后科学基金; 中国国家自然科学基金;

关键词：

Transformers; unet; heart-segmentation; long range dependencies; spatial encoding; positional encoding; axial attention; computed tomography (CT);

D O I：

10.1080/13682199.2023.2198394

中图分类号：

TB8 [摄影技术];

学科分类号：

0804 ;

摘要：

Recent research on transformer-based models have highlighted particular methods for medical image segmentation. Additionally, the majority of transformer-based network designs used in computer vision applications have a significant number of parameters and demand extensive training datasets. Inspired by the success of transformers in recent researches, the unet-transformer approach has become one of the de-facto ideas in overcoming the above challenges. In this manuscript, a novel unet-transformer approach was proposed for heart image segmentation to solve parameters, limited dataset, over segmentation, sensitivity noise and higher training time problems. A framework in which a novel width and height wise axial attention mechanism is incorporated into the design to effectively give positional embeddings and encode spatial flattening. Furthermore, a novel local and global spatial attention mechanism is proposed to effectively learn the local and global interactions between encoder features. Finally, we introduce a mechanism to fuse both contexts for better feature representation and preparation into the decoder. The results demonstrate that our prototype provides a robust novel axial-attention mechanism.

引用

页码：121 / 139

页数：19

共 31 条

[1] Cao Hu, 2023, Computer Vision - ECCV 2022 Workshops: Proceedings. Lecture Notes in Computer Science (13803), P205, DOI 10.1007/978-3-031-25066-8_9
[2] Carion N., 2020, EUR C COMP VIS, DOI [DOI 10.1007/978-3-030-58452-813, 10.48550/arXiv. 2005.12872, DOI 10.48550/ARXIV.2005.12872]
[3] Chen B., 2021, TransAttUnet: multi-level attention-guided U-net with transformer for medical image segmentation
[4] Chen J, 2021, VIT V NET VISION TRA
[5] Chen J., 2021, CoRR
[6] ResUNet-a: A deep learning framework for semantic segmentation of remotely sensed data
Diakogiannis, Foivos, I
Waldner, Francois
Caccetta, Peter
Wu, Chen
[J]. ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2020, 162 (162) : 94 - 114
[7] Dosovitskiy A., IMAGE WORTH 16 16 WO
[8] Monarch butterfly optimization: A comprehensive review
Feng, Yanhong
Deb, Suash
Wang, Gai-Ge
Alavi, Amir H.
[J]. EXPERT SYSTEMS WITH APPLICATIONS, 2021, 168
[9] Huiyu Wang, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12349), P108, DOI 10.1007/978-3-030-58548-8_7
[10] Novel Distant Domain Transfer Learning Method for COVID-19 Classification from X-rays Images
Kabe, Gedeon Kashala
Song, Yuqing
Liu, Zhe
[J]. 5TH INTERNATIONAL CONFERENCE ON ALGORITHMS, COMPUTING AND SYSTEMS, ICACS 2021, 2021, : 127 - 134

← 1 2 3 4 →