Transformer-based heart organ segmentation using a novel axial attention and fusion mechanism

被引:0
作者
Addo, Addae Emmanuel [1 ]
Gedeon, Kashala Kabe [1 ,2 ]
Liu, Zhe [1 ]
机构
[1] Jiangsu Univ, Sch Comp Sci & Telecommun Engn, Zhenjiang, Peoples R China
[2] Jiangsu Univ, Sch Comp Sci & Telecommun Engn, Zhenjiang 212013, Peoples R China
基金
中国博士后科学基金; 中国国家自然科学基金;
关键词
Transformers; unet; heart-segmentation; long range dependencies; spatial encoding; positional encoding; axial attention; computed tomography (CT);
D O I
10.1080/13682199.2023.2198394
中图分类号
TB8 [摄影技术];
学科分类号
0804 ;
摘要
Recent research on transformer-based models have highlighted particular methods for medical image segmentation. Additionally, the majority of transformer-based network designs used in computer vision applications have a significant number of parameters and demand extensive training datasets. Inspired by the success of transformers in recent researches, the unet-transformer approach has become one of the de-facto ideas in overcoming the above challenges. In this manuscript, a novel unet-transformer approach was proposed for heart image segmentation to solve parameters, limited dataset, over segmentation, sensitivity noise and higher training time problems. A framework in which a novel width and height wise axial attention mechanism is incorporated into the design to effectively give positional embeddings and encode spatial flattening. Furthermore, a novel local and global spatial attention mechanism is proposed to effectively learn the local and global interactions between encoder features. Finally, we introduce a mechanism to fuse both contexts for better feature representation and preparation into the decoder. The results demonstrate that our prototype provides a robust novel axial-attention mechanism.
引用
收藏
页码:121 / 139
页数:19
相关论文
共 31 条
  • [1] Cao Hu, 2023, Computer Vision - ECCV 2022 Workshops: Proceedings. Lecture Notes in Computer Science (13803), P205, DOI 10.1007/978-3-031-25066-8_9
  • [2] Carion N., 2020, EUR C COMP VIS, DOI [DOI 10.1007/978-3-030-58452-813, 10.48550/arXiv. 2005.12872, DOI 10.48550/ARXIV.2005.12872]
  • [3] Chen B., 2021, TransAttUnet: multi-level attention-guided U-net with transformer for medical image segmentation
  • [4] Chen J, 2021, VIT V NET VISION TRA
  • [5] Chen J., 2021, CoRR
  • [6] ResUNet-a: A deep learning framework for semantic segmentation of remotely sensed data
    Diakogiannis, Foivos, I
    Waldner, Francois
    Caccetta, Peter
    Wu, Chen
    [J]. ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2020, 162 (162) : 94 - 114
  • [7] Dosovitskiy A., IMAGE WORTH 16 16 WO
  • [8] Monarch butterfly optimization: A comprehensive review
    Feng, Yanhong
    Deb, Suash
    Wang, Gai-Ge
    Alavi, Amir H.
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2021, 168
  • [9] Huiyu Wang, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12349), P108, DOI 10.1007/978-3-030-58548-8_7
  • [10] Novel Distant Domain Transfer Learning Method for COVID-19 Classification from X-rays Images
    Kabe, Gedeon Kashala
    Song, Yuqing
    Liu, Zhe
    [J]. 5TH INTERNATIONAL CONFERENCE ON ALGORITHMS, COMPUTING AND SYSTEMS, ICACS 2021, 2021, : 127 - 134