Transformer-based heart organ segmentation using a novel axial attention and fusion mechanism

Times cited: 0
Authors
Addo, Addae Emmanuel [1]
Gedeon, Kashala Kabe [1, 2]
Liu, Zhe [1]
Affiliations
[1] Jiangsu Univ, Sch Comp Sci & Telecommun Engn, Zhenjiang, Peoples R China
[2] Jiangsu Univ, Sch Comp Sci & Telecommun Engn, Zhenjiang 212013, Peoples R China
Funding
China Postdoctoral Science Foundation; National Natural Science Foundation of China;
Keywords
Transformers; unet; heart-segmentation; long range dependencies; spatial encoding; positional encoding; axial attention; computed tomography (CT);
DOI
10.1080/13682199.2023.2198394
Chinese Library Classification number
TB8 [Photographic technology];
Discipline classification code
0804;
Abstract
Recent research on transformer-based models has highlighted particular methods for medical image segmentation. At the same time, most transformer-based network designs used in computer vision applications have a large number of parameters and demand extensive training datasets. Inspired by the success of transformers in recent research, the unet-transformer approach has become one of the de facto approaches to overcoming these challenges. In this manuscript, a novel unet-transformer approach is proposed for heart image segmentation to address large parameter counts, limited training data, over-segmentation, sensitivity to noise, and long training times. The framework incorporates a novel width-wise and height-wise axial attention mechanism to effectively provide positional embeddings and encode spatial flattening. Furthermore, a novel local and global spatial attention mechanism is proposed to effectively learn the local and global interactions between encoder features. Finally, we introduce a mechanism to fuse both contexts for better feature representation and preparation for the decoder. The results demonstrate that our prototype provides a robust novel axial-attention mechanism.
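The abstract does not give the details of the proposed attention design, so the following is only a minimal sketch of generic axial attention (self-attention applied along one spatial axis at a time, with a positional embedding added along that axis). It is not the authors' implementation. The function names `softmax` and `axial_attention`, the single-head formulation, the weight shapes, and the additive positional embedding `pos` are all assumptions made for illustration.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def axial_attention(x, wq, wk, wv, pos, axis):
    """Single-head self-attention along one spatial axis (illustrative only).

    x:   feature map of shape (H, W, C)
    wq, wk, wv: projection matrices of shape (C, C) (hypothetical parameters)
    pos: positional embedding of shape (L, C), where L is H for axis=0
         and W for axis=1 (an assumed additive encoding)
    axis: 0 attends along height (within each column),
          1 attends along width (within each row)
    """
    # Arrange the data as (batch_of_lines, L, C) so each line is a sequence.
    seq = np.swapaxes(x, 0, 1) if axis == 0 else x
    seq = seq + pos  # broadcast the (L, C) embedding over all lines
    q, k, v = seq @ wq, seq @ wk, seq @ wv
    scores = q @ np.swapaxes(k, -1, -2) / np.sqrt(q.shape[-1])
    out = softmax(scores, axis=-1) @ v
    # Restore the original (H, W, C) layout.
    return np.swapaxes(out, 0, 1) if axis == 0 else out

# Usage: height-wise attention followed by width-wise attention.
rng = np.random.default_rng(0)
H, W, C = 4, 5, 8
x = rng.normal(size=(H, W, C))
wq, wk, wv = (rng.normal(size=(C, C)) * 0.1 for _ in range(3))
pos_h = rng.normal(size=(H, C)) * 0.1
pos_w = rng.normal(size=(W, C)) * 0.1
y = axial_attention(x, wq, wk, wv, pos_h, axis=0)
y = axial_attention(y, wq, wk, wv, pos_w, axis=1)
print(y.shape)
```

Attending along one axis at a time keeps each attention matrix at size H×H or W×W rather than (HW)×(HW), which is the usual motivation for axial attention when parameters and training data are limited.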
Pages: 121-139
Page count: 19