HT-Net: hierarchical context-attention transformer network for medical ct image segmentation

被引:24
作者
Ma, Mingjun [1 ]
Xia, Haiying [1 ]
Tan, Yumei [2 ]
Li, Haisheng [1 ]
Song, Shuxiang [1 ]
机构
[1] Guangxi Normal Univ, Coll Elect Engn, Guilin 541004, Peoples R China
[2] Guangxi Normal Univ, Sch Comp Sci & Engn, Guilin 541004, Peoples R China
基金
中国国家自然科学基金;
关键词
Transformer; Medical image segmentation; Context-attention;
D O I
10.1007/s10489-021-03010-0
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Convolutional neural networks (CNNs) have been a prevailing technique in the field of medical CT image processing. Although encoder-decoder CNNs exploit locality for efficiency, they cannot adequately model remote pixel relationships. Recent works prove it possible to stack self-attention or transformer layers to effectively learn long-range dependencies. Transformers have been extended to computer vision tasks by creating and treating image patches as embeddings. However, transformer-based architectures lack global semantic information interaction and require large-scale dataset for training, making it difficult to effectively train with limited data samples. To address these issues, we propose a hierarchical context-attention transformer network (HT-Net), which integrates the multi-scale, transformer and hierarchical context extraction modules in skip-connections. The multi-scale module captures richer CT semantic information, enabling transformers to better encode feature maps of tokenized image patches from different stages of CNN as input attention sequences.The hierarchical context attention module complements global information and re-weights the pixels to capture semantic context. Extensive experiments on three datasets demonstrate that the proposed HT-Net outperforms state-of-the-art approaches.
引用
收藏
页码:10692 / 10705
页数:14
相关论文
共 50 条
  • [1] HT-Net: hierarchical context-attention transformer network for medical ct image segmentation
    Mingjun Ma
    Haiying Xia
    Yumei Tan
    Haisheng Li
    Shuxiang Song
    Applied Intelligence, 2022, 52 : 10692 - 10705
  • [2] MC-Net: multi-scale context-attention network for medical CT image segmentation
    Xia, Haiying
    Ma, Mingjun
    Li, Haisheng
    Song, Shuxiang
    APPLIED INTELLIGENCE, 2022, 52 (02) : 1508 - 1519
  • [3] MC-Net: multi-scale context-attention network for medical CT image segmentation
    Haiying Xia
    Mingjun Ma
    Haisheng Li
    Shuxiang Song
    Applied Intelligence, 2022, 52 : 1508 - 1519
  • [4] HT-Net: A Hybrid Transformer Network for Fundus Vessel Segmentation
    Hu, Xiaolong
    Wang, Liejun
    Li, Yongming
    SENSORS, 2022, 22 (18)
  • [5] DSGA-Net: Deeply separable gated transformer and attention strategy for medical image segmentation network
    Sun, Junding
    Zhao, Jiuqiang
    Wu, Xiaosheng
    Tang, Chaosheng
    Wang, Shuihua
    Zhang, Yudong
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2023, 35 (05)
  • [6] Hierarchical volumetric transformer with comprehensive attention for medical image segmentation
    Zhang, Zhuang
    Luo, Wenjie
    MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2023, 20 (02) : 3177 - 3190
  • [7] CT-Net: Asymmetric compound branch Transformer for medical image segmentation
    Zhang, Ning
    Yu, Long
    Zhang, Dezhi
    Wu, Weidong
    Tian, Shengwei
    Kang, Xiaojing
    Li, Min
    NEURAL NETWORKS, 2024, 170 : 298 - 311
  • [8] A context hierarchical integrated network for medical image segmentation?
    Xie, Xiwang
    Pan, Xipeng
    Zhang, Weidong
    An, Jubai
    COMPUTERS & ELECTRICAL ENGINEERING, 2022, 101
  • [9] LET-Net: locally enhanced transformer network for medical image segmentation
    Ta, Na
    Chen, Haipeng
    Liu, Xianzhu
    Jin, Nuo
    MULTIMEDIA SYSTEMS, 2023, 29 (06) : 3847 - 3861
  • [10] LET-Net: locally enhanced transformer network for medical image segmentation
    Na Ta
    Haipeng Chen
    Xianzhu Liu
    Nuo Jin
    Multimedia Systems, 2023, 29 (6) : 3847 - 3861