HT-Net: hierarchical context-attention transformer network for medical CT image segmentation

Citations: 30
Authors
Ma, Mingjun [1]
Xia, Haiying [1]
Tan, Yumei [2]
Li, Haisheng [1]
Song, Shuxiang [1]
Affiliations
[1] Guangxi Normal Univ, Coll Elect Engn, Guilin 541004, Peoples R China
[2] Guangxi Normal Univ, Sch Comp Sci & Engn, Guilin 541004, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Transformer; Medical image segmentation; Context-attention
DOI
10.1007/s10489-021-03010-0
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Convolutional neural networks (CNNs) have been a prevailing technique in medical CT image processing. Although encoder-decoder CNNs exploit locality for efficiency, they cannot adequately model relationships between distant pixels. Recent work shows that stacked self-attention or transformer layers can effectively learn long-range dependencies. Transformers have been extended to computer vision tasks by splitting images into patches and treating the patches as embeddings. However, transformer-based architectures lack global semantic information interaction and require large-scale datasets for training, which makes them difficult to train effectively with limited data samples. To address these issues, we propose a hierarchical context-attention transformer network (HT-Net), which integrates multi-scale, transformer, and hierarchical context extraction modules into the skip connections. The multi-scale module captures richer CT semantic information, enabling the transformers to better encode feature maps of tokenized image patches from different CNN stages as input attention sequences. The hierarchical context attention module complements global information and re-weights the pixels to capture semantic context. Extensive experiments on three datasets demonstrate that the proposed HT-Net outperforms state-of-the-art approaches.
Pages: 10692-10705
Page count: 14