HMSAM-UNet: A Hierarchical Multi-Scale Attention Module-Based Convolutional Neural Network for Improved CT Image Segmentation

被引:1
作者
Liu, Na [1 ]
Lu, Zhonghua [1 ]
Lian, Wenyong [2 ]
Tian, Min [1 ]
Ma, Chiyue [1 ]
Peng, Lijuan [3 ]
机构
[1] Shihezi Univ, Coll Mech & Elect Engn, Shihezi 832000, Peoples R China
[2] Xinjiang Prod & Construct Corps, Gen Hosp Div 3, Tumxuk 659003, Xinjiang, Peoples R China
[3] Shihezi Univ, Coll Sci, Shihezi 832000, Peoples R China
来源
IEEE ACCESS | 2024年 / 12卷
关键词
CT image segmentation; hierarchical attention mechanism; inception module; multi-scale; critical region perception;
D O I
10.1109/ACCESS.2024.3401669
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
While modern deep learning methods have made significant progress in medical image segmentation, some challenges remain, including accurately capturing features at multiple scales, limited ability to detect critical regions, and susceptibility to noise and background interference. To address these challenges, a new neural network called HMSAM-UNet is introduced in this work. A novel module, the Hierarchical Multi-Scale Attention Module (HMSAM), was designed in HMSAM-UNet to improve the precision and accuracy of CT image segmentation significantly. Specifically, HMSAM integrates the Hierarchical Attention Mechanism and Inception Module via residual connections. The Hierarchical Attention Mechanism can highlight important regions by learning attention weights, dramatically enhancing the model's ability to perceive critical areas for more accurate localization and segmentation of target structures in CT images. Meanwhile, incorporating the Inception module effectively strengthens the network's capacity to capture multi-scale features, substantially improving the model's ability to comprehend the structural characteristics of CT images. The results show that the average loss achieved by the proposed model has a 50.04% reduction compared to the original U-Net architecture. Furthermore, compared to other deep learning models such as FCN, DeepLabV3, PSPNet, Unet, UNet++, and SegNet, the model proposed in this work attains an average Dice coefficient of 98.72, and an average IoU score of 97.46 on the three datasets, both of which are the highest among all compared models.
引用
收藏
页码:79415 / 79427
页数:13
相关论文
共 37 条
[1]   Literature review: efficient deep neural networks techniques for medical image analysis [J].
Abdou, Mohamed A. .
NEURAL COMPUTING & APPLICATIONS, 2022, 34 (08) :5791-5812
[2]   Medical Image Analysis using Convolutional Neural Networks: A Review [J].
Anwar, Syed Muhammad ;
Majid, Muhammad ;
Qayyum, Adnan ;
Awais, Muhammad ;
Alnowami, Majdi ;
Khan, Muhammad Khurram .
JOURNAL OF MEDICAL SYSTEMS, 2018, 42 (11)
[3]  
Chen BZ, 2022, Arxiv, DOI [arXiv:2107.05274, DOI 10.48550/ARXIV.2107.05274, 10.48550/arxiv.2107.05274]
[4]   A novel convolutional neural network for kidney ultrasound images segmentation [J].
Chen, Gongping ;
Yin, Jingjing ;
Dai, Yu ;
Zhang, Jianxun ;
Yin, Xiaotao ;
Cui, Liang .
COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2022, 218
[5]   Fully convolutional attention network for biomedical image segmentation [J].
Cheng, Junlong ;
Tian, Shengwei ;
Yu, Long ;
Lu, Hongchun ;
Lv, Xiaoyi .
ARTIFICIAL INTELLIGENCE IN MEDICINE, 2020, 107
[6]  
Ding F., 2019, arXiv
[7]  
Diwakar M., 2014, P 3 INT C SOFT COMP, V1, P403, DOI DOI 10.1007/978-81-322-1771-835#CITEAS
[8]   A Novel Lightweight Swin-Unet Network for Semantic Segmentation of COVID-19 Lesion in CT Images [J].
Gao, Zhi-Jun ;
He, Yi ;
Li, Yi .
IEEE ACCESS, 2023, 11 :950-962
[9]   Generative adversarial networks and its applications in the biomedical image segmentation: a comprehensive survey [J].
Iqbal, Ahmed ;
Sharif, Muhammad ;
Yasmin, Mussarat ;
Raza, Mudassar ;
Aftab, Shabib .
INTERNATIONAL JOURNAL OF MULTIMEDIA INFORMATION RETRIEVAL, 2022, 11 (03) :333-368
[10]   Automatic Quantification of Computed Tomography Features in Acute Traumatic Brain Injury [J].
Jain, Saurabh ;
Vande Vyyere, Thijs ;
Terzopoulos, Vasilis ;
Sima, Diana Maria ;
Roura, Eloy ;
Maas, Andrew ;
Wilms, Guido ;
Verheyden, Jan .
JOURNAL OF NEUROTRAUMA, 2019, 36 (11) :1794-1803