Advances in attention mechanisms for medical image segmentation

被引:0
作者
Zhang, Jianpeng [1 ]
Chen, Xiaomin [4 ]
Yang, Bing [2 ]
Guan, Qingbiao [2 ]
Chen, Qi [3 ]
Chen, Jian [4 ]
Wu, Qi [3 ]
Xie, Yutong [3 ]
Xia, Yong [2 ]
机构
[1] Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou, Peoples R China
[2] Northwestern Polytech Univ, Sch Comp Sci & Engn, Natl Engn Lab Integrated Aero Space Ground Ocean B, Xian, Peoples R China
[3] Univ Adelaide, Adelaide, Australia
[4] South China Univ Technol, Guangzhou, Peoples R China
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
Medical image segmentation; Attention mechanism; Transformer; Mamba; Deep learning; CONVOLUTIONAL NEURAL-NETWORK; SWIN TRANSFORMER; MULTIORGAN; FIELD;
D O I
10.1016/j.cosrev.2024.100721
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Medical image segmentation plays an important role in computer-aided diagnosis. Attention mechanisms that distinguish important parts from irrelevant parts have been widely used in medical image segmentation tasks. This paper systematically reviews the basic principles of attention mechanisms and their applications in medical image segmentation. First, we review the basic concepts of attention mechanism and formulation. Second, we surveyed about 200 articles related to medical image segmentation, and divided them into three groups based on their attention mechanisms, Pre-Transformer attention, Transformer attention and Mamba-related attention. In each group, we deeply analyze the attention mechanisms from three aspects based on the current literature work, i.e., the principle of the mechanism (what to use), implementation methods (how to use), and application tasks (where to use). We also thoroughly analyzed the advantages and limitations of their applications to different tasks. Finally, we summarize the current state of research and shortcomings in the field, and discuss the potential challenges in the future, including task specificity, robustness, standard evaluation, etc. We hope that this review can showcase the overall research context of traditional, Transformer and Mamba attention methods, provide a clear reference for subsequent research, and inspire more advanced attention research, not only in medical image segmentation, but also in other image analysis scenarios. Finally, we maintain the paper list and open-source code at here.
引用
收藏
页数:18
相关论文
共 178 条
  • [1] Xu K., Ba J., Kiros R., Cho K., Courville A., Salakhudinov R., Zemel R., Bengio Y., Show, attend and tell: Neural image caption generation with visual attention, International Conference on Machine Learning, pp. 2048-2057, (2015)
  • [2] Vaswani A., Shazeer N., Parmar N., Uszkoreit J., Jones L., Gomez A.N., Kaiser L., Polosukhin I., Attention is all you need, Adv. Neural Inf. Process. Syst., 30, (2017)
  • [3] Hu J., Shen L., Sun G., Squeeze-and-excitation networks, pp. 7132-7141, (2018)
  • [4] Lin F., Xia Y., Song S., Ravikumar N., Frangi A.F., High-throughput 3Dra segmentation of brain vasculature and aneurysms using deep learning, Comput. Methods Programs Biomed., 230, (2023)
  • [5] Wang G., Shapey J., Li W., Dorent R., Dimitriadis A., Bisdas S., Paddick I., Bradford R., Zhang S., Ourselin S., Et al., Automatic segmentation of vestibular schwannoma from T2-weighted MRI by deep spatial attention with hardness-weighted loss, Medical Image Computing and Computer Assisted Intervention–MICCAI 2019: 22nd International Conference, Shenzhen, China, October 13–17, 2019, Proceedings, Part II 22, pp. 264-272, (2019)
  • [6] Dou H., Karimi D., Rollins C.K., Ortinau C.M., Vasung L., Velasco-Annis C., Ouaalam A., Yang X., Ni D., Gholipour A., A deep attentive convolutional neural network for automatic cortical plate segmentation in fetal MRI, IEEE Trans. Med. Imaging, 40, 4, pp. 1123-1133, (2020)
  • [7] Huang J., Li H., Li G., Wan X., Attentive symmetric autoencoder for brain MRI segmentation, International Conference on Medical Image Computing and Computer-Assisted Intervention, pp. 203-213, (2022)
  • [8] Li X., Li M., Yan P., Li G., Jiang Y., Luo H., Yin S., Deep learning attention mechanism in medical image analysis: Basics and beyonds, Int. J. Netw. Dyn. Intell., pp. 93-116, (2023)
  • [9] Wang Y., Zhou Y., Shen W., Park S., Fishman E.K., Yuille A.L., Abdominal multi-organ segmentation with organ-attention networks and statistical fusion, Med. Image Anal., 55, pp. 88-102, (2019)
  • [10] Oktay O., Schlemper J., Folgoc L.L., Lee M., Heinrich M., Misawa K., Mori K., McDonagh S., Hammerla N.Y., Kainz B., Et al., (2018)