Advances in attention mechanisms for medical image segmentation

被引:2
作者
Zhang, Jianpeng [1 ]
Chen, Xiaomin [4 ]
Yang, Bing [2 ]
Guan, Qingbiao [2 ]
Chen, Qi [3 ]
Chen, Jian [4 ]
Wu, Qi [3 ]
Xie, Yutong [3 ]
Xia, Yong [2 ]
机构
[1] Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou, Peoples R China
[2] Northwestern Polytech Univ, Sch Comp Sci & Engn, Natl Engn Lab Integrated Aero Space Ground Ocean B, Xian, Peoples R China
[3] Univ Adelaide, Adelaide, Australia
[4] South China Univ Technol, Guangzhou, Peoples R China
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
Medical image segmentation; Attention mechanism; Transformer; Mamba; Deep learning; CONVOLUTIONAL NEURAL-NETWORK; SWIN TRANSFORMER; MULTIORGAN; FIELD;
D O I
10.1016/j.cosrev.2024.100721
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Medical image segmentation plays an important role in computer-aided diagnosis. Attention mechanisms that distinguish important parts from irrelevant parts have been widely used in medical image segmentation tasks. This paper systematically reviews the basic principles of attention mechanisms and their applications in medical image segmentation. First, we review the basic concepts of attention mechanism and formulation. Second, we surveyed about 200 articles related to medical image segmentation, and divided them into three groups based on their attention mechanisms, Pre-Transformer attention, Transformer attention and Mamba-related attention. In each group, we deeply analyze the attention mechanisms from three aspects based on the current literature work, i.e., the principle of the mechanism (what to use), implementation methods (how to use), and application tasks (where to use). We also thoroughly analyzed the advantages and limitations of their applications to different tasks. Finally, we summarize the current state of research and shortcomings in the field, and discuss the potential challenges in the future, including task specificity, robustness, standard evaluation, etc. We hope that this review can showcase the overall research context of traditional, Transformer and Mamba attention methods, provide a clear reference for subsequent research, and inspire more advanced attention research, not only in medical image segmentation, but also in other image analysis scenarios. Finally, we maintain the paper list and open-source code at here.
引用
收藏
页数:18
相关论文
共 200 条
[1]   Multi-frame Attention Network for Left Ventricle Segmentation in 3D Echocardiography [J].
Ahn, Shawn S. ;
Ta, Kevinminh ;
Thorn, Stephanie ;
Langdon, Jonathan ;
Sinusas, Albert J. ;
Duncan, James S. .
MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT I, 2021, 12901 :348-357
[2]  
Archit A, 2024, Arxiv, DOI [arXiv:2404.07705, 10.48550/arXiv.2404.07705, DOI 10.48550/ARXIV.2404.07705]
[3]   Dual Cross-Attention for medical image segmentation [J].
Ates, Gorkem Can ;
Mohan, Prasoon ;
Celik, Emrah .
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 126
[4]   DAE-Former: Dual Attention-Guided Efficient Transformer for Medical Image Segmentation [J].
Azad, Reza ;
Arimond, Rene ;
Aghdam, Ehsan Khodapanah ;
Kazerouni, Amirhossein ;
Merhof, Dorit .
PREDICTIVE INTELLIGENCE IN MEDICINE, PRIME 2023, 2023, 14277 :83-95
[5]   Advances in medical image analysis with vision Transformers: A review [J].
Azad, Reza ;
Kazerouni, Amirhossein ;
Heidari, Moein ;
Aghdam, Ehsan Khodapanah ;
Molaei, Amirali ;
Jia, Yiwei ;
Jose, Abin ;
Roy, Rijo ;
Merhof, Dorit .
MEDICAL IMAGE ANALYSIS, 2024, 91
[6]   Contextual Attention Network: Transformer Meets U-Net [J].
Azad, Reza ;
Heidari, Moein ;
Wu, Yuli ;
Merhof, Dorit .
MACHINE LEARNING IN MEDICAL IMAGING, MLMI 2022, 2022, 13583 :377-386
[7]   Towards automatic polyp detection with a polyp appearance model [J].
Bernal, J. ;
Sanchez, J. ;
Vilarino, F. .
PATTERN RECOGNITION, 2012, 45 (09) :3166-3182
[8]   WM-DOVA maps for accurate polyp highlighting in colonoscopy: Validation vs. saliency maps from physicians [J].
Bernal, Jorge ;
Javier Sanchez, F. ;
Fernandez-Esparrach, Gloria ;
Gil, Debora ;
Rodriguez, Cristina ;
Vilarino, Fernando .
COMPUTERIZED MEDICAL IMAGING AND GRAPHICS, 2015, 43 :99-111
[9]   Deep Learning Techniques for Automatic MRI Cardiac Multi-Structures Segmentation and Diagnosis: Is the Problem Solved? [J].
Bernard, Olivier ;
Lalande, Alain ;
Zotti, Clement ;
Cervenansky, Frederick ;
Yang, Xin ;
Heng, Pheng-Ann ;
Cetin, Irem ;
Lekadir, Karim ;
Camara, Oscar ;
Gonzalez Ballester, Miguel Angel ;
Sanroma, Gerard ;
Napel, Sandy ;
Petersen, Steffen ;
Tziritas, Georgios ;
Grinias, Elias ;
Khened, Mahendra ;
Kollerathu, Varghese Alex ;
Krishnamurthi, Ganapathy ;
Rohe, Marc-Michel ;
Pennec, Xavier ;
Sermesant, Maxime ;
Isensee, Fabian ;
Jaeger, Paul ;
Maier-Hein, Klaus H. ;
Full, Peter M. ;
Wolf, Ivo ;
Engelhardt, Sandy ;
Baumgartner, Christian F. ;
Koch, Lisa M. ;
Wolterink, Jelmer M. ;
Isgum, Ivana ;
Jang, Yeonggul ;
Hong, Yoonmi ;
Patravali, Jay ;
Jain, Shubham ;
Humbert, Olivier ;
Jodoin, Pierre-Marc .
IEEE TRANSACTIONS ON MEDICAL IMAGING, 2018, 37 (11) :2514-2525
[10]   D-TrAttUnet: Toward hybrid CNN-transformer architecture for generic and subtle segmentation in medical images [J].
Bougourzi F. ;
Dornaika F. ;
Distante C. ;
Taleb-Ahmed A. .
Computers in Biology and Medicine, 2024, 176