Multi-level Feature Attention Network for medical image segmentation

被引:1
作者
Zhang, Yaning [1 ]
Yin, Jianjian [1 ]
Gu, Yanhui [1 ]
Chen, Yi [1 ]
机构
[1] Nanjing Normal Univ, Sch Comp & Elect Informat Artificial Intelligence, Nanjing 210023, Peoples R China
关键词
Medical image segmentation; Swin Transformer; Cross-connection multi-level attention; Pyramid collaborative attention; UNET;
D O I
10.1016/j.eswa.2024.125785
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Network architectures deriving from the Unet framework and its convolutional neural network variants have garnered significant attention for their impressive feats in computer vision. However, the shallow-level details and deep-level semantic information are underutilized in these methods, leading to the model's inability to adequately localize target regions. In this paper, we put forward a Multi-level Feature Attention Network, a novel method that cross-connects encoder and decoder features and focuses on multi-scale semantic features. Firstly, we extend UperNet using a hierarchical Swin Transformer with shifted windows, giving the network global modeling capabilities. Secondly, we introduce a Cross-connection Multi-level Attention module that connects encoder and decoder to refine the decoder's output features and supplement detailed information. Finally, we employ a Pyramid Collaborative Attention (PCA) module to mine the encoder's deepest semantic features across multiple scales. Our method establishes state-of-the-art performance on the ACDC, ISIC2017 and BUSI datasets, showcasing its exceptional capability in segmenting medical images.
引用
收藏
页数:10
相关论文
共 59 条
  • [1] Dataset of breast ultrasound images
    Al-Dhabyani, Walid
    Gomaa, Mohammed
    Khaled, Hussien
    Fahmy, Aly
    [J]. DATA IN BRIEF, 2020, 28
  • [2] Asadi-Aghbolaghi M, 2020, Arxiv, DOI arXiv:2003.05056
  • [3] Bi-Directional ConvLSTM U-Net with Densley Connected Convolutions
    Azad, Reza
    Asadi-Aghbolaghi, Maryam
    Fathy, Mahmood
    Escalera, Sergio
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 406 - 415
  • [4] Deep Learning Techniques for Automatic MRI Cardiac Multi-Structures Segmentation and Diagnosis: Is the Problem Solved?
    Bernard, Olivier
    Lalande, Alain
    Zotti, Clement
    Cervenansky, Frederick
    Yang, Xin
    Heng, Pheng-Ann
    Cetin, Irem
    Lekadir, Karim
    Camara, Oscar
    Gonzalez Ballester, Miguel Angel
    Sanroma, Gerard
    Napel, Sandy
    Petersen, Steffen
    Tziritas, Georgios
    Grinias, Elias
    Khened, Mahendra
    Kollerathu, Varghese Alex
    Krishnamurthi, Ganapathy
    Rohe, Marc-Michel
    Pennec, Xavier
    Sermesant, Maxime
    Isensee, Fabian
    Jaeger, Paul
    Maier-Hein, Klaus H.
    Full, Peter M.
    Wolf, Ivo
    Engelhardt, Sandy
    Baumgartner, Christian F.
    Koch, Lisa M.
    Wolterink, Jelmer M.
    Isgum, Ivana
    Jang, Yeonggul
    Hong, Yoonmi
    Patravali, Jay
    Jain, Shubham
    Humbert, Olivier
    Jodoin, Pierre-Marc
    [J]. IEEE TRANSACTIONS ON MEDICAL IMAGING, 2018, 37 (11) : 2514 - 2525
  • [5] Dense-UNet: a novel multiphoton in vivo cellular image segmentation model based on a convolutional neural network
    Cai, Sijing
    Tian, Yunxian
    Lui, Harvey
    Zeng, Haishan
    Wu, Yi
    Chen, Guannan
    [J]. QUANTITATIVE IMAGING IN MEDICINE AND SURGERY, 2020, 10 (06) : 1275 - 1285
  • [6] Cao Hu, 2023, Computer Vision - ECCV 2022 Workshops: Proceedings. Lecture Notes in Computer Science (13803), P205, DOI 10.1007/978-3-031-25066-8_9
  • [7] CrossViT: Cross-Attention Multi-Scale Vision Transformer for Image Classification
    Chen, Chun-Fu
    Fan, Quanfu
    Panda, Rameswar
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 347 - 356
  • [8] Rethinking the unpretentious U-net for medical ultrasound image segmentation
    Chen, Gongping
    Li, Lei
    Zhang, Jianxun
    Dai, Yu
    [J]. PATTERN RECOGNITION, 2023, 142
  • [9] Codella NCF, 2018, I S BIOMED IMAGING, P168, DOI 10.1109/ISBI.2018.8363547
  • [10] Ms RED: A novel multi-scale residual encoding and decoding network for skin lesion segmentation
    Dai, Duwei
    Dong, Caixia
    Xu, Songhua
    Yan, Qingsen
    Li, Zongfang
    Zhang, Chunyan
    Luo, Nana
    [J]. MEDICAL IMAGE ANALYSIS, 2022, 75