HmsU-Net: A hybrid multi-scale U-net based on a CNN and transformer for medical image segmentation

被引:13
作者
Fu, Bangkang [1 ,2 ]
Peng, Yunsong [2 ]
He, Junjie [2 ]
Tian, Chong [2 ]
Sun, Xinhuan [2 ]
Wang, Rongpin [2 ,3 ]
机构
[1] Guizhou Univ, Med Coll, Guiyang 550000, Guizhou, Peoples R China
[2] Guizhou Prov Peoples Hosp, Dept Radiol, Key Lab Intelligent Med Imaging Anal & Accurate D, Int Exemplary Cooperat Base Precis Imaging Diag &, Guiyang 550002, Peoples R China
[3] Guizhou Prov Peoples Hosp, Dept Med Imaging, Int Exemplary Cooperat Base Precis Imaging Diag &, 83 Zhongshan East Rd, Guiyang 550002, Guizhou, Peoples R China
基金
中国国家自然科学基金;
关键词
Multi -scale features; Transformer; U; -net; Medical image segmentation; Convolution neural network;
D O I
10.1016/j.compbiomed.2024.108013
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Accurate medical image segmentation is of great significance for subsequent diagnosis and analysis. The acquisition of multi-scale information plays an important role in segmenting regions of interest of different sizes. With the emergence of Transformers, numerous networks adopted hybrid structures incorporating Transformers and CNNs to learn multi-scale information. However, the majority of research has focused on the design and composition of CNN and Transformer structures, neglecting the inconsistencies in feature learning between Transformer and CNN. This oversight has resulted in the hybrid network's performance not being fully realized. In this work, we proposed a novel hybrid multi-scale segmentation network named HmsU-Net, which effectively fused multi-scale features. Specifically, HmsU-Net employed a parallel design incorporating both CNN and Transformer architectures. To address the inconsistency in feature learning between CNN and Transformer within the same stage, we proposed the multi-scale feature fusion module. For feature fusion across different stages, we introduced the cross-attention module. Comprehensive experiments conducted on various datasets demonstrate that our approach surpasses current state-of-the-art methods.
引用
收藏
页数:10
相关论文
共 52 条
  • [1] Azad R., 2022, Medical Image Segmentation Review: the Success of U-Net, V1-38
  • [2] Cao Hu, 2023, Computer Vision - ECCV 2022 Workshops: Proceedings. Lecture Notes in Computer Science (13803), P205, DOI 10.1007/978-3-031-25066-8_9
  • [3] Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation
    Chen, Liang-Chieh
    Zhu, Yukun
    Papandreou, George
    Schroff, Florian
    Adam, Hartwig
    [J]. COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 : 833 - 851
  • [4] Codella N, 2019, Arxiv, DOI arXiv:1902.03368
  • [5] Clinically applicable deep learning for diagnosis and referral in retinal disease
    De Fauw, Jeffrey
    Ledsam, Joseph R.
    Romera-Paredes, Bernardino
    Nikolov, Stanislav
    Tomasev, Nenad
    Blackwell, Sam
    Askham, Harry
    Glorot, Xavier
    O'Donoghue, Brendan
    Visentin, Daniel
    van den Driessche, George
    Lakshminarayanan, Balaji
    Meyer, Clemens
    Mackinder, Faith
    Bouton, Simon
    Ayoub, Kareem
    Chopra, Reena
    King, Dominic
    Karthikesalingam, Alan
    Hughes, Cian O.
    Raine, Rosalind
    Hughes, Julian
    Sim, Dawn A.
    Egan, Catherine
    Tufail, Adnan
    Montgomery, Hugh
    Hassabis, Demis
    Rees, Geraint
    Back, Trevor
    Khaw, Peng T.
    Suleyman, Mustafa
    Cornebise, Julien
    Keane, Pearse A.
    Ronneberger, Olaf
    [J]. NATURE MEDICINE, 2018, 24 (09) : 1342 - +
  • [6] Dosovitskiy A, 2021, Arxiv, DOI arXiv:2010.11929
  • [7] The Importance of Skip Connections in Biomedical Image Segmentation
    Drozdzal, Michal
    Vorontsov, Eugene
    Chartrand, Gabriel
    Kadoury, Samuel
    Pal, Chris
    [J]. DEEP LEARNING AND DATA LABELING FOR MEDICAL APPLICATIONS, 2016, 10008 : 179 - 187
  • [8] Medical Image Segmentation based on U-Net: A Review
    Du, Getao
    Cao, Xu
    Liang, Jimin
    Chen, Xueli
    Zhan, Yonghua
    [J]. JOURNAL OF IMAGING SCIENCE AND TECHNOLOGY, 2020, 64 (02)
  • [9] U-Net: deep learning for cell counting, detection, and morphometry
    Falk, Thorsten
    Mai, Dominic
    Bensch, Robert
    Cicek, Oezguen
    Abdulkadir, Ahmed
    Marrakchi, Yassine
    Boehm, Anton
    Deubner, Jan
    Jaeckel, Zoe
    Seiwald, Katharina
    Dovzhenko, Alexander
    Tietz, Olaf
    Dal Bosco, Cristina
    Walsh, Sean
    Saltukoglu, Deniz
    Tay, Tuan Leng
    Prinz, Marco
    Palme, Klaus
    Simons, Matias
    Diester, Ilka
    Brox, Thomas
    Ronneberger, Olaf
    [J]. NATURE METHODS, 2019, 16 (01) : 67 - +
  • [10] Multi-Organ Segmentation Over Partially Labeled Datasets With Multi-Scale Feature Abstraction
    Fang, Xi
    Yan, Pingkun
    [J]. IEEE TRANSACTIONS ON MEDICAL IMAGING, 2020, 39 (11) : 3619 - 3629