HmsU-Net: A hybrid multi-scale U-net based on a CNN and transformer for medical image segmentation

被引：13

作者：

Fu, Bangkang ^{[1
,2
]}

Peng, Yunsong ^{[2
]}

He, Junjie ^{[2
]}

Tian, Chong ^{[2
]}

Sun, Xinhuan ^{[2
]}

Wang, Rongpin ^{[2
,3
]}

机构：

[1] Guizhou Univ, Med Coll, Guiyang 550000, Guizhou, Peoples R China

[2] Guizhou Prov Peoples Hosp, Dept Radiol, Key Lab Intelligent Med Imaging Anal & Accurate D, Int Exemplary Cooperat Base Precis Imaging Diag &, Guiyang 550002, Peoples R China

[3] Guizhou Prov Peoples Hosp, Dept Med Imaging, Int Exemplary Cooperat Base Precis Imaging Diag &, 83 Zhongshan East Rd, Guiyang 550002, Guizhou, Peoples R China

来源：

COMPUTERS IN BIOLOGY AND MEDICINE | 2024年 / 170卷

基金：

中国国家自然科学基金;

关键词：

Multi -scale features; Transformer; U; -net; Medical image segmentation; Convolution neural network;

D O I：

10.1016/j.compbiomed.2024.108013

中图分类号：

Q [生物科学];

学科分类号：

07 ; 0710 ; 09 ;

摘要：

Accurate medical image segmentation is of great significance for subsequent diagnosis and analysis. The acquisition of multi-scale information plays an important role in segmenting regions of interest of different sizes. With the emergence of Transformers, numerous networks adopted hybrid structures incorporating Transformers and CNNs to learn multi-scale information. However, the majority of research has focused on the design and composition of CNN and Transformer structures, neglecting the inconsistencies in feature learning between Transformer and CNN. This oversight has resulted in the hybrid network's performance not being fully realized. In this work, we proposed a novel hybrid multi-scale segmentation network named HmsU-Net, which effectively fused multi-scale features. Specifically, HmsU-Net employed a parallel design incorporating both CNN and Transformer architectures. To address the inconsistency in feature learning between CNN and Transformer within the same stage, we proposed the multi-scale feature fusion module. For feature fusion across different stages, we introduced the cross-attention module. Comprehensive experiments conducted on various datasets demonstrate that our approach surpasses current state-of-the-art methods.

引用

页数：10

共 52 条

[1] Azad R., 2022, Medical Image Segmentation Review: the Success of U-Net, V1-38
[2] Cao Hu, 2023, Computer Vision - ECCV 2022 Workshops: Proceedings. Lecture Notes in Computer Science (13803), P205, DOI 10.1007/978-3-031-25066-8_9
[3] Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation
Chen, Liang-Chieh
Zhu, Yukun
Papandreou, George
Schroff, Florian
Adam, Hartwig
[J]. COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 : 833 - 851
[4] Codella N, 2019, Arxiv, DOI arXiv:1902.03368
[5] Clinically applicable deep learning for diagnosis and referral in retinal disease
De Fauw, Jeffrey
Ledsam, Joseph R.
Romera-Paredes, Bernardino
Nikolov, Stanislav
Tomasev, Nenad
Blackwell, Sam
Askham, Harry
Glorot, Xavier
O'Donoghue, Brendan
Visentin, Daniel
van den Driessche, George
Lakshminarayanan, Balaji
Meyer, Clemens
Mackinder, Faith
Bouton, Simon
Ayoub, Kareem
Chopra, Reena
King, Dominic
Karthikesalingam, Alan
Hughes, Cian O.
Raine, Rosalind
Hughes, Julian
Sim, Dawn A.
Egan, Catherine
Tufail, Adnan
Montgomery, Hugh
Hassabis, Demis
Rees, Geraint
Back, Trevor
Khaw, Peng T.
Suleyman, Mustafa
Cornebise, Julien
Keane, Pearse A.
Ronneberger, Olaf
[J]. NATURE MEDICINE, 2018, 24 (09) : 1342 - +
[6] Dosovitskiy A, 2021, Arxiv, DOI arXiv:2010.11929
[7] The Importance of Skip Connections in Biomedical Image Segmentation
Drozdzal, Michal
Vorontsov, Eugene
Chartrand, Gabriel
Kadoury, Samuel
Pal, Chris
[J]. DEEP LEARNING AND DATA LABELING FOR MEDICAL APPLICATIONS, 2016, 10008 : 179 - 187
[8] Medical Image Segmentation based on U-Net: A Review
Du, Getao
Cao, Xu
Liang, Jimin
Chen, Xueli
Zhan, Yonghua
[J]. JOURNAL OF IMAGING SCIENCE AND TECHNOLOGY, 2020, 64 (02)
[9] U-Net: deep learning for cell counting, detection, and morphometry
Falk, Thorsten
Mai, Dominic
Bensch, Robert
Cicek, Oezguen
Abdulkadir, Ahmed
Marrakchi, Yassine
Boehm, Anton
Deubner, Jan
Jaeckel, Zoe
Seiwald, Katharina
Dovzhenko, Alexander
Tietz, Olaf
Dal Bosco, Cristina
Walsh, Sean
Saltukoglu, Deniz
Tay, Tuan Leng
Prinz, Marco
Palme, Klaus
Simons, Matias
Diester, Ilka
Brox, Thomas
Ronneberger, Olaf
[J]. NATURE METHODS, 2019, 16 (01) : 67 - +
[10] Multi-Organ Segmentation Over Partially Labeled Datasets With Multi-Scale Feature Abstraction
Fang, Xi
Yan, Pingkun
[J]. IEEE TRANSACTIONS ON MEDICAL IMAGING, 2020, 39 (11) : 3619 - 3629

← 1 2 3 4 5 6 →