ERDUnet: An Efficient Residual Double-Coding Unet for Medical Image Segmentation

被引:28
作者
Li, Hao [1 ]
Zhai, Di-Hua [1 ,2 ]
Xia, Yuanqing [1 ]
机构
[1] Beijing Inst Technol, Sch Automat, Beijing 100081, Peoples R China
[2] Beijing Inst Technol, Yangtze Delta Reg Acad, Jiaxing 314001, Peoples R China
基金
中国国家自然科学基金;
关键词
Medical image segmentation; deep learning; encoder-decoder network; convolutional neural network; reduce parameter scale; U-NET; ATTENTION;
D O I
10.1109/TCSVT.2023.3300846
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Medical image segmentation is widely used in clinical diagnosis, and methods based on convolutional neural networks have been able to achieve high accuracy. However, it is still difficult to extract global context features, and the parameters are too large to be clinically applied. In this regard, we propose a novel network structure to improve the traditional encoder-decoder network model, which saves parameters while maintaining segmentation accuracy. We improve the feature extraction efficiency by constructing an encoder module that can simultaneously extract local features and global continuity information. A novel attention module is designed to optimize segmentation boundary regions while improving training efficiency. The feature transfer structure of the decoding part is also improved, which fully integrates the features of different levels to restore the spatial resolution more finely. We evaluate our model on seven different medical segmentation datasets, the 2018 Data Science Bowl Challenge (DSBC2018), the 2018 Lesion Boundary Segmentation Challenge (ISIC2018), the Gland Segmentation in Colon Histology Images Challenge (GlaS), Kvasir-SEG, CVC-ClinicDB, Kvasir-Instrument and Polypgen. Extensive experimental results show that our model can achieve good segmentation performance while maintaining a small number of parameters and computational load, which can further facilitate the generalization of the theoretical approach to clinical practice. Our code will be released at https://github.com/caijilia/ERDUnet.
引用
收藏
页码:2083 / 2096
页数:14
相关论文
共 55 条
[41]   The HAM10000 dataset, a large collection of multi-source dermatoscopic images of common pigmented skin lesions [J].
Tschandl, Philipp ;
Rosendahl, Cliff ;
Kittler, Harald .
SCIENTIFIC DATA, 2018, 5
[42]   Medical Transformer: Gated Axial-Attention for Medical Image Segmentation [J].
Valanarasu, Jeya Maria Jose ;
Oza, Poojan ;
Hacihaliloglu, Ilker ;
Patel, Vishal M. .
MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT I, 2021, 12901 :36-46
[43]  
Valanarasu JMJ, 2022, Arxiv, DOI [arXiv:2203.04967, 10.48550/arXiv.2203.04967, DOI 10.48550/ARXIV.2203.04967, 10.48550/arxiv.2203.04967]
[44]  
Wang HN, 2022, AAAI CONF ARTIF INTE, P2441
[45]   Wide Weighted Attention Multi-Scale Network for Accurate MR Image Super-Resolution [J].
Wang, Haoqian ;
Hu, Xiaowan ;
Zhao, Xiaole ;
Zhang, Yulun .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (03) :962-975
[46]   Stepwise Feature Fusion: Local Guides Global [J].
Wang, Jinfeng ;
Huang, Qiming ;
Tang, Feilong ;
Meng, Jia ;
Su, Jionglong ;
Song, Sifan .
MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2022, PT III, 2022, 13433 :110-120
[47]   Deep High-Resolution Representation Learning for Visual Recognition [J].
Wang, Jingdong ;
Sun, Ke ;
Cheng, Tianheng ;
Jiang, Borui ;
Deng, Chaorui ;
Zhao, Yang ;
Liu, Dong ;
Mu, Yadong ;
Tan, Mingkui ;
Wang, Xinggang ;
Liu, Wenyu ;
Xiao, Bin .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (10) :3349-3364
[48]   Multimodal medical image segmentation using multi-scale context- aware network [J].
Wang, Xue ;
Li, Zhanshan ;
Huang, Yongping ;
Jiao, Yingying .
NEUROCOMPUTING, 2022, 486 :135-146
[49]   CBAM: Convolutional Block Attention Module [J].
Woo, Sanghyun ;
Park, Jongchan ;
Lee, Joon-Young ;
Kweon, In So .
COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 :3-19
[50]  
Xie EZ, 2021, ADV NEUR IN, V34