PMED-Net: Pyramid Based Multi-Scale Encoder-Decoder Network for Medical Image Segmentation

被引:19
|
作者
Khan, Abbas [1 ,2 ]
Kim, Hyongsuk [1 ,2 ]
Chua, Leon [3 ]
机构
[1] Jeonbuk Natl Univ, Div Elect & Informat Engn, Jeonju 54896, South Korea
[2] Jeonbuk Natl Univ, Core Res Inst Intelligent Robots, Jeonju 54896, South Korea
[3] Univ Calif Berkeley, Dept Elect Engn & Comp Sci, Berkeley, CA 94720 USA
基金
新加坡国家研究基金会;
关键词
Image segmentation; Decoding; Feature extraction; Medical diagnostic imaging; Training; Diseases; Deep learning; Convolutional neural networks; encoder-decoder architecture; medical image processing; semantic segmentation;
D O I
10.1109/ACCESS.2021.3071754
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A pyramidical multi-scale encoder-decoder network, namely PMED-Net, is proposed for medical image segmentation. Different variants of encoder-decoder networks are in practice for segmenting the medical images and U-Net is the most widely used one. However, the existing architectures for segmenting medical images have millions of parameters that require enormous computations which results in memory and cost-inefficiency. To overcome such limitations, we come up with the idea of training small networks in a cascaded form for coarse-to-fine prediction. The proposed adaptive network is extended up to six pyramid levels, and at each level, features are extracted at different scales of the input image. Each lightweight encoder-decoder network is trained independently to minimize loss, where succeeding level networks further refine the prior predictions. Evaluation and comparison of our architecture were performed on four different publicly available medical image segmentation datasets: International Skin Imaging Collaboration (ISIC) challenge 2018 dataset, brain tumor dataset, nuclei dataset, and X-ray dataset. The experimental results of the PMED-Net are either better or on par with other state-of-the-art networks in terms of IoU, F1-Score, and sensitivity metrics. Moreover, PMED-Net is efficient in terms of parameterized complexity as it has 1/21.3, 1/21.1, 1/14.0, 1/11.6, 1/11.2, 1/6.64, and 1/4.95 times fewer parameters than SegNet, U-Net, BCDU-Net, CU-Net, FCN-8s, ORED-Net, and MultiResUNet respectively. The pre-trained models, datasets information, and implementation details are available at https://github.com/kabbas570/Pyramid-Based-Encoder-Decoder.
引用
收藏
页码:55988 / 55998
页数:11
相关论文
共 50 条
  • [31] Multi-scale fusion residual encoder-decoder approach for low illumination image enhancement
    Pan Xiaoying
    Wei Miao
    Wang Hao
    Jia Fengzhu
    TheJournalofChinaUniversitiesofPostsandTelecommunications, 2022, 29 (02) : 63 - 72
  • [32] Multi-scale fusion residual encoder-decoder approach for low illumination image enhancement
    Xiaoying P.
    Miao W.
    Hao W.
    Fengzhü J.
    Journal of China Universities of Posts and Telecommunications, 2022, 29 (02): : 63 - 72
  • [33] A Multi-Scale Fusion Residual Encoder-Decoder Approach for Low Illumination Image Enhancement
    Pan X.
    Wei M.
    Wang H.
    Jia F.
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2022, 34 (01): : 104 - 112
  • [34] DAMS: Document Image Steganography with Dual Attention Multi-scale Encoder-Decoder Architecture
    Li, Kaijiang
    Qin, Yi
    Wang, Peisen
    Guo, Chunyi
    Wang, Junqi
    Jia, Ruiyang
    Jiang, Wenfeng
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT II, 2025, 15032 : 118 - 131
  • [35] Motion U-Net: Multi-cue Encoder-Decoder Network for Motion Segmentation
    Rahmon, Gani
    Bunyak, Filiz
    Seetharaman, Guna
    Palaniappan, Kannappan
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 8125 - 8132
  • [36] Remote Sensing LiDAR and Hyperspectral Classification with Multi-Scale Graph Encoder-Decoder Network
    Wang, Fang
    Du, Xingqian
    Zhang, Weiguang
    Nie, Liang
    Wang, Hu
    Zhou, Shun
    Ma, Jun
    REMOTE SENSING, 2024, 16 (20)
  • [37] SAR IMAGES ENHANCEMENT VIA DEEP MULTI-SCALE ENCODER-DECODER NEURAL NETWORK
    Yang, Xiaqing
    Zhou, Yuanyuan
    Wang, Chen
    Shi, Jun
    2019 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2019), 2019, : 3368 - 3371
  • [38] MCV-UNet: a modified convolution & transformer hybrid encoder-decoder network with multi-scale information fusion for ultrasound image semantic segmentation
    Xu, Zihong
    Wang, Ziyang
    PEERJ COMPUTER SCIENCE, 2024, 10
  • [39] MCV-UNet: a modified convolution & transformer hybrid encoder-decoder network with multi-scale information fusion for ultrasound image semantic segmentation
    Xu, Zihong
    Wang, Ziyang
    PEERJ COMPUTER SCIENCE, 2024, 10
  • [40] Rethinking the Encoder-decoder Structure in Medical Image Segmentation from Releasing Decoder Structure
    Ni, Jiajia
    Mu, Wei
    Pan, An
    Chen, Zhengming
    JOURNAL OF BIONIC ENGINEERING, 2024, 21 (03) : 1511 - 1521