PMED-Net: Pyramid Based Multi-Scale Encoder-Decoder Network for Medical Image Segmentation

Cited by: 19
Authors
Khan, Abbas [1, 2]
Kim, Hyongsuk [1, 2]
Chua, Leon [3]
Affiliations
[1] Jeonbuk Natl Univ, Div Elect & Informat Engn, Jeonju 54896, South Korea
[2] Jeonbuk Natl Univ, Core Res Inst Intelligent Robots, Jeonju 54896, South Korea
[3] Univ Calif Berkeley, Dept Elect Engn & Comp Sci, Berkeley, CA 94720 USA
Funding
National Research Foundation, Singapore;
Keywords
Image segmentation; Decoding; Feature extraction; Medical diagnostic imaging; Training; Diseases; Deep learning; Convolutional neural networks; encoder-decoder architecture; medical image processing; semantic segmentation;
DOI
10.1109/ACCESS.2021.3071754
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
A pyramidal multi-scale encoder-decoder network, PMED-Net, is proposed for medical image segmentation. Several variants of encoder-decoder networks are used to segment medical images, and U-Net is the most widely used. However, existing architectures for medical image segmentation have millions of parameters and require enormous computation, which makes them inefficient in memory and cost. To overcome these limitations, we train small networks in a cascade for coarse-to-fine prediction. The proposed adaptive network extends up to six pyramid levels, and at each level, features are extracted at different scales of the input image. Each lightweight encoder-decoder network is trained independently to minimize loss, and each succeeding level further refines the predictions of the previous one. We evaluated and compared our architecture on four publicly available medical image segmentation datasets: the International Skin Imaging Collaboration (ISIC) challenge 2018 dataset, a brain tumor dataset, a nuclei dataset, and an X-ray dataset. The experimental results of PMED-Net are better than or on par with other state-of-the-art networks in terms of IoU, F1-score, and sensitivity. Moreover, PMED-Net is parameter-efficient: it has 1/21.3, 1/21.1, 1/14.0, 1/11.6, 1/11.2, 1/6.64, and 1/4.95 as many parameters as SegNet, U-Net, BCDU-Net, CU-Net, FCN-8s, ORED-Net, and MultiResUNet, respectively. The pre-trained models, dataset information, and implementation details are available at https://github.com/kabbas570/Pyramid-Based-Encoder-Decoder.
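The three metrics named in the abstract (IoU, F1-score, and sensitivity) have standard definitions for binary segmentation masks. The sketch below illustrates those definitions only; it is not taken from the authors' repository. The function name `segmentation_metrics`, the nested-list mask format, and the convention of returning 1.0 when a denominator is zero are all assumptions made for this example.

```python
def segmentation_metrics(pred, target):
    """Return (IoU, F1, sensitivity) for two binary masks of equal shape.

    Masks are nested lists of 0/1 values (hypothetical format for this sketch).
    """
    tp = fp = fn = 0
    for pred_row, target_row in zip(pred, target):
        for p, t in zip(pred_row, target_row):
            if p and t:
                tp += 1      # predicted foreground, truly foreground
            elif p and not t:
                fp += 1      # predicted foreground, truly background
            elif t and not p:
                fn += 1      # missed foreground pixel

    def ratio(num, den):
        # Assumed convention: an empty denominator counts as a perfect score.
        return num / den if den else 1.0

    iou = ratio(tp, tp + fp + fn)           # intersection over union
    f1 = ratio(2 * tp, 2 * tp + fp + fn)    # Dice / F1-score
    sensitivity = ratio(tp, tp + fn)        # recall on foreground pixels
    return iou, f1, sensitivity


if __name__ == "__main__":
    pred = [[1, 1], [0, 0]]
    target = [[1, 0], [1, 0]]
    print(segmentation_metrics(pred, target))
```

For a single binary mask pair, IoU and F1 are monotonically related (F1 = 2·IoU / (1 + IoU)), so ranking by one agrees with ranking by the other, while sensitivity ignores false positives and can differ.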
Pages: 55988-55998
Number of pages: 11