Semantic Segmentation by Using Down-Sampling and Subpixel Convolution: DSSC-UNet

被引:3
作者
Kwon, Young-Man [1 ]
Bae, Sunghoon [1 ]
Chung, Dong-Keun [1 ]
Lim, Myung-Jae [1 ]
机构
[1] Eulji Univ, Dept Med IT, 553 Sanseong Daero, Seongnam Si 13135, Gyeonggi Do, South Korea
来源
CMC-COMPUTERS MATERIALS & CONTINUA | 2023年 / 75卷 / 01期
关键词
Semantic segmentation; encoder-decoder; U-Net; DSSC-UNet; subpixel convolution; down-sampling;
D O I
10.32604/cmc.2023.033370
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Recently, semantic segmentation has been widely applied to image processing, scene understanding, and many others. Especially, in deep learning-based semantic segmentation, the U-Net with convolutional encoder-decoder architecture is a representative model which is proposed for image segmentation in the biomedical field. It used max pooling operation for reducing the size of image and making noise robust. However, instead of reducing the complexity of the model, max pooling has the disadvantage of omitting some information about the image in reducing it. So, this paper used two diagonal elements of down-sampling operation instead of it. We think that the down-sampling feature maps have more information intrinsically than max pooling feature maps because of keeping the Nyquist theorem and extracting the latent information from them. In addition, this paper used two other diagonal elements for the skip connection. In decoding, we used Subpixel Convolution rather than transposed convolution to efficiently decode the encoded feature maps. Including all the ideas, this paper proposed the new encoder-decoder model called Down-Sampling and Subpixel Convolution U-Net (DSSC-UNet). To prove the better performance of the proposed model, this paper measured the performance of the U -Net and DSSC-UNet on the Cityscapes. As a result, DSSC-UNet achieved 89.6% Mean Intersection Over Union (Mean-IoU) and U-Net achieved 85.6% Mean-IoU, confirming that DSSC-UNet achieved better performance.
引用
收藏
页码:683 / 696
页数:14
相关论文
共 40 条
  • [31] ARGA-Unet: Advanced U-net segmentation model using residual grouped convolution and attention mechanism for brain tumor MRI image segmentation
    XUN S.
    ZHANG Y.
    DUAN S.
    WANG M.
    CHEN J.
    TONG T.
    GAO Q.
    LAM C.
    HU M.
    TAN T.
    Virtual Reality and Intelligent Hardware, 2024, 6 (03): : 203 - 216
  • [32] Improving Breast Cancer Detection and Diagnosis through Semantic Segmentation Using the Unet3+ Deep Learning Framework
    Alam, Taukir
    Shia, Wei-Chung
    Hsu, Fang-Rong
    Hassan, Taimoor
    BIOMEDICINES, 2023, 11 (06)
  • [33] Restricted Deformable Convolution-Based Road Scene Semantic Segmentation Using Surround View Cameras
    Deng, Liuyuan
    Yang, Ming
    Li, Hao
    Li, Tianyi
    Hu, Bing
    Wang, Chunxiang
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2020, 21 (10) : 4350 - 4362
  • [34] Fast 3D Semantic Segmentation Using a Self Attention Network and Random Sampling
    Babu, Sandeep
    Jegarian, Majid
    Fischer, Dirk
    Mertschinbg, Baerbel
    TOWARDS AUTONOMOUS ROBOTIC SYSTEMS, TAROS 2023, 2023, 14136 : 255 - 266
  • [35] High-Resolution Semantic Segmentation of Woodland Fires Using Residual Attention UNet and Time Series of Sentinel-2
    Shirvani, Zeinab
    Abdi, Omid
    Goodman, Rosa C.
    REMOTE SENSING, 2023, 15 (05)
  • [36] Deep learning-based multi-category disease semantic image segmentation detection for concrete structures using the Res-Unet model
    Han, Xiaojian
    Cheng, Qibin
    Chen, Qizhi
    Chen, Lingkun
    Liu, Peng
    JOURNAL OF CIVIL STRUCTURAL HEALTH MONITORING, 2024, : 1369 - 1380
  • [37] Semi-Supervised Adversarial Semantic Segmentation Network Using Transformer and Multiscale Convolution for High-Resolution Remote Sensing Imagery
    Zheng, Yalan
    Yang, Mengyuan
    Wang, Min
    Qian, Xiaojun
    Yang, Rui
    Zhang, Xin
    Dong, Wen
    REMOTE SENSING, 2022, 14 (08)
  • [38] DCAE-UNET: IMPROVED OPTIC DISC SEGMENTATION MODEL USING SEMI-SUPERVISED DEEP DILATED CONVOLUTION AUTOENCODER-BASED MODIFIED U-NET
    Shalini, R.
    Gopi, Varun P.
    BIOMEDICAL ENGINEERING-APPLICATIONS BASIS COMMUNICATIONS, 2023, 35 (06):
  • [39] Integration of top-down and bottom-up visual processing using a recurrent convolutional–deconvolutional neural network for semantic segmentation
    Byung Wan Kim
    Youngbin Park
    Il Hong Suh
    Intelligent Service Robotics, 2020, 13 : 87 - 97
  • [40] Integration of top-down and bottom-up visual processing using a recurrent convolutional-deconvolutional neural network for semantic segmentation
    Kim, Byung Wan
    Park, Youngbin
    Suh, Il Hong
    INTELLIGENT SERVICE ROBOTICS, 2020, 13 (01) : 87 - 97