PPEDNet: Pyramid Pooling Encoder-Decoder Network for Real-Time Semantic Segmentation

被引:1
|
作者
Tan, Zhentao [1 ,2 ]
Liu, Bin [1 ,2 ]
Yu, Nenghai [1 ,2 ]
机构
[1] Chinese Acad Sci, Key Lab Electromagnet Space Informat, Hefei, Peoples R China
[2] Univ Sci & Technol China, Sch Informat Sci & Technol, Hefei, Peoples R China
来源
IMAGE AND GRAPHICS (ICIG 2017), PT I | 2017年 / 10666卷
基金
中国国家自然科学基金;
关键词
Semantic segmentation; Pyramid pooling; Real-time; OBJECT CLASSES;
D O I
10.1007/978-3-319-71607-7_29
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Image semantic segmentation is a fundamental problem and plays an important role in computer vision and artificial intelligence. Recent deep neural networks have improved the accuracy of semantic segmentation significantly. Meanwhile, the number of network parameters and floating point operations have also increased notably. The real-world applications not only have high requirements on the segmentation accuracy, but also demand real-time processing. In this paper, we propose a pyramid pooling encoder-decoder network named PPEDNet for both better accuracy and faster processing speed. Our encoder network is based on VGG16 and discards the fully connected layers due to their huge amounts of parameters. To extract context feature efficiently, we design a pyramid pooling architecture. The decoder is a trainable convolutional network for upsampling the output of the encoder, and fine-tuning the segmentation details. Our method is evaluated on CamVid dataset, achieving 7.214% mIOU accuracy improvement while reducing 17.9% of the parameters compared with the state-of-the-art algorithm.
引用
收藏
页码:328 / 339
页数:12
相关论文
共 50 条
  • [1] LEDNET: A LIGHTWEIGHT ENCODER-DECODER NETWORK FOR REAL-TIME SEMANTIC SEGMENTATION
    Wang, Yu
    Zhou, Quan
    Liu, Jia
    Xiong, Jian
    Gao, Guangwei
    Wu, Xiaofu
    Latecki, Longin Jan
    2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 1860 - 1864
  • [2] Fast Real-time Semantic Segmentation Network with an Asymmetric Encoder-Decoder Structure
    Rui, Tang
    Yan, Li Hui
    Kai, Xu
    Yi, Ding
    2020 5TH INTERNATIONAL CONFERENCE ON MECHANICAL, CONTROL AND COMPUTER ENGINEERING (ICMCCE 2020), 2020, : 2408 - 2413
  • [3] Pooling Attention-based Encoder-Decoder Network for semantic segmentation
    Xu, Haixia
    Huang, Yunjia
    Hancock, Edwin R.
    Wang, Shuailong
    Xuan, Qijun
    Zhou, Wei
    COMPUTERS & ELECTRICAL ENGINEERING, 2021, 93
  • [4] Encoder-decoder with double spatial pyramid for semantic segmentation
    Kong, Huifang
    Hu, Jie
    Fan, Lei
    Zhang, Xiaoxue
    Fang, Yao
    JOURNAL OF ELECTRONIC IMAGING, 2019, 28 (06)
  • [5] Encoder-Decoder Network with Depthwise Atrous Spatial Pyramid Pooling for Automatic Brain Tumor Segmentation
    AboElenein, Nagwa M.
    Piao, Songhao
    Zhang, Zhehong
    NEURAL PROCESSING LETTERS, 2023, 55 (02) : 1697 - 1713
  • [6] Real-time semantic segmentation of microvascular decompression images based on encoder-decoder structure
    Bai Rui-feng
    Jiang Shan
    Sun Hai-jiang
    Liu Xin-rui
    CHINESE OPTICS, 2022, 15 (05) : 1055 - 1065
  • [7] DARSegNet: A Real-Time Semantic Segmentation Method Based on Dual Attention Fusion Module and Encoder-Decoder Network
    Xing, Yongfeng
    Zhong, Luo
    Zhong, Xian
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2022, 2022
  • [8] Real-time semantic segmentation in traffic scene using Cross Stage Partial-based encoder-decoder network
    Zhou, Liguo
    Chen, Guang
    Liu, Lian
    Wang, Ruining
    Knoll, Alois
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 126
  • [9] Encoder-decoder with dense dilated spatial pyramid pooling for prostate MR images segmentation
    Geng, Lei
    Wang, Jia
    Xiao, Zhitao
    Tong, Jun
    Zhang, Fang
    Wu, Jun
    COMPUTER ASSISTED SURGERY, 2019, 24 : 13 - 19
  • [10] An Encoder-Decoder Network Based FCN Architecture for Semantic Segmentation
    Xing, Yongfeng
    Zhong, Luo
    Zhong, Xian
    WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2020, 2020