Deep Multiphase Level Set for Scene Parsing

被引:17
作者
Zhang, Pingping [1 ]
Liu, Wei [2 ]
Lei, Yinjie [3 ]
Wang, Hongyu [1 ]
Lu, Huchuan [1 ]
机构
[1] Dalian Univ Technol, Sch Informat & Commun Engn, Dalian 116024, Peoples R China
[2] Univ Adelaide, Sch Comp Sci, Adelaide, SA 50005, Australia
[3] Sichuan Univ, Coll Elect & Informat Engn, Chengdu 610065, Peoples R China
基金
中国国家自然科学基金;
关键词
Semantic scene parsing; multiphase level set; recurrent convolutional network; object boundary estimation; IMAGE SEGMENTATION; NETWORKS; MUMFORD; MODEL;
D O I
10.1109/TIP.2019.2957915
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recently, Fully Convolutional Network (FCN) seems to be the go-to architecture for image segmentation, including semantic scene parsing. However, it is difficult for a generic FCN to predict semantic labels around the object boundaries, thus FCN-based methods usually produce parsing results with inaccurate boundaries. Meanwhile, many works have demonstrate that level set based active contours are superior to the boundary estimation in sub-pixel accuracy. However, they are quite sensitive to initial settings. To address these limitations, in this paper we propose a novel Deep Multiphase Level Set (DMLS) method for semantic scene parsing, which efficiently incorporates multiphase level sets into deep neural networks. The proposed method consists of three modules, i.e., recurrent FCNs, adaptive multiphase level set, and deeply supervised learning. More specifically, recurrent FCNs learn multi-level representations of input images with different contexts. Adaptive multiphase level set drives the discriminative contour for each semantic class, which makes use of the advantages of both global and local information. In each time-step of the recurrent FCNs, deeply supervised learning is incorporated for model training. Extensive experiments on three public benchmarks have shown that our proposed method achieves new state-of-the-art performances. The source codes will be released at https://github.com/Pchank/DMLS-for-SSP.
引用
收藏
页码:4556 / 4567
页数:12
相关论文
共 61 条
  • [1] [Anonymous], 2018, OCNet: Object Context Network for Scene Parsing
  • [2] [Anonymous], 1999, LEVEL SET METHODS FA
  • [3] [Anonymous], ADV NEURAL INFORM PR
  • [4] SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation
    Badrinarayanan, Vijay
    Kendall, Alex
    Cipolla, Roberto
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) : 2481 - 2495
  • [5] Dense Decoder Shortcut Connections for Single-Pass Semantic Segmentation
    Bilinski, Piotr
    Prisacariu, Victor
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 6596 - 6605
  • [6] In-Place Activated BatchNorm for Memory-Optimized Training of DNNs
    Bulo, Samuel Rota
    Porzi, Lorenzo
    Kontschieder, Peter
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 5639 - 5647
  • [7] Loss Max-Pooling for Semantic Image Segmentation
    Bulo, Samuel Rota
    Neuhold, Gerhard
    Kontschieder, Peter
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 7082 - 7091
  • [8] Chen LC, 2018, ADV NEUR IN, V31
  • [9] Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation
    Chen, Liang-Chieh
    Zhu, Yukun
    Papandreou, George
    Schroff, Florian
    Adam, Hartwig
    [J]. COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 : 833 - 851
  • [10] DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs
    Chen, Liang-Chieh
    Papandreou, George
    Kokkinos, Iasonas
    Murphy, Kevin
    Yuille, Alan L.
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) : 834 - 848