Improving Semantic Segmentation via Video Propagation and Label Relaxation

被引:217
|
作者
Zhu, Yi [1 ]
Sapra, Karan [2 ]
Reda, Fitsum A. [2 ]
Shih, Kevin J. [2 ]
Newsam, Shawn [1 ]
Tao, Andrew [2 ]
Catanzaro, Bryan [2 ]
机构
[1] Univ Calif Merced, Merced, CA 95343 USA
[2] Nvidia Corp, Santa Clara, CA USA
来源
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019) | 2019年
关键词
D O I
10.1109/CVPR.2019.00906
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Semantic segmentation requires large amounts of pixel-wise annotations to learn accurate models. In this paper, we present a video prediction-based methodology to scale up training sets by synthesizing new training samples in order to improve the accuracy of semantic segmentation networks. We exploit video prediction models' ability to predict future frames in order to also predict future labels. A joint propagation strategy is also proposed to alleviate mis-alignments in synthesized samples. We demonstrate that training segmentation models on datasets augmented by the synthesized samples leads to significant improvements in accuracy. Furthermore, we introduce a novel boundary label relaxation technique that makes training robust to annotation noise and propagation artifacts along object boundaries. Our proposed methods achieve state-of-the-art mIoUs of 83.5% on Cityscapes and 82.9% on CamVid. Our single model, without model ensembles, achieves 72.8% mIoU on the KITTI semantic segmentation test set, which surpasses the winning entry of the ROB challenge 2018.
引用
收藏
页码:8848 / 8857
页数:10
相关论文
共 50 条
  • [11] Efficient Video Semantic Segmentation with Labels Propagation and Refinement
    Paul, Matthieu
    Mayer, Christoph
    Van Gool, Luc
    Timofte, Radu
    2020 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2020, : 2862 - 2871
  • [12] Semantic Video Segmentation by Gated Recurrent Flow Propagation
    Nilsson, David
    Sminchisescu, Cristian
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 6819 - 6828
  • [13] Feature Fusion and Label Propagation for Textured Object Video Segmentation
    Prasath, V. B. Surya
    Pelapur, Rengarajan
    Palaniappan, Kannappan
    Seetharaman, Gunasekaran
    GEOSPATIAL INFOFUSION AND VIDEO ANALYTICS IV; AND MOTION IMAGERY FOR ISR AND SITUATIONAL AWARENESS II, 2014, 9089
  • [14] CONTEXT PROPAGATION FROM PROPOSALS FOR SEMANTIC VIDEO OBJECT SEGMENTATION
    Wang, Tinghuai
    2018 25TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2018, : 256 - 260
  • [15] Label-efficient Segmentation via Affinity Propagation
    Li, Wentong
    Yuan, Yuqian
    Wang, Song
    Liu, Wenyu
    Tang, Dongqi
    Liu, Jian
    Zhu, Jianke
    Zhang, Lei
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [16] Video Semantic Segmentation via Sparse Temporal Transformer
    Li, Jiangtong
    Wang, Wentao
    Chen, Junjie
    Niu, Li
    Si, Jianlou
    Qian, Chen
    Zhang, Liqing
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 59 - 68
  • [17] Context Label Learning: Improving Background Class Representations in Semantic Segmentation
    Li, Zeju
    Kamnitsas, Konstantinos
    Ouyang, Cheng
    Chen, Chen
    Glocker, Ben
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2023, 42 (06) : 1885 - 1896
  • [18] Improving video foreground segmentation and propagation through multifeature fusion
    Cheng, Xiaoliu
    Wang, Yan
    Yuan, Xiaobing
    Li, Baoqing
    Ding, Yuanyuan
    Zhang, Zebin
    JOURNAL OF ELECTRONIC IMAGING, 2015, 24 (06)
  • [19] Efficient frame-sequential label propagation for video object segmentation
    Yadang Chen
    Chuanyan Hao
    Wen Wu
    Enhua Wu
    Multimedia Tools and Applications, 2018, 77 : 6117 - 6133
  • [20] Efficient frame-sequential label propagation for video object segmentation
    Chen, Yadang
    Hao, Chuanyan
    Wu, Wen
    Wu, Enhua
    MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (05) : 6117 - 6133