Improving Semantic Segmentation via Video Propagation and Label Relaxation

Cited by: 217
Authors
Zhu, Yi [1]
Sapra, Karan [2]
Reda, Fitsum A. [2]
Shih, Kevin J. [2]
Newsam, Shawn [1]
Tao, Andrew [2]
Catanzaro, Bryan [2]
Affiliations
[1] Univ Calif Merced, Merced, CA 95343 USA
[2] Nvidia Corp, Santa Clara, CA USA
Source
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019) | 2019
DOI
10.1109/CVPR.2019.00906
Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Semantic segmentation requires large amounts of pixel-wise annotations to learn accurate models. In this paper, we present a video prediction-based methodology to scale up training sets by synthesizing new training samples in order to improve the accuracy of semantic segmentation networks. We exploit video prediction models' ability to predict future frames in order to also predict future labels. A joint propagation strategy is also proposed to alleviate mis-alignments in synthesized samples. We demonstrate that training segmentation models on datasets augmented by the synthesized samples leads to significant improvements in accuracy. Furthermore, we introduce a novel boundary label relaxation technique that makes training robust to annotation noise and propagation artifacts along object boundaries. Our proposed methods achieve state-of-the-art mIoUs of 83.5% on Cityscapes and 82.9% on CamVid. Our single model, without model ensembles, achieves 72.8% mIoU on the KITTI semantic segmentation test set, which surpasses the winning entry of the ROB challenge 2018.
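The boundary label relaxation mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not spell out the loss, so the code below assumes one plausible form: at each pixel, every class that appears in a small window around that pixel counts as acceptable, and the loss is the negative log of the predicted probability summed over those classes. The function name `relaxed_boundary_loss`, the `radius` parameter, and the NumPy formulation are illustrative assumptions, not the authors' implementation.

```python
import numpy as np


def relaxed_boundary_loss(probs, labels, radius=1):
    """Hypothetical relaxed loss (an assumption, not the paper's code).

    probs:  (H, W, K) per-pixel class probabilities (e.g. softmax output).
    labels: (H, W) integer class ids in [0, K).
    radius: half-width of the square window used to collect nearby classes.
    """
    h, w, k = probs.shape
    # One-hot encode the labels as booleans: shape (H, W, K).
    one_hot = np.eye(k, dtype=bool)[labels]
    # Pad with False so image-border pixels only see in-image neighbours.
    padded = np.pad(one_hot, ((radius, radius), (radius, radius), (0, 0)))
    # allowed[y, x, c] is True if class c occurs anywhere in the window at (y, x).
    allowed = np.zeros_like(one_hot)
    for dy in range(2 * radius + 1):
        for dx in range(2 * radius + 1):
            allowed |= padded[dy:dy + h, dx:dx + w]
    # Probability mass assigned to the union of acceptable classes.
    union_prob = (probs * allowed).sum(axis=-1)
    # Negative log-likelihood of the union, averaged over all pixels.
    return float(-np.log(np.clip(union_prob, 1e-12, None)).mean())
```

Under this assumption, a pixel whose window contains a single class contributes an ordinary cross-entropy term, while a pixel next to a class boundary is not penalized for predicting either neighbouring class.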
Pages: 8848-8857
Page count: 10
Related papers
50 results in total
  • [1] Video semantic segmentation via feature propagation with holistic attention
    Wu, Junrong
    Wen, Zongzheng
    Zhao, Sanyuan
    Huang, Kele
    PATTERN RECOGNITION, 2020, 104
  • [2] Improving Unsupervised Label Propagation for Pose Tracking and Video Object Segmentation
    Waldmann, Urs
    Bamberger, Jannik
    Johannsen, Ole
    Deussen, Oliver
    Goldlucke, Bastian
    PATTERN RECOGNITION, DAGM GCPR 2022, 2022, 13485 : 230 - 245
  • [3] Can Ground Truth Label Propagation from Video Help Semantic Segmentation?
    Mustikovela, Siva Karthik
    Yang, Michael Ying
    Rother, Carsten
    COMPUTER VISION - ECCV 2016 WORKSHOPS, PT III, 2016, 9915 : 804 - 820
  • [4] Improving Semantic Image Segmentation via Label Fusion in Semantically Textured Meshes
    Fervers, Florian
    Breuer, Timo
    Stachowiak, Gregor
    Bullinger, Sebastian
    Bodensteiner, Christoph
    Arens, Michael
    PROCEEDINGS OF THE 17TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS (VISAPP), VOL 5, 2022 : 509 - 516
  • [5] Asymmetric Label Propagation for Video Object Segmentation
    Chen, Zhen
    Yang, Ming
    Zhang, Shiliang
    PROCEEDINGS OF THE 4TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA IN ASIA, MMASIA 2022, 2022
  • [6] Mask Propagation for Efficient Video Semantic Segmentation
    Weng, Yuetian
    Han, Mingfei
    He, Haoyu
    Li, Mingjie
    Yao, Lina
    Chang, Xiaojun
    Zhuang, Bohan
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023
  • [8] Enhancing Semi Supervised Semantic Segmentation Through Cycle-Consistent Label Propagation in Video
    Addanki, Veerababu
    Yerramreddy, Dhanvanth Reddy
    Durgapu, Sathvik
    Boddu, Sasi Sai Nadh
    Durgapu, Vyshnav
    NEURAL PROCESSING LETTERS, 2024, 56 (01)
  • [9] CorrMatch: Label Propagation via Correlation Matching for Semi-Supervised Semantic Segmentation
    Sun, Boyuan
    Yang, Yuqi
    Le, Zhang
    Cheng, Ming-Ming
    Hou, Qibin
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2024, 2024 : 3097 - 3107
  • [10] Real-Time Semantic Segmentation with Label Propagation
    Sheikh, Rasha
    Garbade, Martin
    Gall, Juergen
    COMPUTER VISION - ECCV 2016 WORKSHOPS, PT II, 2016, 9914 : 3 - 14