Improving Semantic Segmentation via Video Propagation and Label Relaxation

被引:217
|
作者
Zhu, Yi [1 ]
Sapra, Karan [2 ]
Reda, Fitsum A. [2 ]
Shih, Kevin J. [2 ]
Newsam, Shawn [1 ]
Tao, Andrew [2 ]
Catanzaro, Bryan [2 ]
机构
[1] Univ Calif Merced, Merced, CA 95343 USA
[2] Nvidia Corp, Santa Clara, CA USA
来源
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019) | 2019年
关键词
D O I
10.1109/CVPR.2019.00906
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Semantic segmentation requires large amounts of pixel-wise annotations to learn accurate models. In this paper, we present a video prediction-based methodology to scale up training sets by synthesizing new training samples in order to improve the accuracy of semantic segmentation networks. We exploit video prediction models' ability to predict future frames in order to also predict future labels. A joint propagation strategy is also proposed to alleviate mis-alignments in synthesized samples. We demonstrate that training segmentation models on datasets augmented by the synthesized samples leads to significant improvements in accuracy. Furthermore, we introduce a novel boundary label relaxation technique that makes training robust to annotation noise and propagation artifacts along object boundaries. Our proposed methods achieve state-of-the-art mIoUs of 83.5% on Cityscapes and 82.9% on CamVid. Our single model, without model ensembles, achieves 72.8% mIoU on the KITTI semantic segmentation test set, which surpasses the winning entry of the ROB challenge 2018.
引用
收藏
页码:8848 / 8857
页数:10
相关论文
共 50 条
  • [21] MULTI-LABEL PROPAGATION FOR COHERENT VIDEO SEGMENTATION AND ARTISTIC STYLIZATION
    Wang, Tinghuai
    Guillemaut, Jean-Yves
    Collomosse, John
    2010 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, 2010, : 3005 - 3008
  • [22] Improving Segmentation of the Inferior Alveolar Nerve through Deep Label Propagation
    Cipriano, Marco
    Allegretti, Stefano
    Bolelli, Federico
    Pollastri, Federico
    Grana, Costantino
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 21105 - 21114
  • [23] Interactive shape co-segmentation via label propagation
    Wu, Zizhao
    Shou, Ruyang
    Wang, Yunhai
    Liu, Xinguo
    COMPUTERS & GRAPHICS-UK, 2014, 38 : 248 - 254
  • [24] Label Propagation and Contrastive Regularization for Semisupervised Semantic Segmentation of Remote Sensing Images
    Yang, Zhujun
    Yan, Zhiyuan
    Diao, Wenhui
    Zhang, Qiang
    Kang, Yuzhuo
    Li, Junxi
    Li, Xinming
    Sun, Xian
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [25] Improving Video Segmentation via Dynamic Anchor Queries
    Zhou, Yikang
    Zhang, Tao
    Ji, Shunping
    Yan, Shuicheng
    Li, Xiangtai
    COMPUTER VISION - ECCV 2024, PT L, 2025, 15108 : 446 - 463
  • [26] Noisy-LSTM: Improving Temporal Awareness for Video Semantic Segmentation
    Wang, Bowen
    Li, Liangzhi
    Nakashima, Yuta
    Kawasaki, Ryo
    Nagahara, Hajime
    Yagi, Yasushi
    IEEE ACCESS, 2021, 9 : 46810 - 46820
  • [27] Semantic Object Segmentation via Detection in Weakly Labeled Video
    Zhang, Yu
    Chen, Xiaowu
    Li, Jia
    Wang, Chen
    Xia, Changqun
    2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2015, : 3641 - 3649
  • [28] Improving Semantic Segmentation via Efficient Self-Training
    Zhu, Yi
    Zhang, Zhongyue
    Wu, Chongruo
    Zhang, Zhi
    He, Tong
    Zhang, Hang
    Manmatha, R.
    Li, Mu
    Smola, Alexander
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (03) : 1589 - 1602
  • [29] Improving Semantic Segmentation via Decoupled Body and Edge Information
    Yu, Lintao
    Yao, Anni
    Duan, Jin
    ENTROPY, 2023, 25 (06)
  • [30] Shot Boundary Detection and Label Propagation for Spatio-Temporal Video Segmentation
    Piramanayagam, Sankaranarayanan
    Saber, Eli
    Cahill, Nathan D.
    Messinger, David
    IMAGE PROCESSING: MACHINE VISION APPLICATIONS VIII, 2015, 9405