Improving Semantic Segmentation via Video Propagation and Label Relaxation

被引:220
作者
Zhu, Yi [1 ]
Sapra, Karan [2 ]
Reda, Fitsum A. [2 ]
Shih, Kevin J. [2 ]
Newsam, Shawn [1 ]
Tao, Andrew [2 ]
Catanzaro, Bryan [2 ]
机构
[1] Univ Calif Merced, Merced, CA 95343 USA
[2] Nvidia Corp, Santa Clara, CA USA
来源
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019) | 2019年
关键词
D O I
10.1109/CVPR.2019.00906
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Semantic segmentation requires large amounts of pixel-wise annotations to learn accurate models. In this paper, we present a video prediction-based methodology to scale up training sets by synthesizing new training samples in order to improve the accuracy of semantic segmentation networks. We exploit video prediction models' ability to predict future frames in order to also predict future labels. A joint propagation strategy is also proposed to alleviate mis-alignments in synthesized samples. We demonstrate that training segmentation models on datasets augmented by the synthesized samples leads to significant improvements in accuracy. Furthermore, we introduce a novel boundary label relaxation technique that makes training robust to annotation noise and propagation artifacts along object boundaries. Our proposed methods achieve state-of-the-art mIoUs of 83.5% on Cityscapes and 82.9% on CamVid. Our single model, without model ensembles, achieves 72.8% mIoU on the KITTI semantic segmentation test set, which surpasses the winning entry of the ROB challenge 2018.
引用
收藏
页码:8848 / 8857
页数:10
相关论文
共 50 条
  • [31] Shot Boundary Detection and Label Propagation for Spatio-Temporal Video Segmentation
    Piramanayagam, Sankaranarayanan
    Saber, Eli
    Cahill, Nathan D.
    Messinger, David
    IMAGE PROCESSING: MACHINE VISION APPLICATIONS VIII, 2015, 9405
  • [32] Semantic Trajectory Clustering via Improved Label Propagation With Core Structure
    Qiao, Dianfeng
    Liang, Yan
    Ma, Chaoxiong
    Zhang, Huixia
    IEEE SENSORS JOURNAL, 2022, 22 (01) : 639 - 650
  • [33] Enhancing software modularization via semantic outliers filtration and label propagation
    Yang, Kaiyuan
    Wang, Junfeng
    Fang, Zhiyang
    Wu, Peng
    Song, Zihua
    INFORMATION AND SOFTWARE TECHNOLOGY, 2022, 145
  • [34] Learning random-walk label propagation for weakly-supervised semantic segmentation
    Vernaza, Paul
    Chandraker, Manmohan
    30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 2953 - 2961
  • [35] Spatiotemporal Semantic Video Segmentation
    Galmar, E.
    Athanasiadis, Th
    Huet, B.
    Avrithis, Y.
    2008 IEEE 10TH WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING, VOLS 1 AND 2, 2008, : 578 - +
  • [36] Improving Video Instance Segmentation via Temporal Pyramid Routing
    Li, Xiangtai
    He, Hao
    Yang, Yibo
    Ding, Henghui
    Yang, Kuiyuan
    Cheng, Guangliang
    Tong, Yunhai
    Tao, Dacheng
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (05) : 6594 - 6601
  • [37] Label Propagation in Video Sequences
    Badrinarayanan, Vijay
    Galasso, Fabio
    Cipolla, Roberto
    2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2010, : 3265 - 3272
  • [38] On the Importance of Label Quality for Semantic Segmentation
    Zlateski, Aleksandar
    Jaroensri, Ronnachai
    Sharma, Prafull
    Durand, Fredo
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 1479 - 1487
  • [39] Soft-Boundary Label Relaxation with class placement constraints for semantic segmentation of the railway environment
    Furitsu, Yuki
    Deguchi, Daisuke
    Kawanishi, Yasutomo
    Ide, Ichiro
    Murase, Hiroshi
    Mukojima, Hiroki
    Nagamine, Nozomi
    PATTERN RECOGNITION LETTERS, 2021, 150 : 258 - 264
  • [40] Efficient MRF Energy Propagation for Video Segmentation via Bilateral Filters
    Sener, Ozan
    Ugur, Kemal
    Alatan, A. Aydin
    IEEE TRANSACTIONS ON MULTIMEDIA, 2014, 16 (05) : 1292 - 1302