Improving Semantic Segmentation via Video Propagation and Label Relaxation

被引：217

作者：

Zhu, Yi ^{[1
]}

Sapra, Karan ^{[2
]}

Reda, Fitsum A. ^{[2
]}

Shih, Kevin J. ^{[2
]}

Newsam, Shawn ^{[1
]}

Tao, Andrew ^{[2
]}

Catanzaro, Bryan ^{[2
]}

机构：

[1] Univ Calif Merced, Merced, CA 95343 USA

[2] Nvidia Corp, Santa Clara, CA USA

来源：

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019) | 2019年

关键词：

D O I：

10.1109/CVPR.2019.00906

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Semantic segmentation requires large amounts of pixel-wise annotations to learn accurate models. In this paper, we present a video prediction-based methodology to scale up training sets by synthesizing new training samples in order to improve the accuracy of semantic segmentation networks. We exploit video prediction models' ability to predict future frames in order to also predict future labels. A joint propagation strategy is also proposed to alleviate mis-alignments in synthesized samples. We demonstrate that training segmentation models on datasets augmented by the synthesized samples leads to significant improvements in accuracy. Furthermore, we introduce a novel boundary label relaxation technique that makes training robust to annotation noise and propagation artifacts along object boundaries. Our proposed methods achieve state-of-the-art mIoUs of 83.5% on Cityscapes and 82.9% on CamVid. Our single model, without model ensembles, achieves 72.8% mIoU on the KITTI semantic segmentation test set, which surpasses the winning entry of the ROB challenge 2018.

引用

页码：8848 / 8857

页数：10

共 50 条

[11] Efficient Video Semantic Segmentation with Labels Propagation and Refinement
Paul, Matthieu
Mayer, Christoph
Van Gool, Luc
Timofte, Radu
2020 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2020, : 2862 - 2871
[12] Semantic Video Segmentation by Gated Recurrent Flow Propagation
Nilsson, David
Sminchisescu, Cristian
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 6819 - 6828
[13] Feature Fusion and Label Propagation for Textured Object Video Segmentation
Prasath, V. B. Surya
Pelapur, Rengarajan
Palaniappan, Kannappan
Seetharaman, Gunasekaran
GEOSPATIAL INFOFUSION AND VIDEO ANALYTICS IV; AND MOTION IMAGERY FOR ISR AND SITUATIONAL AWARENESS II, 2014, 9089
[14] CONTEXT PROPAGATION FROM PROPOSALS FOR SEMANTIC VIDEO OBJECT SEGMENTATION
Wang, Tinghuai
2018 25TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2018, : 256 - 260
[15] Label-efficient Segmentation via Affinity Propagation
Li, Wentong
Yuan, Yuqian
Wang, Song
Liu, Wenyu
Tang, Dongqi
Liu, Jian
Zhu, Jianke
Zhang, Lei
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
[16] Video Semantic Segmentation via Sparse Temporal Transformer
Li, Jiangtong
Wang, Wentao
Chen, Junjie
Niu, Li
Si, Jianlou
Qian, Chen
Zhang, Liqing
PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 59 - 68
[17] Context Label Learning: Improving Background Class Representations in Semantic Segmentation
Li, Zeju
Kamnitsas, Konstantinos
Ouyang, Cheng
Chen, Chen
Glocker, Ben
IEEE TRANSACTIONS ON MEDICAL IMAGING, 2023, 42 (06) : 1885 - 1896
[18] Improving video foreground segmentation and propagation through multifeature fusion
Cheng, Xiaoliu
Wang, Yan
Yuan, Xiaobing
Li, Baoqing
Ding, Yuanyuan
Zhang, Zebin
JOURNAL OF ELECTRONIC IMAGING, 2015, 24 (06)
[19] Efficient frame-sequential label propagation for video object segmentation
Yadang Chen
Chuanyan Hao
Wen Wu
Enhua Wu
Multimedia Tools and Applications, 2018, 77 : 6117 - 6133
[20] Efficient frame-sequential label propagation for video object segmentation
Chen, Yadang
Hao, Chuanyan
Wu, Wen
Wu, Enhua
MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (05) : 6117 - 6133

← 1 2 3 4 5 →