Improving Semantic Segmentation via Video Propagation and Label Relaxation

被引：217

作者：

Zhu, Yi ^{[1
]}

Sapra, Karan ^{[2
]}

Reda, Fitsum A. ^{[2
]}

Shih, Kevin J. ^{[2
]}

Newsam, Shawn ^{[1
]}

Tao, Andrew ^{[2
]}

Catanzaro, Bryan ^{[2
]}

机构：

[1] Univ Calif Merced, Merced, CA 95343 USA

[2] Nvidia Corp, Santa Clara, CA USA

来源：

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019) | 2019年

关键词：

D O I：

10.1109/CVPR.2019.00906

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Semantic segmentation requires large amounts of pixel-wise annotations to learn accurate models. In this paper, we present a video prediction-based methodology to scale up training sets by synthesizing new training samples in order to improve the accuracy of semantic segmentation networks. We exploit video prediction models' ability to predict future frames in order to also predict future labels. A joint propagation strategy is also proposed to alleviate mis-alignments in synthesized samples. We demonstrate that training segmentation models on datasets augmented by the synthesized samples leads to significant improvements in accuracy. Furthermore, we introduce a novel boundary label relaxation technique that makes training robust to annotation noise and propagation artifacts along object boundaries. Our proposed methods achieve state-of-the-art mIoUs of 83.5% on Cityscapes and 82.9% on CamVid. Our single model, without model ensembles, achieves 72.8% mIoU on the KITTI semantic segmentation test set, which surpasses the winning entry of the ROB challenge 2018.

引用

页码：8848 / 8857

页数：10

共 50 条

[1] Video semantic segmentation via feature propagation with holistic attention
Wu, Junrong
Wen, Zongzheng
Zhao, Sanyuan
Huang, Kele
PATTERN RECOGNITION, 2020, 104
[2] Improving Unsupervised Label Propagation for Pose Tracking and Video Object Segmentation
Waldmann, Urs
Bamberger, Jannik
Johannsen, Ole
Deussen, Oliver
Goldlucke, Bastian
PATTERN RECOGNITION, DAGM GCPR 2022, 2022, 13485 : 230 - 245
[3] Can Ground Truth Label Propagation from Video Help Semantic Segmentation?
Mustikovela, Siva Karthik
Yang, Michael Ying
Rother, Carsten
COMPUTER VISION - ECCV 2016 WORKSHOPS, PT III, 2016, 9915 : 804 - 820
[4] Improving Semantic Image Segmentation via Label Fusion in Semantically Textured Meshes
Fervers, Florian
Breuer, Timo
Stachowiak, Gregor
Bullinger, Sebastian
Bodensteiner, Christoph
Arens, Michael
PROCEEDINGS OF THE 17TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS (VISAPP), VOL 5, 2022, : 509 - 516
[5] Asymmetric Label Propagation for Video Object Segmentation
Chen, Zhen
Yang, Ming
Zhang, Shiliang
PROCEEDINGS OF THE 4TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA IN ASIA, MMASIA 2022, 2022,
[6] Mask Propagation for Efficient Video Semantic Segmentation
Weng, Yuetian
Han, Mingfei
He, Haoyu
Li, Mingjie
Yao, Lina
Chang, Xiaojun
Zhuang, Bohan
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
[7] Enhancing Semi Supervised Semantic Segmentation Through Cycle-Consistent Label Propagation in Video
Veerababu Addanki
Dhanvanth Reddy Yerramreddy
Sathvik Durgapu
Sasi Sai Nadh Boddu
Vyshnav Durgapu
Neural Processing Letters, 56
[8] Enhancing Semi Supervised Semantic Segmentation Through Cycle-Consistent Label Propagation in Video
Addanki, Veerababu
Yerramreddy, Dhanvanth Reddy
Durgapu, Sathvik
Boddu, Sasi Sai Nadh
Durgapu, Vyshnav
NEURAL PROCESSING LETTERS, 2024, 56 (01)
[9] CorrMatch: Label Propagation via Correlation Matching for Semi-Supervised Semantic Segmentation
Sun, Boyuan
Yang, Yuqi
Le, Zhang
Cheng, Ming-Ming
Hou, Qibin
2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2024, 2024, : 3097 - 3107
[10] Real-Time Semantic Segmentation with Label Propagation
Sheikh, Rasha
Garbade, Martin
Gall, Juergen
COMPUTER VISION - ECCV 2016 WORKSHOPS, PT II, 2016, 9914 : 3 - 14

← 1 2 3 4 5 →