Improving Semantic Segmentation via Video Propagation and Label Relaxation

被引：220

作者：

Zhu, Yi ^{[1
]}

Sapra, Karan ^{[2
]}

Reda, Fitsum A. ^{[2
]}

Shih, Kevin J. ^{[2
]}

Newsam, Shawn ^{[1
]}

Tao, Andrew ^{[2
]}

Catanzaro, Bryan ^{[2
]}

机构：

[1] Univ Calif Merced, Merced, CA 95343 USA

[2] Nvidia Corp, Santa Clara, CA USA

来源：

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019) | 2019年

关键词：

D O I：

10.1109/CVPR.2019.00906

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Semantic segmentation requires large amounts of pixel-wise annotations to learn accurate models. In this paper, we present a video prediction-based methodology to scale up training sets by synthesizing new training samples in order to improve the accuracy of semantic segmentation networks. We exploit video prediction models' ability to predict future frames in order to also predict future labels. A joint propagation strategy is also proposed to alleviate mis-alignments in synthesized samples. We demonstrate that training segmentation models on datasets augmented by the synthesized samples leads to significant improvements in accuracy. Furthermore, we introduce a novel boundary label relaxation technique that makes training robust to annotation noise and propagation artifacts along object boundaries. Our proposed methods achieve state-of-the-art mIoUs of 83.5% on Cityscapes and 82.9% on CamVid. Our single model, without model ensembles, achieves 72.8% mIoU on the KITTI semantic segmentation test set, which surpasses the winning entry of the ROB challenge 2018.

引用

页码：8848 / 8857

页数：10

共 50 条

[31] Shot Boundary Detection and Label Propagation for Spatio-Temporal Video Segmentation
Piramanayagam, Sankaranarayanan
Saber, Eli
Cahill, Nathan D.
Messinger, David
IMAGE PROCESSING: MACHINE VISION APPLICATIONS VIII, 2015, 9405
[32] Semantic Trajectory Clustering via Improved Label Propagation With Core Structure
Qiao, Dianfeng
Liang, Yan
Ma, Chaoxiong
Zhang, Huixia
IEEE SENSORS JOURNAL, 2022, 22 (01) : 639 - 650
[33] Enhancing software modularization via semantic outliers filtration and label propagation
Yang, Kaiyuan
Wang, Junfeng
Fang, Zhiyang
Wu, Peng
Song, Zihua
INFORMATION AND SOFTWARE TECHNOLOGY, 2022, 145
[34] Learning random-walk label propagation for weakly-supervised semantic segmentation
Vernaza, Paul
Chandraker, Manmohan
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 2953 - 2961
[35] Spatiotemporal Semantic Video Segmentation
Galmar, E.
Athanasiadis, Th
Huet, B.
Avrithis, Y.
2008 IEEE 10TH WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING, VOLS 1 AND 2, 2008, : 578 - +
[36] Improving Video Instance Segmentation via Temporal Pyramid Routing
Li, Xiangtai
He, Hao
Yang, Yibo
Ding, Henghui
Yang, Kuiyuan
Cheng, Guangliang
Tong, Yunhai
Tao, Dacheng
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (05) : 6594 - 6601
[37] Label Propagation in Video Sequences
Badrinarayanan, Vijay
Galasso, Fabio
Cipolla, Roberto
2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2010, : 3265 - 3272
[38] On the Importance of Label Quality for Semantic Segmentation
Zlateski, Aleksandar
Jaroensri, Ronnachai
Sharma, Prafull
Durand, Fredo
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 1479 - 1487
[39] Soft-Boundary Label Relaxation with class placement constraints for semantic segmentation of the railway environment
Furitsu, Yuki
Deguchi, Daisuke
Kawanishi, Yasutomo
Ide, Ichiro
Murase, Hiroshi
Mukojima, Hiroki
Nagamine, Nozomi
PATTERN RECOGNITION LETTERS, 2021, 150 : 258 - 264
[40] Efficient MRF Energy Propagation for Video Segmentation via Bilateral Filters
Sener, Ozan
Ugur, Kemal
Alatan, A. Aydin
IEEE TRANSACTIONS ON MULTIMEDIA, 2014, 16 (05) : 1292 - 1302

← 1 2 3 4 5 →