Improving Semantic Segmentation via Video Propagation and Label Relaxation

被引：217

作者：

Zhu, Yi ^{[1
]}

Sapra, Karan ^{[2
]}

Reda, Fitsum A. ^{[2
]}

Shih, Kevin J. ^{[2
]}

Newsam, Shawn ^{[1
]}

Tao, Andrew ^{[2
]}

Catanzaro, Bryan ^{[2
]}

机构：

[1] Univ Calif Merced, Merced, CA 95343 USA

[2] Nvidia Corp, Santa Clara, CA USA

来源：

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019) | 2019年

关键词：

D O I：

10.1109/CVPR.2019.00906

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Semantic segmentation requires large amounts of pixel-wise annotations to learn accurate models. In this paper, we present a video prediction-based methodology to scale up training sets by synthesizing new training samples in order to improve the accuracy of semantic segmentation networks. We exploit video prediction models' ability to predict future frames in order to also predict future labels. A joint propagation strategy is also proposed to alleviate mis-alignments in synthesized samples. We demonstrate that training segmentation models on datasets augmented by the synthesized samples leads to significant improvements in accuracy. Furthermore, we introduce a novel boundary label relaxation technique that makes training robust to annotation noise and propagation artifacts along object boundaries. Our proposed methods achieve state-of-the-art mIoUs of 83.5% on Cityscapes and 82.9% on CamVid. Our single model, without model ensembles, achieves 72.8% mIoU on the KITTI semantic segmentation test set, which surpasses the winning entry of the ROB challenge 2018.

引用

页码：8848 / 8857

页数：10

共 50 条

[21] MULTI-LABEL PROPAGATION FOR COHERENT VIDEO SEGMENTATION AND ARTISTIC STYLIZATION
Wang, Tinghuai
Guillemaut, Jean-Yves
Collomosse, John
2010 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, 2010, : 3005 - 3008
[22] Improving Segmentation of the Inferior Alveolar Nerve through Deep Label Propagation
Cipriano, Marco
Allegretti, Stefano
Bolelli, Federico
Pollastri, Federico
Grana, Costantino
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 21105 - 21114
[23] Interactive shape co-segmentation via label propagation
Wu, Zizhao
Shou, Ruyang
Wang, Yunhai
Liu, Xinguo
COMPUTERS & GRAPHICS-UK, 2014, 38 : 248 - 254
[24] Label Propagation and Contrastive Regularization for Semisupervised Semantic Segmentation of Remote Sensing Images
Yang, Zhujun
Yan, Zhiyuan
Diao, Wenhui
Zhang, Qiang
Kang, Yuzhuo
Li, Junxi
Li, Xinming
Sun, Xian
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
[25] Improving Video Segmentation via Dynamic Anchor Queries
Zhou, Yikang
Zhang, Tao
Ji, Shunping
Yan, Shuicheng
Li, Xiangtai
COMPUTER VISION - ECCV 2024, PT L, 2025, 15108 : 446 - 463
[26] Noisy-LSTM: Improving Temporal Awareness for Video Semantic Segmentation
Wang, Bowen
Li, Liangzhi
Nakashima, Yuta
Kawasaki, Ryo
Nagahara, Hajime
Yagi, Yasushi
IEEE ACCESS, 2021, 9 : 46810 - 46820
[27] Semantic Object Segmentation via Detection in Weakly Labeled Video
Zhang, Yu
Chen, Xiaowu
Li, Jia
Wang, Chen
Xia, Changqun
2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2015, : 3641 - 3649
[28] Improving Semantic Segmentation via Efficient Self-Training
Zhu, Yi
Zhang, Zhongyue
Wu, Chongruo
Zhang, Zhi
He, Tong
Zhang, Hang
Manmatha, R.
Li, Mu
Smola, Alexander
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (03) : 1589 - 1602
[29] Improving Semantic Segmentation via Decoupled Body and Edge Information
Yu, Lintao
Yao, Anni
Duan, Jin
ENTROPY, 2023, 25 (06)
[30] Shot Boundary Detection and Label Propagation for Spatio-Temporal Video Segmentation
Piramanayagam, Sankaranarayanan
Saber, Eli
Cahill, Nathan D.
Messinger, David
IMAGE PROCESSING: MACHINE VISION APPLICATIONS VIII, 2015, 9405

← 1 2 3 4 5 →