Pattern-Affinitive Propagation across Depth, Surface Normal and Semantic Segmentation

被引：234

作者：

Zhang, Zhenyu ^{[1
,3
,4
,5
,6
,7
]}

Cui, Zhen ^{[1
,3
,4
]}

Xu, Chunyan ^{[1
,3
,4
]}

Yan, Yan ^{[1
,3
,4
]}

Sebe, Nicu ^{[2
,6
,7
]}

Yang, Jian ^{[1
,3
,4
]}

机构：

[1] Nanjing Univ Sci & Technol, PCA Lab, Nanjing, Jiangsu, Peoples R China

[2] Univ Trento, Multimedia & Human Understanding Grp, Trento, Italy

[3] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Key Lab Intelligent Percept & Syst High Dimens In, PCA Lab,Minist Educ, Nanjing, Jiangsu, Peoples R China

[4] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Jiangsu Key Lab Image & Video Understanding Socia, Nanjing, Jiangsu, Peoples R China

[5] Univ Trento, Trento, Italy

[6] Univ Trento, Dept Informat Engn, Trento, Italy

[7] Univ Trento, Multimedia & Human Understanding Grp MHUG, Trento, Italy

来源：

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019) | 2019年

关键词：

D O I：

10.1109/CVPR.2019.00423

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper; we propose a novel Pattern-Affinitive Propagation (PAP) framework to jointly predict depth, surface normal and semantic segmentation. The motivation behind it comes from the statistic observation that pattern-affinitive pairs recur much frequently across different tasks as well as within a task. Thus, we can conduct two types of propagations, cross-task propagation and task-specific propagation, to adaptively diffuse those similar patterns. The former integrates cross-task affinity patterns to adapt to each task therein through the calculation on non-local relationships. Next the latter performs an iterative diffusion in the feature space so that the cross-task affinity patterns can be widely-spread within the task. Accordingly, the learning of each task can be regularized and boosted by the complementary task-level affinities. Extensive experiments demonstrate the effectiveness and the superiority of our method on the joint three tasks. Meanwhile, we achieve the state-of-the-art or competitive results on the three related datasets, NYUD-nu 2, SUN-RGBD and KITTI.

引用

页码：4101 / 4110

页数：10

共 62 条

[31]

He K., ICCV

[32] Guided Image Filtering [J].

He, Kaiming ;

Sun, Jian ;

Tang, Xiaoou .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (06) :1397-1409

[33] STD2P: RGBD Semantic Segmentation using Spatio-Temporal Data-Driven Pooling [J].

He, Yang ;

Chiu, Wei-Chen ;

Keuper, Margret ;

Fritz, Mario .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :7158-7167

[34]

Kendall A., 2017, Advances in Neural Information Processing Systems, V30

[35] Unified Depth Prediction and Intrinsic Image Decomposition from a Single Image via Joint Convolutional Neural Fields [J].

Kim, Seungryong ;

Park, Kihong ;

Sohn, Kwanghoon ;

Lin, Stephen .

COMPUTER VISION - ECCV 2016, PT VIII, 2016, 9912 :143-159

[36] UberNet: Training a Universal Convolutional Neural Network for Low-, Mid-, and High-Level Vision using Diverse Datasets and Limited Memory [J].

Kokkinos, Iasonas .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :5454-5463

[37]

Kr_ahenb_uhl P., 2011, ADV NEURAL INF PROCE, P109, DOI DOI 10.5555/2986459.2986472

[38] Deeper Depth Prediction with Fully Convolutional Residual Networks [J].

Laina, Iro ;

Rupprecht, Christian ;

Belagiannis, Vasileios ;

Tombari, Federico ;

Navab, Nassir .

PROCEEDINGS OF 2016 FOURTH INTERNATIONAL CONFERENCE ON 3D VISION (3DV), 2016, :239-248

[39] A closed-form solution to natural image matting [J].

Levin, Anat ;

Lischinski, Dani ;

Weiss, Yair .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2008, 30 (02) :228-242

[40]

Li B, 2015, PROC CVPR IEEE, P1119, DOI 10.1109/CVPR.2015.7298715

← 1 2 3 4 5 6 7 →