Pattern-Affinitive Propagation across Depth, Surface Normal and Semantic Segmentation

被引:234
作者
Zhang, Zhenyu [1 ,3 ,4 ,5 ,6 ,7 ]
Cui, Zhen [1 ,3 ,4 ]
Xu, Chunyan [1 ,3 ,4 ]
Yan, Yan [1 ,3 ,4 ]
Sebe, Nicu [2 ,6 ,7 ]
Yang, Jian [1 ,3 ,4 ]
机构
[1] Nanjing Univ Sci & Technol, PCA Lab, Nanjing, Jiangsu, Peoples R China
[2] Univ Trento, Multimedia & Human Understanding Grp, Trento, Italy
[3] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Key Lab Intelligent Percept & Syst High Dimens In, PCA Lab,Minist Educ, Nanjing, Jiangsu, Peoples R China
[4] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Jiangsu Key Lab Image & Video Understanding Socia, Nanjing, Jiangsu, Peoples R China
[5] Univ Trento, Trento, Italy
[6] Univ Trento, Dept Informat Engn, Trento, Italy
[7] Univ Trento, Multimedia & Human Understanding Grp MHUG, Trento, Italy
来源
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019) | 2019年
关键词
D O I
10.1109/CVPR.2019.00423
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper; we propose a novel Pattern-Affinitive Propagation (PAP) framework to jointly predict depth, surface normal and semantic segmentation. The motivation behind it comes from the statistic observation that pattern-affinitive pairs recur much frequently across different tasks as well as within a task. Thus, we can conduct two types of propagations, cross-task propagation and task-specific propagation, to adaptively diffuse those similar patterns. The former integrates cross-task affinity patterns to adapt to each task therein through the calculation on non-local relationships. Next the latter performs an iterative diffusion in the feature space so that the cross-task affinity patterns can be widely-spread within the task. Accordingly, the learning of each task can be regularized and boosted by the complementary task-level affinities. Extensive experiments demonstrate the effectiveness and the superiority of our method on the joint three tasks. Meanwhile, we achieve the state-of-the-art or competitive results on the three related datasets, NYUD-nu 2, SUN-RGBD and KITTI.
引用
收藏
页码:4101 / 4110
页数:10
相关论文
共 62 条
[31]  
He K., ICCV
[32]   Guided Image Filtering [J].
He, Kaiming ;
Sun, Jian ;
Tang, Xiaoou .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (06) :1397-1409
[33]   STD2P: RGBD Semantic Segmentation using Spatio-Temporal Data-Driven Pooling [J].
He, Yang ;
Chiu, Wei-Chen ;
Keuper, Margret ;
Fritz, Mario .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :7158-7167
[34]  
Kendall A., 2017, Advances in Neural Information Processing Systems, V30
[35]   Unified Depth Prediction and Intrinsic Image Decomposition from a Single Image via Joint Convolutional Neural Fields [J].
Kim, Seungryong ;
Park, Kihong ;
Sohn, Kwanghoon ;
Lin, Stephen .
COMPUTER VISION - ECCV 2016, PT VIII, 2016, 9912 :143-159
[36]   UberNet: Training a Universal Convolutional Neural Network for Low-, Mid-, and High-Level Vision using Diverse Datasets and Limited Memory [J].
Kokkinos, Iasonas .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :5454-5463
[37]  
Kr_ahenb_uhl P., 2011, ADV NEURAL INF PROCE, P109, DOI DOI 10.5555/2986459.2986472
[38]   Deeper Depth Prediction with Fully Convolutional Residual Networks [J].
Laina, Iro ;
Rupprecht, Christian ;
Belagiannis, Vasileios ;
Tombari, Federico ;
Navab, Nassir .
PROCEEDINGS OF 2016 FOURTH INTERNATIONAL CONFERENCE ON 3D VISION (3DV), 2016, :239-248
[39]   A closed-form solution to natural image matting [J].
Levin, Anat ;
Lischinski, Dani ;
Weiss, Yair .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2008, 30 (02) :228-242
[40]  
Li B, 2015, PROC CVPR IEEE, P1119, DOI 10.1109/CVPR.2015.7298715