Depth-conditioned Dynamic Message Propagation for Monocular 3D Object Detection

被引:95
作者
Wang, Li [1 ]
Du, Liang [1 ]
Ye, Xiaoqing [2 ]
Fu, Yanwei [1 ]
Guo, Guodong [2 ]
Xue, Xiangyang [1 ]
Feng, Jianfeng [1 ]
Zhang, Li [1 ]
机构
[1] Fudan Univ, Shanghai, Peoples R China
[2] Baidu Inc, Beijing, Peoples R China
来源
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021 | 2021年
基金
国家重点研发计划;
关键词
D O I
10.1109/CVPR46437.2021.00052
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The objective of this paper is to learn context- and depth-aware feature representation to solve the problem of monocular 3D object detection. We make following contributions: (i) rather than appealing to the complicated pseudo-LiDAR based approach, we propose a depth-conditioned dynamic message propagation (DDMP) network to effectively integrate the multi-scale depth information with the image context; (ii) this is achieved by first adaptively sampling context-aware nodes in the image context and then dynamically predicting hybrid depth-dependent filter weights and affinity matrices for propagating information; (Hi) by augmenting a center-aware depth encoding (CDE) task, our method successfully alleviates the inaccurate depth prior; (iv) we thoroughly demonstrate the effectiveness of our proposed approach and show state-of-the-art results among the monocular-based approaches on the KITTI benchmark dataset. Particularly, we rank 1st in the highly competitive KITTI monocular 3D object detection track on the submission day (November 16th, 2020).
引用
收藏
页码:454 / 463
页数:10
相关论文
共 52 条
[1]  
[Anonymous], 2019, AAAI
[2]  
[Anonymous], 2019, CVPR, DOI DOI 10.1109/CVPR.2019.00149
[3]  
[Anonymous], 2019, CVPR, DOI DOI 10.1109/CVPR.2019.00217
[4]   Kinematic 3D Object Detection in Monocular Video [J].
Brazil, Garrick ;
Pons-Moll, Gerard ;
Liu, Xiaoming ;
Schiele, Bernt .
COMPUTER VISION - ECCV 2020, PT XXIII, 2020, 12368 :135-152
[5]   M3D-RPN: Monocular 3D Region Proposal Network for Object Detection [J].
Brazil, Garrick ;
Liu, Xiaoming .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :9286-9295
[6]  
Cai YJ, 2020, AAAI CONF ARTIF INTE, V34, P10478
[7]   Pyramid Stereo Matching Network [J].
Chang, Jia-Ren ;
Chen, Yong-Sheng .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :5410-5418
[8]   Multi-View 3D Object Detection Network for Autonomous Driving [J].
Chen, Xiaozhi ;
Ma, Huimin ;
Wan, Ji ;
Li, Bo ;
Xia, Tian .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :6526-6534
[9]   Domain Adaptive Image-to-image Translation [J].
Chen, Ying-Cong ;
Xu, Xiaogang ;
Jia, Jiaya .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :5273-5282
[10]   Deformable Convolutional Networks [J].
Dai, Jifeng ;
Qi, Haozhi ;
Xiong, Yuwen ;
Li, Yi ;
Zhang, Guodong ;
Hu, Han ;
Wei, Yichen .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :764-773