Depth-conditioned Dynamic Message Propagation for Monocular 3D Object Detection

Cited by: 95

Authors:

Wang, Li [1]

Du, Liang [1]

Ye, Xiaoqing [2]

Fu, Yanwei [1]

Guo, Guodong [2]

Xue, Xiangyang [1]

Feng, Jianfeng [1]

Zhang, Li [1]

Affiliations:

[1] Fudan University, Shanghai, People's Republic of China

[2] Baidu Inc., Beijing, People's Republic of China

Source:

2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2021) | 2021

Funding:

National Key Research and Development Program;

Keywords:

DOI:

10.1109/CVPR46437.2021.00052

Chinese Library Classification (CLC) number:

TP18 [Artificial Intelligence Theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

The objective of this paper is to learn context- and depth-aware feature representation to solve the problem of monocular 3D object detection. We make the following contributions: (i) rather than appealing to the complicated pseudo-LiDAR based approach, we propose a depth-conditioned dynamic message propagation (DDMP) network to effectively integrate the multi-scale depth information with the image context; (ii) this is achieved by first adaptively sampling context-aware nodes in the image context and then dynamically predicting hybrid depth-dependent filter weights and affinity matrices for propagating information; (iii) by augmenting a center-aware depth encoding (CDE) task, our method successfully alleviates the inaccurate depth prior; (iv) we thoroughly demonstrate the effectiveness of our proposed approach and show state-of-the-art results among the monocular-based approaches on the KITTI benchmark dataset. Particularly, we rank 1st in the highly competitive KITTI monocular 3D object detection track on the submission day (November 16th, 2020).
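
The propagation step in contribution (ii) can be illustrated with a toy sketch. This is not the authors' DDMP network: it assumes the sampled neighbor indices are already given (the paper samples them adaptively), and the names `propagate`, `w_affinity`, and `w_filter` are hypothetical. It only shows the general idea that both the affinities over sampled nodes and the per-neighbor filter weights are predicted from a depth feature and then used to aggregate image features.

```python
import math


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def propagate(image_feats, depth_feats, neighbors, w_affinity, w_filter):
    """One round of depth-conditioned message passing (illustrative only).

    image_feats: per-node image feature vectors (length C each)
    depth_feats: per-node depth feature vectors (length D each)
    neighbors:   neighbors[i] lists the K node indices sampled for node i
                 (assumed given here; not sampled adaptively)
    w_affinity:  K x D weights mapping a depth feature to K affinity logits
    w_filter:    K x D weights mapping a depth feature to K filter weights
    """
    updated = []
    for i, feat in enumerate(image_feats):
        d = depth_feats[i]
        # Affinities over the K sampled neighbors, predicted from depth.
        affinity = softmax([dot(row, d) for row in w_affinity])
        # Scalar filter weight per neighbor, also predicted from depth.
        gates = [math.tanh(dot(row, d)) for row in w_filter]
        message = [0.0] * len(feat)
        for k, j in enumerate(neighbors[i]):
            for c, value in enumerate(image_feats[j]):
                message[c] += affinity[k] * gates[k] * value
        # Residual update: original feature plus aggregated message.
        updated.append([f + m for f, m in zip(feat, message)])
    return updated


# Tiny example: 3 nodes, 2 image channels, 1 depth channel, K = 2 neighbors.
image_feats = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
depth_feats = [[0.5], [1.0], [2.0]]
neighbors = [[1, 2], [0, 2], [0, 1]]
w_affinity = [[0.0], [0.0]]  # zero logits give uniform affinities
w_filter = [[1.0], [1.0]]
out = propagate(image_feats, depth_feats, neighbors, w_affinity, w_filter)
```

In this sketch a node with a larger depth feature receives a more strongly weighted message, which is one simple way depth can condition how image context is propagated.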


Pages: 454-463

Page count: 10

References: 52 in total
