Monocular Differentiable Rendering for Self-supervised 3D Object Detection

被引:28
作者
Beker, Deniz [1 ]
Kato, Hiroharu [1 ]
Morariu, Mihai Adrian [1 ]
Ando, Takahiro [1 ]
Matsuoka, Toru [1 ]
Kehl, Wadim [2 ]
Gaidon, Adrien [3 ]
机构
[1] Preferred Networks Inc, Chiyoda City, Japan
[2] Toyota Res Inst Adv Dev, Chuo City, Japan
[3] Toyota Res Inst, Los Altos, CA USA
来源
COMPUTER VISION - ECCV 2020, PT XXI | 2020年 / 12366卷
关键词
D O I
10.1007/978-3-030-58589-1_31
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
3D object detection from monocular images is an ill-posed problem due to the projective entanglement of depth and scale. To overcome this ambiguity, we present a novel self-supervised method for textured 3D shape reconstruction and pose estimation of rigid objects with the help of strong shape priors and 2D instance masks. Our method predicts the 3D location and meshes of each object in an image using differentiable rendering and a self-supervised objective derived from a pretrained monocular depth estimation network. We use the KITTI 3D object detection dataset to evaluate the accuracy of the method. Experiments demonstrate that we can effectively use noisy monocular depth and differentiable rendering as an alternative to expensive 3D ground-truth labels or LiDAR information.
引用
收藏
页码:514 / 529
页数:16
相关论文
共 38 条
[21]   Accurate Monocular 3D Object Detection via Color-Embedded 3D Reconstruction for Autonomous Driving [J].
Ma, Xinzhu ;
Wang, Zhihui ;
Li, Haojie ;
Zhang, Pengbo ;
Ouyang, Wanli ;
Fan, Xin .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :6850-6859
[22]   Unsupervised Learning of Depth and Ego-Motion from Monocular Video Using 3D Geometric Constraints [J].
Mahjourian, Reza ;
Wicke, Martin ;
Angelova, Anelia .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :5667-5675
[23]   ROI-10D: Monocular Lifting of 2D Detection to 6D Pose and Metric Shape [J].
Manhardt, Fabian ;
Kehl, Wadim ;
Gaidon, Adrien .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :2064-2073
[24]   3D Bounding Box Estimation Using Deep Learning and Geometry [J].
Mousavian, Arsalan ;
Anguelov, Dragomir ;
Flynn, John ;
Kosecka, Jana .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :5632-5640
[25]  
Peng BH, 2019, ADV NEUR IN, V32
[26]  
Pillai S, 2019, IEEE INT CONF ROBOT, P9250, DOI [10.1109/ICRA.2019.8793621, 10.1109/icra.2019.8793621]
[27]  
Rui Wang, 2020, 2020 IEEE International Conference on Robotics and Automation (ICRA), P11067, DOI 10.1109/ICRA40945.2020.9197095
[28]  
Simonelli A., 2019, ARXIV
[29]   Disentangling Monocular 3D Object Detection [J].
Simonelli, Andrea ;
Bulo, Samuel Rota ;
Porzi, Lorenzo ;
Lopez-Antequera, Manuel ;
Kontschieder, Peter .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :1991-1999
[30]  
Stutz D., 2018, IJCV