3D Object Detection for Autonomous Driving: A Comprehensive Survey

被引:162
作者
Mao, Jiageng [1 ]
Shi, Shaoshuai [2 ]
Wang, Xiaogang [1 ,3 ]
Li, Hongsheng [1 ,3 ,4 ]
机构
[1] Chinese Univ Hong Kong, Hong Kong, Peoples R China
[2] Max Planck Inst Informat, Saarbrucken, Germany
[3] Ctr Perceptual & Interact Intelligence, Hong Kong, Peoples R China
[4] Shanghai AI Lab, Shanghai, Peoples R China
关键词
3D object detection; Perception; Autonomous driving; Deep learning; Computer vision; Robotics; POINT CLOUD; END;
D O I
10.1007/s11263-023-01790-1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Autonomous driving, in recent years, has been receiving increasing attention for its potential to relieve drivers' burdens and improve the safety of driving. In modern autonomous driving pipelines, the perception system is an indispensable component, aiming to accurately estimate the status of surrounding environments and provide reliable observations for prediction and planning. 3D object detection, which aims to predict the locations, sizes, and categories of the 3D objects near an autonomous vehicle, is an important part of a perception system. This paper reviews the advances in 3D object detection for autonomous driving. First, we introduce the background of 3D object detection and discuss the challenges in this task. Second, we conduct a comprehensive survey of the progress in 3D object detection from the aspects of models and sensory inputs, including LiDAR-based, camera-based, and multi-modal detection approaches. We also provide an in-depth analysis of the potentials and challenges in each category of methods. Additionally, we systematically investigate the applications of 3D object detection in driving systems. Finally, we conduct a performance analysis of the 3D object detection approaches, and we further summarize the research trends over the years and prospect the future directions of this area.
引用
收藏
页码:1909 / 1963
页数:55
相关论文
共 378 条
[1]   Augmented Reality Meets Computer Vision: Efficient Data Generation for Urban Driving Scenes [J].
Abu Alhaija, Hassan ;
Mustikovela, Siva Karthik ;
Mescheder, Lars ;
Geiger, Andreas ;
Rother, Carsten .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2018, 126 (09) :961-972
[2]  
Aghdam H. H., 2021, CVPR
[3]   YOLO3D: End-to-End Real-Time 3D Oriented Object Bounding Box Detection from LiDAR Point Cloud [J].
Ali, Waleed ;
Abdelkarim, Sherif ;
Zidan, Mahmoud ;
Zahran, Mohamed ;
El Sallab, Ahmad .
COMPUTER VISION - ECCV 2018 WORKSHOPS, PT III, 2019, 11131 :716-728
[4]  
Amini A, 2020, IEEE ROBOT AUTOM LET, V5, P1143, DOI [10.29209/id.v5i1.100, 10.1109/LRA.2020.2966414]
[5]   A Survey on 3D Object Detection Methods for Autonomous Driving Applications [J].
Arnold, Eduardo ;
Al-Jarrah, Omar Y. ;
Dianati, Mehrdad ;
Fallah, Saber ;
Oxtoby, David ;
Mouzakitis, Alex .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2019, 20 (10) :3782-3795
[6]   TransFusion: Robust LiDAR-Camera Fusion for 3D Object Detection with Transformers [J].
Bai, Xuyang ;
Hu, Zeyu ;
Zhu, Xinge ;
Huang, Qingqiu ;
Chen, Yilun ;
Fu, Hangbo ;
Tai, Chiew-Lan .
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, :1080-1089
[7]   MonoFENet: Monocular 3D Object Detection With Feature Enhancement Networks [J].
Bao, Wentao ;
Xu, Bin ;
Chen, Zhenzhong .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 :2753-2765
[8]  
Barrera A, 2020, IEEE INT C INTELL TR
[9]   Monocular Differentiable Rendering for Self-supervised 3D Object Detection [J].
Beker, Deniz ;
Kato, Hiroharu ;
Morariu, Mihai Adrian ;
Ando, Takahiro ;
Matsuoka, Toru ;
Kehl, Wadim ;
Gaidon, Adrien .
COMPUTER VISION - ECCV 2020, PT XXI, 2020, 12366 :514-529
[10]  
Beltrán J, 2018, IEEE INT C INTELL TR, P3517, DOI 10.1109/ITSC.2018.8569311