3D Object Detection for Autonomous Driving: A Comprehensive Survey

被引：162

作者：

Mao, Jiageng ^{[1
]}

Shi, Shaoshuai ^{[2
]}

Wang, Xiaogang ^{[1
,3
]}

Li, Hongsheng ^{[1
,3
,4
]}

机构：

[1] Chinese Univ Hong Kong, Hong Kong, Peoples R China

[2] Max Planck Inst Informat, Saarbrucken, Germany

[3] Ctr Perceptual & Interact Intelligence, Hong Kong, Peoples R China

[4] Shanghai AI Lab, Shanghai, Peoples R China

来源：

INTERNATIONAL JOURNAL OF COMPUTER VISION | 2023年 / 131卷 / 08期

关键词：

3D object detection; Perception; Autonomous driving; Deep learning; Computer vision; Robotics; POINT CLOUD; END;

D O I：

10.1007/s11263-023-01790-1

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Autonomous driving, in recent years, has been receiving increasing attention for its potential to relieve drivers' burdens and improve the safety of driving. In modern autonomous driving pipelines, the perception system is an indispensable component, aiming to accurately estimate the status of surrounding environments and provide reliable observations for prediction and planning. 3D object detection, which aims to predict the locations, sizes, and categories of the 3D objects near an autonomous vehicle, is an important part of a perception system. This paper reviews the advances in 3D object detection for autonomous driving. First, we introduce the background of 3D object detection and discuss the challenges in this task. Second, we conduct a comprehensive survey of the progress in 3D object detection from the aspects of models and sensory inputs, including LiDAR-based, camera-based, and multi-modal detection approaches. We also provide an in-depth analysis of the potentials and challenges in each category of methods. Additionally, we systematically investigate the applications of 3D object detection in driving systems. Finally, we conduct a performance analysis of the 3D object detection approaches, and we further summarize the research trends over the years and prospect the future directions of this area.

引用

页码：1909 / 1963

页数：55

共 378 条

[1] Augmented Reality Meets Computer Vision: Efficient Data Generation for Urban Driving Scenes [J].