A survey on 3D object detection in real time for autonomous driving

Cited by: 2

Authors:

Contreras, Marcelo [1]

Jain, Aayush [2]

Bhatt, Neel P. [1]

Banerjee, Arunava [1]

Hashemi, Ehsan [1]

Affiliations:

[1] Univ Alberta, Edmonton, AB, Canada

[2] Indian Inst Technol Kharagpur, Kharagpur, W Bengal, India

Source:

FRONTIERS IN ROBOTICS AND AI | 2024 / Volume 11

Funding:

Natural Sciences and Engineering Research Council of Canada (NSERC);

Keywords:

3D object detection; autonomous navigation; visual navigation; robot perception; automated driving systems (ADS); visual-aided decision; DEPTH;

DOI:

10.3389/frobt.2024.1212070

Chinese Library Classification (CLC) number:

TP24 [Robotics];

Discipline classification codes:

080202 ; 1405 ;

Abstract:

This survey reviews advances in 3D object detection approaches for autonomous driving. A brief introduction to 2D object detection is given first, and the drawbacks of existing methodologies in highly dynamic environments are identified. Subsequently, this paper reviews state-of-the-art 3D object detection techniques that utilize monocular and stereo vision for reliable detection in urban settings. Based on depth inference basis, learning schemes, and internal representation, this work presents a method taxonomy of three classes: model-based and geometrically constrained approaches, end-to-end learning methodologies, and hybrid methods. A dedicated segment highlights the current trend of multi-view detectors as end-to-end methods, owing to their improved robustness. Detectors from the last two classes were specifically selected to exploit the autonomous driving context in terms of geometry, scene content, and instance distribution. To demonstrate the effectiveness of each method, 3D object detection datasets for autonomous vehicles are described along with their unique features (e.g., varying weather conditions, multi-modality, and multi-camera perspectives) and their respective metrics for different difficulty categories. In addition, we include multi-modal visual datasets, i.e., V2X, that may tackle the problems of single-view occlusion. Finally, current research trends in object detection are summarized, followed by a discussion of possible directions for future research in this domain.


Pages: 17
