A survey on 3D object detection in real time for autonomous driving

Cited by: 2
Authors
Contreras, Marcelo [1 ]
Jain, Aayush [2 ]
Bhatt, Neel P. [1 ]
Banerjee, Arunava [1 ]
Hashemi, Ehsan [1 ]
Affiliations
[1] Univ Alberta, Edmonton, AB, Canada
[2] Indian Inst Technol Kharagpur, Kharagpur, W Bengal, India
Source
FRONTIERS IN ROBOTICS AND AI | 2024 / Vol. 11
Funding
Natural Sciences and Engineering Research Council of Canada;
Keywords
3D object detection; autonomous navigation; visual navigation; robot perception; automated driving systems (ADS); visual-aided decision; DEPTH;
DOI
10.3389/frobt.2024.1212070
Chinese Library Classification number
TP24 [Robotics];
Discipline classification code
080202 ; 1405 ;
Abstract
This survey reviews advances in 3D object detection approaches for autonomous driving. It first gives a brief introduction to 2D object detection and identifies the drawbacks of existing methodologies in highly dynamic environments. It then reviews state-of-the-art 3D object detection techniques that utilize monocular and stereo vision for reliable detection in urban settings. Based on the depth inference basis, learning scheme, and internal representation, this work presents a method taxonomy of three classes: model-based and geometrically constrained approaches, end-to-end learning methodologies, and hybrid methods. A dedicated segment highlights the current trend of multi-view detectors as end-to-end methods, owing to their improved robustness. Detectors from the latter two classes were specifically selected to exploit the autonomous driving context in terms of geometry, scene content, and instance distribution. To assess the effectiveness of each method, 3D object detection datasets for autonomous vehicles are described along with their unique features, e.g., varying weather conditions, multi-modality, and multi-camera perspectives, and the respective metrics associated with different difficulty categories. In addition, multi-modal visual datasets are included, i.e., V2X datasets that may tackle the problem of single-view occlusion. Finally, current research trends in object detection are summarized, followed by a discussion of the possible scope for future research in this domain.
Pages: 17
Related papers
50 in total
  • [21] A new real-time algorithm for Multiview Disparity based 3D-object segmentation
    Zhang, YC
    He, Y
    VISUAL COMMUNICATIONS AND IMAGE PROCESSING 2001, 2001, 4310 : 702 - 712
  • [22] Real-time capturing and 3D visualization method based on integral imaging
    Kim, Jonghyun
    Jung, Jae-Hyun
    Jang, Changwon
    Lee, Byoungho
    OPTICS EXPRESS, 2013, 21 (16) : 18742 - 18753
  • [23] Real-time estimation of 3D scene geometry from a single image
    Jung, Chanho
    Kim, Changick
    PATTERN RECOGNITION, 2012, 45 (09) : 3256 - 3269
  • [24] Development of the 3D measurement system in real-time for micro-manipulation
    Ohara, Kenichi
    Kojima, Masaru
    Takagi, Shota
    Horade, Mitsuhiro
    Mae, Yasushi
    Arai, Tatsuo
    2017 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS AND AUTOMATION (ICMA), 2017, : 2022 - 2027
  • [25] SparseLIF: High-Performance Sparse LiDAR-Camera Fusion for 3D Object Detection
    Zhang, Hongcheng
    Liang, Liu
    Zeng, Pengxin
    Song, Xiao
    Wang, Zhe
    COMPUTER VISION-ECCV 2024, PT XXXV, 2025, 15093 : 109 - 128
  • [26] A Combined 2D-3D Object Detection Framework
    Amara, Kahina
    Djekoune, Oualid
    Achour, Nouara
    Belhocine, Mahmoud
    Bellal, Rima Narimene
    IETE JOURNAL OF RESEARCH, 2017, 63 (05) : 607 - 615
  • [27] Sparse RGB-D images create a real thing: A flexible voxel based 3D reconstruction pipeline for single object
    Luo, Fei
    Zhu, Yongqiong
    Fu, Yanping
    Zhou, Huajian
    Chen, Zezheng
    Xiao, Chunxia
    VISUAL INFORMATICS, 2023, 7 (01) : 66 - 76
  • [28] An End-to-End Real-Time 3D System for Integral Photography Display
    Zhang, Shenghao
    Wang, Zhenyu
    Zhu, Mingtong
    Wang, Ronggang
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING, PT I, 2018, 11164 : 246 - 256
  • [29] Exploring RGB+Depth Fusion for Real-Time Object Detection
    Ophoff, Tanguy
    Van Beeck, Kristof
    Goedeme, Toon
    SENSORS, 2019, 19 (04)
  • [30] Visualization pipeline of autonomous driving scenes based on FCCR-3D reconstruction
    Bai, Ling
    Li, Yinguo
    Zhou, Zhongkui
    JOURNAL OF ELECTRONIC IMAGING, 2022, 31 (03)