A survey on 3D object detection in real time for autonomous driving

被引:2
|
作者
Contreras, Marcelo [1 ]
Jain, Aayush [2 ]
Bhatt, Neel P. [1 ]
Banerjee, Arunava [1 ]
Hashemi, Ehsan [1 ]
机构
[1] Univ Alberta, Edmonton, AB, Canada
[2] Indian Inst Technol Kharagpur, Kharagpur, W Bengal, India
来源
FRONTIERS IN ROBOTICS AND AI | 2024年 / 11卷
基金
加拿大自然科学与工程研究理事会;
关键词
3D object detection; autonomous navigation; visual navigation; robot perception; automated driving systems (ADS); visual-aided decision; DEPTH;
D O I
10.3389/frobt.2024.1212070
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
This survey reviews advances in 3D object detection approaches for autonomous driving. A brief introduction to 2D object detection is first discussed and drawbacks of the existing methodologies are identified for highly dynamic environments. Subsequently, this paper reviews the state-of-the-art 3D object detection techniques that utilizes monocular and stereo vision for reliable detection in urban settings. Based on depth inference basis, learning schemes, and internal representation, this work presents a method taxonomy of three classes: model-based and geometrically constrained approaches, end-to-end learning methodologies, and hybrid methods. There is highlighted segment for current trend of multi-view detectors as end-to-end methods due to their boosted robustness. Detectors from the last two kinds were specially selected to exploit the autonomous driving context in terms of geometry, scene content and instances distribution. To prove the effectiveness of each method, 3D object detection datasets for autonomous vehicles are described with their unique features, e. g., varying weather conditions, multi-modality, multi camera perspective and their respective metrics associated to different difficulty categories. In addition, we included multi-modal visual datasets, i. e., V2X that may tackle the problems of single-view occlusion. Finally, the current research trends in object detection are summarized, followed by a discussion on possible scope for future research in this domain.
引用
收藏
页数:17
相关论文
共 50 条
  • [31] Real-time 3D capturing-visualization conversion for light field microscopy
    Lee, Byoungho
    Kim, Jonghyun
    INTERNATIONAL CONFERENCE ON OPTICS IN PRECISION ENGINEERING AND NANOTECHNOLOGY (ICOPEN2013), 2013, 8769
  • [32] Representation, Analysis, and Recognition of 3D Humans: A Survey
    Berretti, Stefano
    Daoudi, Mohamed
    Turaga, Pavan
    Basu, Anup
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2018, 14 (01)
  • [33] A comprehensive survey on 3D face recognition methods
    Li, Menghan
    Huang, Bin
    Tian, Guohui
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2022, 110
  • [34] Eliminating the Blind Spot: Adapting 3D Object Detection and Monocular Depth Estimation to 360° Panoramic Imagery
    de La Garanderie, Gregoire Payen
    Abarghouei, Amir Atapour
    Breckon, Toby P.
    COMPUTER VISION - ECCV 2018, PT XIII, 2018, 11217 : 812 - 830
  • [35] High-speed simultaneous measurement of depth and normal for real-time 3D reconstruction
    Miyashita, Leo
    Kimura, Yohta
    Tabata, Satoshi
    Ishikawa, Masatoshi
    APPLICATIONS OF DIGITAL IMAGE PROCESSING XLIV, 2021, 11842
  • [36] Hydra: A Real-time Spatial Perception System for 3D Scene Graph Construction and Optimization
    Hughes, Nathan
    Chang, Yun
    Carlone, Luca
    ROBOTICS: SCIENCE AND SYSTEM XVIII, 2022,
  • [37] An Innovative No-Reference Metric for Real-Time 3D Stereoscopic Video Quality Assessment
    Han, Yi
    Yuan, Zhenhui
    Muntean, Gabriel-Miro
    IEEE TRANSACTIONS ON BROADCASTING, 2016, 62 (03) : 654 - 663
  • [38] Military Object Real-Time Detection Technology Combined with Visual Salience and Psychology
    Hua, Xia
    Wang, Xinqing
    Wang, Dong
    Huang, Jie
    Hu, Xiaodong
    ELECTRONICS, 2018, 7 (10)
  • [39] Recover 3D Information of the Moving Object from Video Streams
    Zheng, Yu-tong
    Li, Ming
    Liao, Fang
    PROCEEDINGS OF 3RD INTERNATIONAL CONFERENCE ON MULTIMEDIA TECHNOLOGY (ICMT-13), 2013, 84 : 1656 - 1664
  • [40] Feature-Based Monocular Dynamic 3D Object Reconstruction
    Jin, Shaokun
    Ou, Yongsheng
    SOCIAL ROBOTICS, ICSR 2018, 2018, 11357 : 380 - 389