Ground-Aware Monocular 3D Object Detection for Autonomous Driving

Cited by: 92
Authors
Liu, Yuxuan [1]
Yuan, Yixuan [2]
Liu, Ming [1]
Affiliations
[1] Hong Kong Univ Sci & Technol, Dept Elect & Comp Engn, Robot & Multipercept Lab, Hong Kong, Peoples R China
[2] City Univ Hong Kong, Dept Elect Engn, Hong Kong, Peoples R China
Source
IEEE ROBOTICS AND AUTOMATION LETTERS | 2021 / Vol. 6 / No. 2
Keywords
Three-dimensional displays; Cameras; Object detection; Two dimensional displays; Feature extraction; Convolution; Neural networks; Automation technologies for smart cities; deep learning for visual perception; object detection; segmentation and categorization;
DOI
10.1109/LRA.2021.3052442
Chinese Library Classification number
TP24 [Robotics];
Discipline classification code
080202 ; 1405 ;
Abstract
Estimating the 3D position and orientation of objects in the environment with a single RGB camera is a critical and challenging task for low-cost urban autonomous driving and mobile robots. Most of the existing algorithms are based on the geometric constraints in 2D-3D correspondence, which stems from generic 6D object pose estimation. We first identify how the ground plane provides additional clues in depth reasoning in 3D detection in driving scenes. Based on this observation, we then improve the processing of 3D anchors and introduce a novel neural network module to fully utilize such application-specific priors in the framework of deep learning. Finally, we introduce an efficient neural network embedded with the proposed module for 3D object detection. We further verify the power of the proposed module with a neural network designed for monocular depth prediction. The two proposed networks achieve state-of-the-art performances on the KITTI 3D object detection and depth prediction benchmarks, respectively.
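The ground-plane cue mentioned in the abstract can be illustrated with a minimal sketch. It assumes a pinhole camera mounted at a known height above perfectly flat ground, with zero pitch and roll; under those assumptions a ground point at depth Z projects to image row v = c_y + f_y * h / Z, so the depth follows from the row alone. The function name and the numeric values are hypothetical and chosen only for illustration; this is not the network module proposed in the paper.

```python
import math


def ground_depth_prior(v, fy, cy, cam_height):
    """Depth in metres of a flat-ground point imaged at pixel row v.

    Assumptions (illustrative, not taken from the paper's code):
    pinhole camera, optical axis parallel to the ground, image rows
    increasing downward, camera cam_height metres above the ground.
    """
    dv = v - cy
    if dv <= 0:
        # Rows at or above the horizon never intersect the ground plane.
        return math.inf
    # From v = cy + fy * cam_height / Z, solve for Z.
    return fy * cam_height / dv


# Hypothetical intrinsics and mounting height, for illustration only.
FY, CY, CAM_HEIGHT = 720.0, 180.0, 1.65

# Rows further below the horizon correspond to closer ground points.
depths = [ground_depth_prior(v, FY, CY, CAM_HEIGHT) for v in (200, 240, 360)]
```

With these illustrative values, a ground contact point at row 240 lies about 19.8 m away, and the depth shrinks as the row moves toward the bottom of the image.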
Pages: 919-926
Page count: 8