Context-Aware 3D Object Detection From a Single Image in Autonomous Driving

被引:8
作者
Zhou, Dingfu [1 ,2 ]
Song, Xibin [1 ,2 ]
Fang, Jin [1 ,2 ]
Dai, Yuchao [3 ]
Li, Hongdong [4 ]
Zhang, Liangjun [1 ,2 ]
机构
[1] Baidu Res, Robot & Autonomous Driving Lab, Beijing 100085, Peoples R China
[2] Natl Engn Lab Deep Learning Technol & Applicat, Beijing 100193, Peoples R China
[3] Northwestern Polytech Univ, Sch Elect & Informat, Xian 710060, Peoples R China
[4] Australian Natl Univ, Coll Engn & Comp Sci, Canberra, ACT 0200, Australia
基金
澳大利亚研究理事会; 中国国家自然科学基金;
关键词
Three-dimensional displays; Object detection; Training; Feature extraction; Task analysis; Sensors; Detectors; Monocular 3D object detection; context-aware feature aggregation; self-attention; RECOGNITION; MODEL;
D O I
10.1109/TITS.2022.3154022
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Camera sensors have been widely used in Driver-Assistance and Autonomous Driving Systems due to their rich texture information. Recently, with the development of deep learning techniques, many approaches have been proposed to detect objects in 3D from a single frame, however, there is still much room for improvement. In this paper, we generally review the recently proposed state-of-the-art monocular-based 3D object detection approaches first. Based on the analysis of the disadvantage of previous center-based frameworks, a novel feature aggregation strategy has been proposed to boost the 3D object detection by exploring the context information. Specifically, an Instance-Guided Spatial Attention (IGSA) module is proposed to collect the local instance information and the Channel-Wise Feature Attention (CWFA) module is employed for aggregating the global context information. In addition, an instance-guided object regression strategy is also proposed to alleviate the influence of center location prediction uncertainty in the inference process. Finally, the proposed approach has been verified on the public 3D object detection benchmark. The experimental results show that the proposed approach can significantly boost the performance of the baseline method on both 3D detection and 2D Bird's-Eye View among all three categories. Furthermore, our method outperforms all the monocular-based methods (even these trained with depth as auxiliary inputs) and achieves state-of-the-art performance on the KITTI benchmark.
引用
收藏
页码:18568 / 18580
页数:13
相关论文
共 50 条
  • [1] Ground-Aware Monocular 3D Object Detection for Autonomous Driving
    Liu, Yuxuan
    Yixuan, Yuan
    Liu, Ming
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (02): : 919 - 926
  • [2] Shape-Aware Monocular 3D Object Detection
    Chen, Wei
    Zhao, Jie
    Zhao, Wan-Lei
    Wu, Song-Yuan
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (06) : 6416 - 6424
  • [3] Context-Aware Dynamic Feature Extraction for 3D Object Detection in Point Clouds
    Tian, Yonglin
    Huang, Lichao
    Yu, Hui
    Wu, Xiangbin
    Li, Xuesong
    Wang, Kunfeng
    Wang, Zilei
    Wang, Fei-Yue
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (08) : 10773 - 10785
  • [4] RI-Fusion: 3D Object Detection Using Enhanced Point Features With Range-Image Fusion for Autonomous Driving
    Zhang, Xinyu
    Wang, Li
    Zhang, Guoxin
    Lan, Tianwei
    Zhang, Haoming
    Zhao, Lijun
    Li, Jun
    Zhu, Lei
    Liu, Huaping
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2023, 72
  • [5] Density Aware 3D Object Single Stage Detector
    Ning, Jingmei
    Da, Feipeng
    Gai, Shaoyan
    IEEE SENSORS JOURNAL, 2021, 21 (20) : 23108 - 23117
  • [6] Exploring Diversity-Based Active Learning for 3D Object Detection in Autonomous Driving
    Lin, Jinpeng
    Liang, Zhihao
    Deng, Shengheng
    Cai, Lile
    Jiang, Tao
    Li, Tianrui
    Jia, Kui
    Xu, Xun
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (11) : 15454 - 15466
  • [7] Deep Learning-Based Image 3-D Object Detection for Autonomous Driving: Review
    Alaba, Simegnew Yihunie
    Ball, John E.
    IEEE SENSORS JOURNAL, 2023, 23 (04) : 3378 - 3394
  • [8] Context-aware 3D object anchoring for mobile robots
    Guenther, Martin
    Ruiz-Sarmiento, J. R.
    Galindo, Cipriano
    Gonzalez-Jimenez, Javier
    Hertzberg, Joachim
    ROBOTICS AND AUTONOMOUS SYSTEMS, 2018, 110 : 12 - 32
  • [9] Bridging Multi-Scale Context-Aware Representation for Object Detection
    Wang, Boying
    Ji, Ruyi
    Zhang, Libo
    Wu, Yanjun
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (05) : 2317 - 2329
  • [10] Performance and Challenges of 3D Object Detection Methods in Complex Scenes for Autonomous Driving
    Wang, Ke
    Zhou, Tianqiang
    Li, Xingcan
    Ren, Fan
    IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2023, 8 (02): : 1699 - 1716