Context-Aware 3D Object Detection From a Single Image in Autonomous Driving

被引:8
|
作者
Zhou, Dingfu [1 ,2 ]
Song, Xibin [1 ,2 ]
Fang, Jin [1 ,2 ]
Dai, Yuchao [3 ]
Li, Hongdong [4 ]
Zhang, Liangjun [1 ,2 ]
机构
[1] Baidu Res, Robot & Autonomous Driving Lab, Beijing 100085, Peoples R China
[2] Natl Engn Lab Deep Learning Technol & Applicat, Beijing 100193, Peoples R China
[3] Northwestern Polytech Univ, Sch Elect & Informat, Xian 710060, Peoples R China
[4] Australian Natl Univ, Coll Engn & Comp Sci, Canberra, ACT 0200, Australia
基金
澳大利亚研究理事会; 中国国家自然科学基金;
关键词
Three-dimensional displays; Object detection; Training; Feature extraction; Task analysis; Sensors; Detectors; Monocular 3D object detection; context-aware feature aggregation; self-attention; RECOGNITION; MODEL;
D O I
10.1109/TITS.2022.3154022
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Camera sensors have been widely used in Driver-Assistance and Autonomous Driving Systems due to their rich texture information. Recently, with the development of deep learning techniques, many approaches have been proposed to detect objects in 3D from a single frame, however, there is still much room for improvement. In this paper, we generally review the recently proposed state-of-the-art monocular-based 3D object detection approaches first. Based on the analysis of the disadvantage of previous center-based frameworks, a novel feature aggregation strategy has been proposed to boost the 3D object detection by exploring the context information. Specifically, an Instance-Guided Spatial Attention (IGSA) module is proposed to collect the local instance information and the Channel-Wise Feature Attention (CWFA) module is employed for aggregating the global context information. In addition, an instance-guided object regression strategy is also proposed to alleviate the influence of center location prediction uncertainty in the inference process. Finally, the proposed approach has been verified on the public 3D object detection benchmark. The experimental results show that the proposed approach can significantly boost the performance of the baseline method on both 3D detection and 2D Bird's-Eye View among all three categories. Furthermore, our method outperforms all the monocular-based methods (even these trained with depth as auxiliary inputs) and achieves state-of-the-art performance on the KITTI benchmark.
引用
收藏
页码:18568 / 18580
页数:13
相关论文
共 50 条
  • [41] Pseudo-Stereo for Monocular 3D Object Detection in Autonomous Driving
    Chen, Yi-Nan
    Dai, Hang
    Ding, Yong
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 877 - 887
  • [42] Efficient 3D Object Detection Models and Evaluation Method for Autonomous Driving
    Lee, Jin-Hee
    Lee, Jae-Keun
    Lee, Joohyun
    Kim, Je-Seok
    Kwon, Soon
    2023 IEEE INTELLIGENT VEHICLES SYMPOSIUM, IV, 2023,
  • [43] R-CNN Based 3D Object Detection for Autonomous Driving
    Hu, Hongyu
    Zhao, Tongtong
    Wang, Qi
    Gao, Fei
    He, Lei
    CICTP 2020: TRANSPORTATION EVOLUTION IMPACTING FUTURE MOBILITY, 2020, : 918 - 929
  • [44] A semi-supervised 3D object detection method for autonomous driving
    Zhang, Jiacheng
    Liu, Huafeng
    Lu, Jianfeng
    DISPLAYS, 2022, 71
  • [45] Benchmarking Robustness of 3D Object Detection to Common Corruptions in Autonomous Driving
    Dong, Yinpeng
    Kang, Caixin
    Zhang, Jinlai
    Zhu, Zijian
    Wang, Yikai
    Yang, Xiao
    Su, Hang
    Wei, Xingxing
    Zhu, Jun
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 1022 - 1032
  • [46] Monocular 3D Object Detection for Autonomous Driving Based on Contextual Transformer
    She, Xiangyang
    Yan, Weijia
    Dong, Lihong
    Computer Engineering and Applications, 2024, 60 (19) : 178 - 189
  • [47] Testing Object Detection for Autonomous Driving Systems via 3D Reconstruction
    Shao, Jinyang
    2021 IEEE/ACM 43RD INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING: COMPANION PROCEEDINGS (ICSE-COMPANION 2021), 2021, : 117 - 119
  • [48] Monocular 3D object detection using dual quadric for autonomous driving
    Li, Peixuan
    Zhao, Huaici
    NEUROCOMPUTING, 2021, 441 : 151 - 160
  • [49] Multi-modality 3D object detection in autonomous driving: A review
    Tang, Yingjuan
    He, Hongwen
    Wang, Yong
    Mao, Zan
    Wang, Haoyu
    NEUROCOMPUTING, 2023, 553
  • [50] PLOT: a 3D point cloud object detection network for autonomous driving
    Zhang, Yihuan
    Wang, Liang
    Dai, Yifan
    ROBOTICA, 2023, 41 (05) : 1483 - 1499