Context-Aware 3D Object Detection From a Single Image in Autonomous Driving

Cited by: 8
Authors
Zhou, Dingfu [1, 2]
Song, Xibin [1, 2]
Fang, Jin [1, 2]
Dai, Yuchao [3]
Li, Hongdong [4]
Zhang, Liangjun [1, 2]
Affiliations
[1] Baidu Res, Robot & Autonomous Driving Lab, Beijing 100085, Peoples R China
[2] Natl Engn Lab Deep Learning Technol & Applicat, Beijing 100193, Peoples R China
[3] Northwestern Polytech Univ, Sch Elect & Informat, Xian 710060, Peoples R China
[4] Australian Natl Univ, Coll Engn & Comp Sci, Canberra, ACT 0200, Australia
Funding
Australian Research Council; National Natural Science Foundation of China;
Keywords
Three-dimensional displays; Object detection; Training; Feature extraction; Task analysis; Sensors; Detectors; Monocular 3D object detection; context-aware feature aggregation; self-attention; RECOGNITION; MODEL;
DOI
10.1109/TITS.2022.3154022
Chinese Library Classification number
TU [Building Science];
Discipline classification code
0813 ;
Abstract
Camera sensors have been widely used in Driver-Assistance and Autonomous Driving Systems because of the rich texture information they provide. Recently, with the development of deep learning techniques, many approaches have been proposed to detect objects in 3D from a single frame; however, there is still much room for improvement. In this paper, we first review the recently proposed state-of-the-art monocular 3D object detection approaches. Based on an analysis of the disadvantages of previous center-based frameworks, we propose a novel feature aggregation strategy that boosts 3D object detection by exploiting context information. Specifically, an Instance-Guided Spatial Attention (IGSA) module is proposed to collect local instance information, and a Channel-Wise Feature Attention (CWFA) module is employed to aggregate global context information. In addition, an instance-guided object regression strategy is proposed to alleviate the influence of center location prediction uncertainty during inference. Finally, the proposed approach is verified on a public 3D object detection benchmark. The experimental results show that the proposed approach significantly boosts the performance of the baseline method on both 3D detection and 2D Bird's-Eye View detection across all three categories. Furthermore, our method outperforms all monocular-based methods (even those trained with depth as auxiliary input) and achieves state-of-the-art performance on the KITTI benchmark.
Pages: 18568 - 18580
Number of pages: 13
Related papers
50 in total
  • [21] 3D OBJECT DETECTION FOR AUTONOMOUS DRIVING USING TEMPORAL LIDAR DATA
    McCrae, Scott
    Zakhor, Avideh
    2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 2661 - 2665
  • [22] PLOT: a 3D point cloud object detection network for autonomous driving
    Zhang, Yihuan
    Wang, Liang
    Dai, Yifan
    ROBOTICA, 2023, 41 (05) : 1483 - 1499
  • [23] RoIFusion: 3D Object Detection From LiDAR and Vision
    Chen, Can
    Fragonara, Luca Zanotti
    Tsourdos, Antonios
    IEEE ACCESS, 2021, 9 (09) : 51710 - 51721
  • [24] POAT-Net: Parallel Offset-Attention Assisted Transformer for 3D Object Detection for Autonomous Driving
    Wang, Jinyang
    Lin, Xiao
    Yu, Hongying
    IEEE ACCESS, 2021, 9 : 151110 - 151117
  • [25] 3D Context-Aware Convolutional Neural Network for False Positive Reduction in Clustered Microcalcifications Detection
    Zheng, Jian
    Sun, Haotian
    Wu, Shandong
    Jiang, Ke
    Peng, Yunsong
    Yang, Xiaodong
    Zhang, Fan
    Li, Ming
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2021, 25 (03) : 764 - 773
  • [26] REGION AVERAGE POOLING FOR CONTEXT-AWARE OBJECT DETECTION
    Kuan, Kingsley
    Manek, Gaurav
    Lin, Jie
    Fang, Yuan
    Chandrasekhar, Vijay
    2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 1347 - 1351
  • [27] Super Sparse 3D Object Detection
    Fan, Lue
    Yang, Yuxue
    Wang, Feng
    Wang, Naiyan
    Zhang, Zhaoxiang
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (10) : 12490 - 12505
  • [28] Aerial Monocular 3D Object Detection
    Hu, Yue
    Fang, Shaoheng
    Xie, Weidi
    Chen, Siheng
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (04) : 1959 - 1966
  • [29] 3D object detection for autonomous driving: Methods, models, sensors, data, and challenges
    Ghasemieh, A.
    Kashef, R.
    Transportation Engineering, 2022, 8
  • [30] Multimodal Cooperative 3D Object Detection Over Connected Vehicles for Autonomous Driving
    Chi, Fangyuan
    Wang, Yixiao
    Pourazad, Mahsa T.
    Nasiopoulos, Panos
    Leung, Victor C. M.
    IEEE NETWORK, 2023, 37 (04) : 265 - 272