Understanding the Robustness of 3D Object Detection with Bird's-Eye-View Representations in Autonomous Driving

被引:21
|
作者
Zhu, Zijian [1 ]
Zhang, Yichi [2 ,5 ]
Chen, Hai [3 ]
Dong, Yinpeng [2 ]
Zhao, Shu [3 ]
Ding, Wenbo [4 ]
Zhong, Jiachen [4 ]
Zheng, Shibao [1 ]
机构
[1] Shanghai Jiao Tong Univ, Inst Image Commun & Network Engn, Shanghai, Peoples R China
[2] Tsinghua Univ, Inst AI, Dept Comp Sci & Tech, THBI Lab,BNRist Ctr, Beijing, Peoples R China
[3] Anhui Univ, Sch Comp Sci & Technol, Key Lab Intelligent Comp & Signal Proc,Minist Edu, Informat Mat & Intelligent Sensing Lab Anhui Prov, Hefei, Peoples R China
[4] SAIC Motor AI Lab, Shanghai, Peoples R China
[5] Zhongguancun Lab, Beijing, Peoples R China
关键词
D O I
10.1109/CVPR52729.2023.02069
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
3D object detection is an essential perception task in autonomous driving to understand the environments. The Bird's-Eye-View (BEV) representations have significantly improved the performance of 3D detectors with camera inputs on popular benchmarks. However, there still lacks a systematic understanding of the robustness of these vision-dependent BEV models, which is closely related to the safety of autonomous driving systems. In this paper, we evaluate the natural and adversarial robustness of various representative models under extensive settings, to fully understand their behaviors influenced by explicit BEV features compared with those without BEV. In addition to the classic settings, we propose a 3D consistent patch attack by applying adversarial patches in the 3D space to guarantee the spatiotemporal consistency, which is more realistic for the scenario of autonomous driving. With substantial experiments, we draw several findings: 1) BEV models tend to be more stable than previous methods under different natural conditions and common corruptions due to the expressive spatial representations; 2) BEV models are more vulnerable to adversarial noises, mainly caused by the redundant BEV features; 3) Camera-LiDAR fusion models have superior performance under different settings with multi-modal inputs, but BEV fusion model is still vulnerable to adversarial noises of both point cloud and image. These findings alert the safety issue in the applications of BEV detectors and could facilitate the development of more robust models.
引用
收藏
页码:21600 / 21610
页数:11
相关论文
共 50 条
  • [41] 3D Object Detection From Images for Autonomous Driving: A Survey
    Ma, Xinzhu
    Ouyang, Wanli
    Simonelli, Andrea
    Ricci, Elisa
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (05) : 3537 - 3556
  • [42] A Survey on 3D Object Detection Methods for Autonomous Driving Applications
    Arnold, Eduardo
    Al-Jarrah, Omar Y.
    Dianati, Mehrdad
    Fallah, Saber
    Oxtoby, David
    Mouzakitis, Alex
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2019, 20 (10) : 3782 - 3795
  • [43] Joint 3D Instance Segmentation and Object Detection for Autonomous Driving
    Zhou, Dingfu
    Fang, Jin
    Song, Xibin
    Liu, Liu
    Yin, Junbo
    Dai, Yuchao
    Li, Hongdong
    Yang, Ruigang
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 1836 - 1846
  • [44] A survey on 3D object detection in real time for autonomous driving
    Contreras, Marcelo
    Jain, Aayush
    Bhatt, Neel P.
    Banerjee, Arunava
    Hashemi, Ehsan
    FRONTIERS IN ROBOTICS AND AI, 2024, 11
  • [45] A Review of 3D Object Detection for Autonomous Driving of Electric Vehicles
    Dai, Deyun
    Chen, Zonghai
    Bao, Peng
    Wang, Jikai
    WORLD ELECTRIC VEHICLE JOURNAL, 2021, 12 (03)
  • [46] LiDAR-based 3D Object Detection for Autonomous Driving
    Li, Zirui
    2022 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, COMPUTER VISION AND MACHINE LEARNING (ICICML), 2022, : 507 - 512
  • [47] Rethinking the Non-Maximum Suppression Step in 3D Object Detection from a Bird's-Eye View
    Li, Bohao
    Song, Shaojing
    Ai, Luxia
    ELECTRONICS, 2024, 13 (20)
  • [48] MENet: Map-enhanced 3D object detection in bird's-eye view for LiDAR point clouds
    Huang, Yuanxian
    Zhou, Jian
    Li, Xicheng
    Dong, Zhen
    Xiao, Jinsheng
    Wang, Shurui
    Zhang, Hongjuan
    INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2023, 120
  • [49] Investigating 3D object detection using stereo camera and LiDAR fusion with bird's-eye view representation
    Nie, Xin
    Zhu, Lin
    He, Zhicheng
    Cheng, Aiguo
    Zhong, Shengshi
    Li, Eric
    NEUROCOMPUTING, 2025, 620
  • [50] CL-fusionBEV: 3D object detection method with camera-LiDAR fusion in Bird's Eye View
    Shi, Peicheng
    Liu, Zhiqiang
    Dong, Xinlong
    Yang, Aixi
    COMPLEX & INTELLIGENT SYSTEMS, 2024, 10 (06) : 7681 - 7696