Understanding the Robustness of 3D Object Detection with Bird's-Eye-View Representations in Autonomous Driving

被引:21
|
作者
Zhu, Zijian [1 ]
Zhang, Yichi [2 ,5 ]
Chen, Hai [3 ]
Dong, Yinpeng [2 ]
Zhao, Shu [3 ]
Ding, Wenbo [4 ]
Zhong, Jiachen [4 ]
Zheng, Shibao [1 ]
机构
[1] Shanghai Jiao Tong Univ, Inst Image Commun & Network Engn, Shanghai, Peoples R China
[2] Tsinghua Univ, Inst AI, Dept Comp Sci & Tech, THBI Lab,BNRist Ctr, Beijing, Peoples R China
[3] Anhui Univ, Sch Comp Sci & Technol, Key Lab Intelligent Comp & Signal Proc,Minist Edu, Informat Mat & Intelligent Sensing Lab Anhui Prov, Hefei, Peoples R China
[4] SAIC Motor AI Lab, Shanghai, Peoples R China
[5] Zhongguancun Lab, Beijing, Peoples R China
关键词
D O I
10.1109/CVPR52729.2023.02069
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
3D object detection is an essential perception task in autonomous driving to understand the environments. The Bird's-Eye-View (BEV) representations have significantly improved the performance of 3D detectors with camera inputs on popular benchmarks. However, there still lacks a systematic understanding of the robustness of these vision-dependent BEV models, which is closely related to the safety of autonomous driving systems. In this paper, we evaluate the natural and adversarial robustness of various representative models under extensive settings, to fully understand their behaviors influenced by explicit BEV features compared with those without BEV. In addition to the classic settings, we propose a 3D consistent patch attack by applying adversarial patches in the 3D space to guarantee the spatiotemporal consistency, which is more realistic for the scenario of autonomous driving. With substantial experiments, we draw several findings: 1) BEV models tend to be more stable than previous methods under different natural conditions and common corruptions due to the expressive spatial representations; 2) BEV models are more vulnerable to adversarial noises, mainly caused by the redundant BEV features; 3) Camera-LiDAR fusion models have superior performance under different settings with multi-modal inputs, but BEV fusion model is still vulnerable to adversarial noises of both point cloud and image. These findings alert the safety issue in the applications of BEV detectors and could facilitate the development of more robust models.
引用
收藏
页码:21600 / 21610
页数:11
相关论文
共 50 条
  • [1] L-shape Fitting Algorithm for 3D Object Detection in Bird's-Eye-View in an Autonomous Driving System
    Chekanov, Mikhail O.
    Shipitko, Oleg S.
    SIXTEENTH INTERNATIONAL CONFERENCE ON MACHINE VISION, ICMV 2023, 2024, 13072
  • [2] BEVRefiner: Improving 3D Object Detection in Bird's-Eye-View via Dual Refinement
    Wang, Binglu
    Zheng, Haowen
    Zhang, Lei
    Liu, Nian
    Anwer, Rao Muhammad
    Cholakkal, Hisham
    Zhao, Yongqiang
    Li, Zhijun
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (10) : 15094 - 15105
  • [3] Traffic Object Detection for Autonomous Driving Fusing LiDAR and Pseudo 4D-Radar Under Bird's-Eye-View
    Meng, ZeYu
    Song, YongHong
    Zhang, YuanLin
    Nan, YueXing
    Bai, ZeNan
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (11) : 18185 - 18195
  • [4] 3D Bird's-Eye-View Instance Segmentation
    Elich, Cathrin
    Engelmann, Francis
    Kontogianni, Theodora
    Leibe, Bastian
    PATTERN RECOGNITION, DAGM GCPR 2019, 2019, 11824 : 48 - 61
  • [5] Holistic Autonomous Driving Understanding by Bird's-Eye-View Injected Multi-Modal Large Models
    Ding, Xinpeng
    Han, Jianhua
    Xu, Hang
    Liang, Xiaodan
    Zhang, Wei
    Li, Xiaomeng
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 13668 - 13677
  • [6] BEVDetNet: Bird's Eye View LiDAR Point Cloud based Real-time 3D Object Detection for Autonomous Driving
    Mohapatra, Sambit
    Yogamani, Senthil
    Gotzig, Heinrich
    Milz, Stefan
    Maeder, Patrick
    2021 IEEE INTELLIGENT TRANSPORTATION SYSTEMS CONFERENCE (ITSC), 2021, : 2809 - 2815
  • [7] SA-BEV: Generating Semantic-Aware Bird's-Eye-View Feature for Multi-view 3D Object Detection
    Zhang, Jinqing
    Zhang, Yanan
    Liu, Qingjie
    Wang, Yunhong
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 3325 - 3334
  • [8] Efficient Learning of Urban Driving Policies Using Bird's-Eye-View State Representations
    Trumpp, Raphael
    Buechner, Martin
    Valada, Abhinav
    Caccamo, Marco
    2023 IEEE 26TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS, ITSC, 2023, : 4181 - 4186
  • [9] Accurate 3D Object Detection from Point Cloud Data using Bird's Eye View Representations
    Aranjuelo, Nerea
    Engels, Guus
    Montero, David
    Nieto, Marcos
    Arganda-Carreras, Ignacio
    Unzueta, Luis
    Otaegui, Oihana
    PROCEEDINGS OF THE 13TH INTERNATIONAL JOINT CONFERENCE ON COMPUTATIONAL INTELLIGENCE (IJCCI), 2021, : 246 - 253
  • [10] Autonomous driving: a bird's eye view
    Martinez-Diaz, Margarita
    Soriguera, Francesc
    Perez, Ignacio
    IET INTELLIGENT TRANSPORT SYSTEMS, 2019, 13 (04) : 563 - 579