FROM 3D TO 4D: FIXING THE ERRONEOUS COUPLING BETWEEN IOU AND ANGLE FOR OPTIMIZING 3D OBJECT DETECTION

被引:0
作者
Lun, Hengsheng [1 ]
Lu, Ke [1 ,2 ]
Mu, Liping [1 ]
Wang, Shuhua [1 ]
Xue, Jian [1 ]
机构
[1] Univ Chinese Acad Sci, Sch Engn Sci, Beijing 100049, Peoples R China
[2] Peng Cheng Lab, Shenzhen 518055, Peoples R China
来源
2024 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME 2024 | 2024年
基金
中国国家自然科学基金;
关键词
3D object detection; Point cloud; Autonomous driving;
D O I
10.1109/ICME57554.2024.10687801
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The IoU metric directly measures the overlap between two boxes, maintaining consistency in model optimization and testing stages. It has emerged as a highly regarded regression loss in the field of 3D object detection. However, the optimization of IoU often leads to an increased angular error. This erroneous coupling phenomenon renders the model susceptible to settling into sub-optimal solutions, which have not been extensively analyzed and addressed, significantly impeding further advancements in the accuracy of 3D object detection. In this paper, a novel concept "4DIoU" is introduced for 3D object detection, where the angle information is integrated as an additional dimension in the IoU calculation, and a new formula for measuring angle correlation is proposed. The 4DIoU not only resolves the erroneous coupling between IoU and angles but also capitalizes on angle information to enhance network optimization. Furthermore, a new encoding and decoding paradigm is proposed, which is more compatible with 4DIoU for object detection in point clouds. Extensive experiments on nuScenes, Waymo and KITTI datasets demonstrate the effectiveness of our method. The plug-and-play design of our approach proves to be highly versatile.
引用
收藏
页数:6
相关论文
共 15 条
[1]   nuScenes: A multimodal dataset for autonomous driving [J].
Caesar, Holger ;
Bankiti, Varun ;
Lang, Alex H. ;
Vora, Sourabh ;
Liong, Venice Erin ;
Xu, Qiang ;
Krishnan, Anush ;
Pan, Yu ;
Baldan, Giancarlo ;
Beijbom, Oscar .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :11618-11628
[2]   VoxelNeXt: Fully Sparse VoxelNet for 3D Object Detection and Tracking [J].
Chen, Yukang ;
Liu, Jianhui ;
Zhang, Xiangyu ;
Qi, Xiaojuan ;
Jia, Jiaya .
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, :21674-21683
[3]  
Geiger A, 2012, PROC CVPR IEEE, P3354, DOI 10.1109/CVPR.2012.6248074
[4]   PointPillars: Fast Encoders for Object Detection from Point Clouds [J].
Lang, Alex H. ;
Vora, Sourabh ;
Caesar, Holger ;
Zhou, Lubing ;
Yang, Jiong ;
Beijbom, Oscar .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :12689-12697
[5]  
Lun Hengsheng, 2022, 2022 IEEE International Conference on Systems, Man, and Cybernetics (SMC), P2427, DOI 10.1109/SMC53654.2022.9945348
[6]   Deep Dive into Gradients: Better Optimization for 3D Object Detection with Gradient-Corrected IoU Supervision [J].
Ming, Qi ;
Miao, Lingjuan ;
Ma, Zhe ;
Zhao, Lin ;
Zhou, Zhiqiang ;
Huang, Xuhui ;
Chen, Yuanpei ;
Guo, Yufei .
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, :5136-5145
[7]  
OpenPCDet Development Team, 2020, OP OP SOURC TOOLB 3D
[8]   Generalized Intersection over Union: A Metric and A Loss for Bounding Box Regression [J].
Rezatofighi, Hamid ;
Tsoi, Nathan ;
Gwak, JunYoung ;
Sadeghian, Amir ;
Reid, Ian ;
Savarese, Silvio .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :658-666
[9]   Rethinking IoU-based Optimization for Single-stage 3D Object Detection [J].
Sheng, Hualian ;
Cai, Sijia ;
Zhao, Na ;
Deng, Bing ;
Huang, Jianqiang ;
Hua, Xian-Sheng ;
Zhao, Min-Jian ;
Lee, Gim Hee .
COMPUTER VISION, ECCV 2022, PT IX, 2022, 13669 :544-561
[10]   PillarNet: Real-Time and High-Performance Pillar-Based 3D Object Detection [J].
Shi, Guangsheng ;
Li, Ruifeng ;
Ma, Chao .
COMPUTER VISION, ECCV 2022, PT X, 2022, 13670 :35-52