Deep Dive into Gradients: Better Optimization for 3D Object Detection with Gradient-Corrected IoU Supervision

被引:6
作者
Ming, Qi [1 ]
Miao, Lingjuan [1 ]
Ma, Zhe [1 ]
Zhao, Lin [1 ]
Zhou, Zhiqiang [1 ]
Huang, Xuhui [1 ]
Chen, Yuanpei [1 ]
Guo, Yufei [1 ]
机构
[1] Beijing Inst Technol, Sch Automat, Intelligent Sci & Technol Acad CASIC, Beijing, Peoples R China
来源
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR | 2023年
关键词
D O I
10.1109/CVPR52729.2023.00497
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Intersection-over-Union (IoU) is the most popular metric to evaluate regression performance in 3D object detection. Recently, there are also some methods applying IoU to the optimization of 3D bounding box regression. However, we demonstrate through experiments and mathematical proof that the 3D IoU loss suffers from abnormal gradient w.r.t. angular error and object scale, which further leads to slow convergence and suboptimal regression process, respectively. In this paper, we propose a Gradient-Corrected IoU (GCIoU) loss to achieve fast and accurate 3D bounding box regression. Specifically, a gradient correction strategy is designed to endow 3D IoU loss with a reasonable gradient. It ensures that the model converges quickly in the early stage of training, and helps to achieve fine-grained refinement of bounding boxes in the later stage. To solve suboptimal regression of 3D IoU loss for objects at different scales, we introduce a gradient rescaling strategy to adaptively optimize the step size. Finally, we integrate GCIoU Loss into multiple models to achieve stable performance gains and faster model convergence. Experiments on KITTI dataset demonstrate superiority of the proposed method. The code is available at https://github.com/ming71/GCIoU-loss.
引用
收藏
页码:5136 / 5145
页数:10
相关论文
共 50 条
[1]  
[Anonymous], 2020, EUR C COMP VIS SPRIN, DOI DOI 10.5220/0009422300680077
[2]   Effects of heat treatment on the thermal and mechanical properties of ramie fabric-reinforced poly(lactic acid) biocomposites [J].
Chen, Xie ;
Ren, Jie ;
Zhang, Naiwen ;
Gu, Shuying ;
Li, Jianbo .
JOURNAL OF REINFORCED PLASTICS AND COMPOSITES, 2015, 34 (01) :28-36
[3]  
Deng JJ, 2021, AAAI CONF ARTIF INTE, V35, P1201
[4]   RangeDet: In Defense of Range View for LiDAR-based 3D Object Detection [J].
Fan, Lue ;
Xiong, Xuan ;
Wang, Feng ;
Wang, Naiyan ;
Zhang, Zhaoxiang .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :2898-2907
[5]   Vision meets robotics: The KITTI dataset [J].
Geiger, A. ;
Lenz, P. ;
Stiller, C. ;
Urtasun, R. .
INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2013, 32 (11) :1231-1237
[6]   Fast R-CNN [J].
Girshick, Ross .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :1440-1448
[7]   Structure Aware Single-stage 3D Object Detection from Point Cloud [J].
He, Chenhang ;
Zeng, Hui ;
Huang, Jianqiang ;
Hua, Xian-Sheng ;
Zhang, Lei .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :11870-11879
[8]  
He QD, 2022, AAAI CONF ARTIF INTE, P870
[9]   Acquisition of Localization Confidence for Accurate Object Detection [J].
Jiang, Borui ;
Luo, Ruixuan ;
Mao, Jiayuan ;
Xiao, Tete ;
Jiang, Yuning .
COMPUTER VISION - ECCV 2018, PT XIV, 2018, 11218 :816-832
[10]  
Kingma DP, 2014, ADV NEUR IN, V27