Deep Dive into Gradients: Better Optimization for 3D Object Detection with Gradient-Corrected IoU Supervision

被引：6

作者：

Ming, Qi ^{[1
]}

Miao, Lingjuan ^{[1
]}

Ma, Zhe ^{[1
]}

Zhao, Lin ^{[1
]}

Zhou, Zhiqiang ^{[1
]}

Huang, Xuhui ^{[1
]}

Chen, Yuanpei ^{[1
]}

Guo, Yufei ^{[1
]}

机构：

[1] Beijing Inst Technol, Sch Automat, Intelligent Sci & Technol Acad CASIC, Beijing, Peoples R China

来源：

2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR | 2023年

关键词：

D O I：

10.1109/CVPR52729.2023.00497

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Intersection-over-Union (IoU) is the most popular metric to evaluate regression performance in 3D object detection. Recently, there are also some methods applying IoU to the optimization of 3D bounding box regression. However, we demonstrate through experiments and mathematical proof that the 3D IoU loss suffers from abnormal gradient w.r.t. angular error and object scale, which further leads to slow convergence and suboptimal regression process, respectively. In this paper, we propose a Gradient-Corrected IoU (GCIoU) loss to achieve fast and accurate 3D bounding box regression. Specifically, a gradient correction strategy is designed to endow 3D IoU loss with a reasonable gradient. It ensures that the model converges quickly in the early stage of training, and helps to achieve fine-grained refinement of bounding boxes in the later stage. To solve suboptimal regression of 3D IoU loss for objects at different scales, we introduce a gradient rescaling strategy to adaptively optimize the step size. Finally, we integrate GCIoU Loss into multiple models to achieve stable performance gains and faster model convergence. Experiments on KITTI dataset demonstrate superiority of the proposed method. The code is available at https://github.com/ming71/GCIoU-loss.

引用

页码：5136 / 5145

页数：10

共 50 条

[41]

Yu Jiahui, 2016, P 24 ACM INT C MULT, P516

[42] Not All Points Are Equal: Learning Highly Efficient Point-based Detectors for 3D LiDAR Point Clouds [J].

Zhang, Yifan ;

Hu, Qingyong ;

Xu, Guoquan ;

Ma, Yanxin ;

Wan, Jianwei ;

Guo, Yulan .

2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, :18931-18940

[43] Objects are Different: Flexible Monocular 3D Object Detection [J].

Zhang, Yunpeng ;

Lu, Jiwen ;

Zhou, Jie .

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :3288-3297

[44] Sem-Aug: Improving Camera-LiDAR Feature Fusion With Semantic Augmentation for 3D Vehicle Detection [J].

Zhao, Lin ;

Wang, Meiling ;

Yue, Yufeng .

IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (04) :9358-9365

[45] SE-SSD: Self-Ensembling Single-Stage Object Detector From Point Cloud [J].

Zheng, Wu ;

Tang, Weiliang ;

Jiang, Li ;

Fu, Chi-Wing .

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :14489-14498

[46]

Zheng W, 2021, AAAI CONF ARTIF INTE, V35, P3555

[47] Rotation-Robust Intersection over Union for 3D Object Detection [J].

Zheng, Yu ;

Zhang, Danyang ;

Xie, Sinan ;

Lu, Jiwen ;

Zhou, Jie .

COMPUTER VISION - ECCV 2020, PT XX, 2020, 12365 :464-480

[48]

Zheng ZH, 2020, AAAI CONF ARTIF INTE, V34, P12993

[49] IoU Loss for 2D/3D Object Detection [J].

Zhou, Dingfu ;

Fang, Jin ;

Song, Xibin ;

Guan, Chenye ;

Yin, Junbo ;

Dai, Yuchao ;

Yang, Ruigang .

2019 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2019), 2019, :85-94

[50]

Zhou Y., 2020, PROC C ROBOT LEARN, P923

← 1 2 3 4 5 →