Deep Dive into Gradients: Better Optimization for 3D Object Detection with Gradient-Corrected IoU Supervision

Cited by: 6

Authors:

Ming, Qi [1]

Miao, Lingjuan [1]

Ma, Zhe [1]

Zhao, Lin [1]

Zhou, Zhiqiang [1]

Huang, Xuhui [1]

Chen, Yuanpei [1]

Guo, Yufei [1]

Affiliations:

[1] Beijing Inst Technol, Sch Automat, Intelligent Sci & Technol Acad CASIC, Beijing, Peoples R China

Source:

2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) | 2023

Keywords:

DOI:

10.1109/CVPR52729.2023.00497

Chinese Library Classification:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Intersection-over-Union (IoU) is the most popular metric to evaluate regression performance in 3D object detection. Recently, there are also some methods applying IoU to the optimization of 3D bounding box regression. However, we demonstrate through experiments and mathematical proof that the 3D IoU loss suffers from abnormal gradient w.r.t. angular error and object scale, which further leads to slow convergence and suboptimal regression process, respectively. In this paper, we propose a Gradient-Corrected IoU (GCIoU) loss to achieve fast and accurate 3D bounding box regression. Specifically, a gradient correction strategy is designed to endow 3D IoU loss with a reasonable gradient. It ensures that the model converges quickly in the early stage of training, and helps to achieve fine-grained refinement of bounding boxes in the later stage. To solve suboptimal regression of 3D IoU loss for objects at different scales, we introduce a gradient rescaling strategy to adaptively optimize the step size. Finally, we integrate GCIoU Loss into multiple models to achieve stable performance gains and faster model convergence. Experiments on KITTI dataset demonstrate superiority of the proposed method. The code is available at https://github.com/ming71/GCIoU-loss.
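
As a point of reference for the abstract, the following is a minimal sketch of a plain 3D IoU and the corresponding `1 - IoU` loss. It assumes axis-aligned boxes given as `(x_min, y_min, z_min, x_max, y_max, z_max)`, which is a simplification: the paper concerns rotated boxes and angular error, and this sketch does not implement GCIoU or its gradient correction and rescaling strategies. The function names are hypothetical and are not taken from the authors' repository.

```python
def iou_3d_axis_aligned(box_a, box_b):
    """IoU of two axis-aligned 3D boxes.

    Each box is (x_min, y_min, z_min, x_max, y_max, z_max).
    Assumption: no rotation, unlike the boxes treated in the paper.
    """
    intersection = 1.0
    for axis in range(3):
        low = max(box_a[axis], box_b[axis])
        high = min(box_a[axis + 3], box_b[axis + 3])
        if high <= low:
            # No overlap along this axis, so the boxes do not intersect.
            return 0.0
        intersection *= high - low

    volume_a = (box_a[3] - box_a[0]) * (box_a[4] - box_a[1]) * (box_a[5] - box_a[2])
    volume_b = (box_b[3] - box_b[0]) * (box_b[4] - box_b[1]) * (box_b[5] - box_b[2])
    union = volume_a + volume_b - intersection
    return intersection / union


def iou_loss(box_pred, box_gt):
    """Plain IoU loss, 1 - IoU (not the gradient-corrected variant)."""
    return 1.0 - iou_3d_axis_aligned(box_pred, box_gt)
```

Note that this plain loss is constant (equal to 1) whenever the boxes do not overlap, so it provides no gradient in that case.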


Pages: 5136-5145

Page count: 10
