RangeIoUDet: Range Image based Real-Time 3D Object Detector Optimized by Intersection over Union

被引:54
作者
Liang, Zhidong [1 ]
Zhang, Zehan [1 ]
Zhang, Ming [1 ]
Zhao, Xian [1 ]
Pu, Shiliang [1 ]
机构
[1] Hikvision Res Inst, Montreal, PQ, Canada
来源
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021 | 2021年
基金
国家重点研发计划;
关键词
D O I
10.1109/CVPR46437.2021.00706
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Real-time and high-performance 3D object detection is an attractive research direction in autonomous driving. Recent studies prefer point based or voxel based convolution for achieving high performance. However, these methods suffer from the unsatisfied efficiency or complex customized convolution, making them unsuitable for applications with real-time requirements. In this paper, we present an efficient and effective 3D object detection framework, named RangeIoUDet that uses the range image as input. Benefiting from the dense representation of the range image, RangeIoUDet is entirely constructed based on 2D convolution, making it possible to have a fast inference speed. This model learns pointwise features from the range image, which is then passed to a region proposal network for predicting 3D bounding boxes. We optimize the pointwise feature and the 3D box via the point-based IoU and box-based IoU supervision, respectively. The point-based IoU supervision is proposed to make the network better learn the implicit 3D information encoded in the range image. The 3D Hybrid GIoU loss is introduced to generate high-quality boxes while providing an accurate quality evaluation. Through the point-based IoU and the box-based IoU, RangeIoUDet outperforms all single-stage models on the KITTI dataset, while running at 45 FPS for inference. Experiments on the self-built dataset further prove its effectiveness on different LIDAR sensors and object categories.
引用
收藏
页码:7136 / 7145
页数:10
相关论文
共 38 条
[1]   The Lovasz-Softmax loss: A tractable surrogate for the optimization of the intersection-over-union measure in neural networks [J].
Berman, Maxim ;
Triki, Amal Rannen ;
Blaschko, Matthew B. .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :4413-4421
[2]  
Bewley Alex, 2020, CORL
[3]   DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs [J].
Chen, Liang-Chieh ;
Papandreou, George ;
Kokkinos, Iasonas ;
Murphy, Kevin ;
Yuille, Alan L. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) :834-848
[4]   Ameliorative effects of Compound K and ginsenoside Rh1 on non-alcoholic fatty liver disease in rats [J].
Chen, Xu-Jia ;
Liu, Wen-Jing ;
Wen, Meng-Liang ;
Liang, Hong ;
Wu, Shao-Mei ;
Zhu, Yun-Zhen ;
Zhao, Jiang-Yuan ;
Dong, Xiang-Qian ;
Li, Ming-Gang ;
Bian, Li ;
Zou, Cheng-Gang ;
Ma, Lan-Qing .
SCIENTIFIC REPORTS, 2017, 7
[5]  
Chen YL, 2019, IEEE I CONF COMP VIS, P9774, DOI [10.1109/ICCV.2019.00987, 10.1109/iccv.2019.00987]
[6]   PIoU Loss: Towards Accurate Oriented Object Detection in Complex Environments [J].
Chen, Zhiming ;
Chen, Kean ;
Lin, Weiyao ;
See, John ;
Yu, Hui ;
Ke, Yan ;
Yang, Cong .
COMPUTER VISION - ECCV 2020, PT V, 2020, 12350 :195-211
[7]   Shape Completion using 3D-Encoder-Predictor CNNs and Shape Synthesis [J].
Dai, Angela ;
Qi, Charles Ruizhongtai ;
Niessner, Matthias .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :6545-6554
[8]  
Geiger A, 2012, PROC CVPR IEEE, P3354, DOI 10.1109/CVPR.2012.6248074
[9]   Structure Aware Single-stage 3D Object Detection from Point Cloud [J].
He, Chenhang ;
Zeng, Hui ;
Huang, Jianqiang ;
Hua, Xian-Sheng ;
Zhang, Lei .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :11870-11879
[10]  
Huang Tengteng, 2020, P EUR C COMP VIS ECC