Improved Point-Voxel Region Convolutional Neural Network: 3D Object Detectors for Autonomous Driving

被引:52
作者
Li, Yujie [1 ]
Yang, Shuo [2 ]
Zheng, Yuchao [2 ]
Lu, Huimin [3 ]
机构
[1] Yangzhou Univ, Sch Informat Engn, Yangzhou 225009, Jiangsu, Peoples R China
[2] Kyushu Inst Technol, Sch Engn, Kitakyushu, Fukuoka 8040015, Japan
[3] Qingdao Univ, Sch Data Sci & Software Engn, Qingdao 266071, Peoples R China
关键词
3D object detection; region proposal method; point cloud data processing;
D O I
10.1109/TITS.2021.3071790
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Recently, 3D object detection based on deep learning has achieved impressive performance in complex indoor and outdoor scenes. Among the methods, the two-stage detection method performs the best; however, this method still needs improved accuracy and efficiency, especially for small size objects or autonomous driving scenes. In this paper, we propose an improved 3D object detection method based on a two-stage detector called the Improved Point-Voxel Region Convolutional Neural Network (IPV-RCNN). Our proposed method contains online training for data augmentation, upsampling convolution and k-means clustering for the bounding box to achieve 3D detection tasks from raw point clouds. The evaluation results on the KITTI 3D dataset show that the IPV-RCNN achieved a 96% mAP, which is 3% more accurate than the state-of-the-art detectors.
引用
收藏
页码:9311 / 9317
页数:7
相关论文
共 23 条
[1]   Graph-Based Object Classification for Neuromorphic Vision Sensing [J].
Bi, Yin ;
Chadha, Aaron ;
Abbas, Alhabib ;
Bourtsoulatze, Eirina ;
Andreopoulos, Yiannis .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :491-501
[2]  
Chen YL, 2019, IEEE I CONF COMP VIS, P9774, DOI [10.1109/ICCV.2019.00987, 10.1109/iccv.2019.00987]
[3]  
Geiger A, 2012, PROC CVPR IEEE, P3354, DOI 10.1109/CVPR.2012.6248074
[4]  
Guibas, 2017, ADV NEURAL INFORM PR, P5099
[5]   Joint Monocular 3D Vehicle Detection and Tracking [J].
Hu, Hou-Ning ;
Cai, Qi-Zhi ;
Wang, Dequan ;
Lin, Ji ;
Sun, Min ;
Krahenbuhl, Philipp ;
Darrell, Trevor ;
Yu, Fisher .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :5389-5398
[6]   Multi-view PointNet for 3D Scene Understanding [J].
Jaritz, Maximilian ;
Gu, Jiayuan ;
Su, Hao .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, :3995-4003
[7]   PointPillars: Fast Encoders for Object Detection from Point Clouds [J].
Lang, Alex H. ;
Vora, Sourabh ;
Caesar, Holger ;
Zhou, Lubing ;
Yang, Jiong ;
Beijbom, Oscar .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :12689-12697
[8]  
Lehner J., 2019, ARXIV PREPRINT ARXIV, P1
[9]   Stereo R-CNN based 3D Object Detection for Autonomous Driving [J].
Li, Peiliang ;
Chen, Xiaozhi ;
Shen, Shaojie .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :7636-7644
[10]   Feature Pyramid Networks for Object Detection [J].
Lin, Tsung-Yi ;
Dollar, Piotr ;
Girshick, Ross ;
He, Kaiming ;
Hariharan, Bharath ;
Belongie, Serge .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :936-944