Improved Point-Voxel Region Convolutional Neural Network: 3D Object Detectors for Autonomous Driving

被引：52

作者：

Li, Yujie ^{[1
]}

Yang, Shuo ^{[2
]}

Zheng, Yuchao ^{[2
]}

Lu, Huimin ^{[3
]}

机构：

[1] Yangzhou Univ, Sch Informat Engn, Yangzhou 225009, Jiangsu, Peoples R China

[2] Kyushu Inst Technol, Sch Engn, Kitakyushu, Fukuoka 8040015, Japan

[3] Qingdao Univ, Sch Data Sci & Software Engn, Qingdao 266071, Peoples R China

来源：

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS | 2022年 / 23卷 / 07期

关键词：

3D object detection; region proposal method; point cloud data processing;

D O I：

10.1109/TITS.2021.3071790

中图分类号：

TU [建筑科学];

学科分类号：

0813 ;

摘要：

Recently, 3D object detection based on deep learning has achieved impressive performance in complex indoor and outdoor scenes. Among the methods, the two-stage detection method performs the best; however, this method still needs improved accuracy and efficiency, especially for small size objects or autonomous driving scenes. In this paper, we propose an improved 3D object detection method based on a two-stage detector called the Improved Point-Voxel Region Convolutional Neural Network (IPV-RCNN). Our proposed method contains online training for data augmentation, upsampling convolution and k-means clustering for the bounding box to achieve 3D detection tasks from raw point clouds. The evaluation results on the KITTI 3D dataset show that the IPV-RCNN achieved a 96% mAP, which is 3% more accurate than the state-of-the-art detectors.

引用

页码：9311 / 9317

页数：7

共 23 条

[1] Graph-Based Object Classification for Neuromorphic Vision Sensing [J].

Bi, Yin ;

Chadha, Aaron ;

Abbas, Alhabib ;

Bourtsoulatze, Eirina ;

Andreopoulos, Yiannis .

2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :491-501

[2]

Chen YL, 2019, IEEE I CONF COMP VIS, P9774, DOI [10.1109/ICCV.2019.00987, 10.1109/iccv.2019.00987]

[3]

Geiger A, 2012, PROC CVPR IEEE, P3354, DOI 10.1109/CVPR.2012.6248074

[4]

Guibas, 2017, ADV NEURAL INFORM PR, P5099

[5] Joint Monocular 3D Vehicle Detection and Tracking [J].

Hu, Hou-Ning ;

Cai, Qi-Zhi ;

Wang, Dequan ;

Lin, Ji ;

Sun, Min ;

Krahenbuhl, Philipp ;

Darrell, Trevor ;

Yu, Fisher .

2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :5389-5398

[6] Multi-view PointNet for 3D Scene Understanding [J].

Jaritz, Maximilian ;

Gu, Jiayuan ;

Su, Hao .

2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, :3995-4003

[7] PointPillars: Fast Encoders for Object Detection from Point Clouds [J].

Lang, Alex H. ;

Vora, Sourabh ;

Caesar, Holger ;

Zhou, Lubing ;

Yang, Jiong ;

Beijbom, Oscar .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :12689-12697

[8]

Lehner J., 2019, ARXIV PREPRINT ARXIV, P1

[9] Stereo R-CNN based 3D Object Detection for Autonomous Driving [J].

Li, Peiliang ;

Chen, Xiaozhi ;

Shen, Shaojie .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :7636-7644

[10] Feature Pyramid Networks for Object Detection [J].

Lin, Tsung-Yi ;

Dollar, Piotr ;

Girshick, Ross ;

He, Kaiming ;

Hariharan, Bharath ;

Belongie, Serge .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :936-944

← 1 2 3 →