Real-Time Point Cloud Object Detection via Voxel-Point Geometry Abstraction

被引：11

作者：

Shi, Guangsheng ^{[1
]}

Wang, Ke ^{[1
]}

Li, Ruifeng ^{[1
]}

Ma, Chao ^{[2
]}

机构：

[1] Harbin Inst Technol, State Key Lab Robot & Syst, Harbin 150001, Peoples R China

[2] Shanghai Jiao Tong Univ, AI Inst, MoE Key Lab Artificial Intelligence, Shanghai 200240, Peoples R China

来源：

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS | 2023年 / 24卷 / 06期

基金：

中国国家自然科学基金;

关键词：

Three-dimensional displays; Proposals; Point cloud compression; Feature extraction; Object detection; Representation learning; Geometry; 3D object detection; deep learning; autonomous driving; point clouds; NETWORKS; VEHICLE;

D O I：

10.1109/TITS.2023.3259582

中图分类号：

TU [建筑科学];

学科分类号：

0813 ;

摘要：

Recent advances in 3D object detection typically learn voxel-based or point-based representations on point clouds. Point-based methods preserve precise point positions but incur high computational load, whereas voxel-based methods rasterize unordered points into voxel grids efficiently but give rise to an accuracy bottleneck. To take advantage of voxel-and point-based representations, we develop an effective and efficient 3D object detector via a novel voxel-point geometry abstraction scheme. Our motivation is to use coarse voxel representation to accelerate proposal generation while using precise point representation to facilitate proposal refinement. For voxel representation learning, we propose a context enrichment module with a novel 3D sparse interpolation layer to augment raw points with multi-scale context. We further develop a point-based RoI pooling module with explicit position augmentation for proposal refinement. Extensive experiments on the widely used KITTI Dataset and the latest Waymo Open Dataset show that the proposed algorithm outperforms state-of-the-art point-voxel-based methods while running at 24 FPS on the TITAN XP GPU.

引用

页码：5971 / 5982

页数：12

共 58 条

[31] Deep Learning-Based Vehicle Behavior Prediction for Autonomous Driving Applications: A Review [J].

Mozaffari, Sajjad ;

Al-Jarrah, Omar Y. ;

Dianati, Mehrdad ;

Jennings, Paul ;

Mouzakitis, Alexandros .

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (01) :33-47

[32] Deep Learning for Safe Autonomous Driving: Current Challenges and Future Directions [J].

Muhammad, Khan ;

Ullah, Amin ;

Lloret, Jaime ;

Del Ser, Javier ;

de Albuquerque, Victor Hugo C. .

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2021, 22 (07) :4316-4336

[33]

Muresan MP, 2017, INT C INTELL COMP CO, P317, DOI 10.1109/ICCP.2017.8117023

[34]

Qi C R, 2017, P 31 INT C NEUR INF, V2017, P5099

[35]

Qi CR, 2017, ADV NEUR IN, V30

[36] Frustum PointNets for 3D Object Detection from RGB-D Data [J].

Qi, Charles R. ;

Liu, Wei ;

Wu, Chenxia ;

Su, Hao ;

Guibas, Leonidas J. .

2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :918-927

[37] Autonomous Vehicles That Interact With Pedestrians: A Survey of Theory and Practice [J].

Rasouli, Amir ;

Tsotsos, John K. .

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2020, 21 (03) :900-918

[38] Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks [J].

Ren, Shaoqing ;

He, Kaiming ;

Girshick, Ross ;

Sun, Jian .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (06) :1137-1149

[39]

Shi SS, 2020, PROC CVPR IEEE, P10526, DOI 10.1109/CVPR42600.2020.01054

[40] PointRCNN: 3D Object Proposal Generation and Detection from Point Cloud [J].

Shi, Shaoshuai ;

Wang, Xiaogang ;

Li, Hongsheng .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :770-779

← 1 2 3 4 5 6 →