PillarNet: Real-Time and High-Performance Pillar-Based 3D Object Detection

被引:108
作者
Shi, Guangsheng [1 ]
Li, Ruifeng [1 ]
Ma, Chao [2 ]
机构
[1] Harbin Inst Technol, State Key Lab Robot & Syst, Harbin, Peoples R China
[2] Shanghai Jiao Tong Univ, MoE Key Lab Artificial Intelligence, AI Inst, Shanghai, Peoples R China
来源
COMPUTER VISION, ECCV 2022, PT X | 2022年 / 13670卷
关键词
3D object detection; Point cloud; Autonomous driving;
D O I
10.1007/978-3-031-20080-9_3
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Real-time and high-performance 3D object detection is of critical importance for autonomous driving. Recent top-performing 3D object detectors mainly rely on point-based or 3D voxel-based convolutions, which are both computationally inefficient for onboard deployment. In contrast, pillar-based methods use solely 2D convolutions, which consume less computation resources, but they lag far behind their voxel-based counterparts in detection accuracy. In this paper, by examining the primary performance gap between pillar- and voxel-based detectors, we develop a real-time and high-performance pillar-based detector, dubbed PillarNet. The proposed PillarNet consists of a powerful encoder network for effective pillar feature learning, a neck network for spatial-semantic feature fusion and the commonly used detect head. Using only 2D convolutions, PillarNet is flexible to an optional pillar size and compatible with classical 2D CNN backbones, such as VGGNet and ResNet. Additionally, PillarNet benefits from our designed orientation-decoupled IoU regression loss along with the IoU-aware prediction branch. Extensive experimental results on the large-scale nuScenes Dataset and Waymo Open Dataset demonstrate that the proposed PillarNet performs well over state-of-the-art 3D detectors in terms of effectiveness and efficiency.
引用
收藏
页码:35 / 52
页数:18
相关论文
共 53 条
[1]  
Bewley A., 2020, C ROBOT LEARNING COR
[2]   nuScenes: A multimodal dataset for autonomous driving [J].
Caesar, Holger ;
Bankiti, Varun ;
Lang, Alex H. ;
Vora, Sourabh ;
Liong, Venice Erin ;
Xu, Qiang ;
Krishnan, Anush ;
Pan, Yu ;
Baldan, Giancarlo ;
Beijbom, Oscar .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :11618-11628
[3]  
Chen Q., 2021, arXiv
[4]   Object as Hotspots: An Anchor-Free 3D Object Detection Approach via Firing of Hotspots [J].
Chen, Qi ;
Sun, Lin ;
Wang, Zhixin ;
Jia, Kui ;
Yuille, Alan .
COMPUTER VISION - ECCV 2020, PT XXI, 2020, 12366 :68-84
[5]  
Chen Qi, 2020, Adv Neural Inf Process Syst, V33, P21224
[6]   Multi-View 3D Object Detection Network for Autonomous Driving [J].
Chen, Xiaozhi ;
Ma, Huimin ;
Wan, Ji ;
Li, Bo ;
Xia, Tian .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :6526-6534
[7]  
Deng JJ, 2021, AAAI CONF ARTIF INTE, V35, P1201
[8]  
Fan Lue, 2021, ICCV
[9]  
Graham B, 2017, Arxiv, DOI arXiv:1706.01307
[10]   3D Semantic Segmentation with Submanifold Sparse Convolutional Networks [J].
Graham, Benjamin ;
Engelcke, Martin ;
van der Maaten, Laurens .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :9224-9232