3DSSD: Point-based 3D Single Stage Object Detector

被引:831
作者
Yang, Zetong [1 ]
Sun, Yanan [2 ]
Liu, Shu [3 ]
Jia, Jiaya [1 ,3 ]
机构
[1] Chinese Univ Hong Kong, Hong Kong, Peoples R China
[2] Hong Kong Univ Sci & Technol, Hong Kong, Peoples R China
[3] SmartMore, Hong Kong, Peoples R China
来源
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020) | 2020年
关键词
D O I
10.1109/CVPR42600.2020.01105
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Prevalence of voxel-based 3D single-stage detectors contrast with underexplored point-based methods. In this paper, we present a lightweight point-based 3D single stage object detector 3DSSD to achieve decent balance of accuracy and efficiency. In this paradigm, all upsampling layers and the refinement stage, which are indispensable in all existing point-based methods, are abandoned. We instead propose a fusion sampling strategy in downsampling process to make detection on less representative points feasible. A delicate box prediction network, including a candidate generation layer and an anchor-free regression head with a 3D center-ness assignment strategy, is developed to meet the demand of high accuracy and speed. Our 3DSSD paradigm is an elegant single-stage anchor-free one. We evaluate it on widely used KITTI dataset and more challenging nuScenes dataset. Our method outperforms all state-of-the-art voxel-based single-stage methods by a large margin, and even yields comparable performance with two-stage point-based methods, with amazing inference speed of 25+ FPS, 2x faster than former state-of-the-art point-based methods.
引用
收藏
页码:11037 / 11045
页数:9
相关论文
共 50 条
[41]   Embracing Single Stride 3D Object Detector with Sparse Transformer [J].
Fan, Lue ;
Pang, Ziqi ;
Zhang, Tianyuan ;
Wang, Yu-Xiong ;
Zhao, Hang ;
Wang, Feng ;
Wang, Naiyan ;
Zhang, Zhaoxiang .
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, :8448-8458
[42]   Effective 3D object detection based on detector and tracker [J].
Nie, Weizhi ;
Liu, Anan ;
Wang, Zhongyang ;
Su, Yuting .
NEUROCOMPUTING, 2016, 215 :63-70
[43]   STD: Sparse-to-Dense 3D Object Detector for Point Cloud [J].
Yang, Zetong ;
Sun, Yanan ;
Liu, Shu ;
Shen, Xiaoyong ;
Jia, Jiaya .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :1951-1960
[44]   A 3D HUMAN TEETH DATABASE CONSTRUCTION BASED ON A POINT-BASED SHAPE REGISTRATION [J].
Abdelmunim, H. ;
Chen, D. ;
Farag, A. A. ;
Pusateri, R. ;
Carter, C. ;
Miller, M. ;
Farman, A. ;
Tasman, D. .
2011 18TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2011, :1617-1620
[45]   A Single Stage and Single View 3D Point Cloud Reconstruction Network Based on DetNet [J].
Li, Bin ;
Zhu, Shiao ;
Lu, Yi .
SENSORS, 2022, 22 (21)
[46]   Point-Based Deep Neural Network for 3D Facial Expression Recognition [J].
Trimech, Imen Hamrouni ;
Maalej, Ahmed ;
Ben Amara, Najoua Essoukri .
2020 INTERNATIONAL CONFERENCE ON CYBERWORLDS (CW 2020), 2020, :164-171
[47]   Generation of point-based 3D statistical shape models for anatomical objects [J].
Lorenz, C ;
Krahnstöver, N .
COMPUTER VISION AND IMAGE UNDERSTANDING, 2000, 77 (02) :175-191
[48]   Evolutionary optimization of feature representation for 3D point-based model classification [J].
Tong, Xin ;
Wong, Hau-san ;
Ma, Bo ;
Ip, Horace H. S. .
18TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 2, PROCEEDINGS, 2006, :707-+
[49]   HVPR: Hybrid Voxel-Point Representation for Single-stage 3D Object Detection [J].
Noh, Jongyoun ;
Lee, Sanghoon ;
Ham, Bumsub .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :14600-14609
[50]   Efficient Point-based Pattern Search in 3D Motion Capture Databases [J].
Beecks, Christian ;
Grass, Alexander .
2018 IEEE 6TH INTERNATIONAL CONFERENCE ON FUTURE INTERNET OF THINGS AND CLOUD (FICLOUD 2018), 2018, :230-235