F-PVNet: Frustum-Level 3-D Object Detection on Point-Voxel Feature Representation for Autonomous Driving

被引:6
作者
Tao, Chongben [1 ]
Fu, Shiping [1 ]
Wang, Chen [1 ]
Luo, Xizhao [2 ]
Li, Huayi [1 ]
Gao, Zhen [3 ]
Zhang, Zufeng [4 ]
Zheng, Sifa [4 ]
机构
[1] Suzhou Univ Sci & Technol, Sch Elect & Informat Engn, Suzhou 215009, Peoples R China
[2] Soochow Univ, Sch Comp Sci & Technol, Suzhou 215006, Peoples R China
[3] McMaster Univ, Fac Engn, Hamilton, ON L8S 0A3, Canada
[4] Tsinghua Univ, Sch Vehicle & Mobil, Beijing 100084, Peoples R China
基金
中国博士后科学基金; 中国国家自然科学基金;
关键词
Three-dimensional displays; Feature extraction; Point cloud compression; Object detection; Heuristic algorithms; Estimation; Proposals; 3-D object detection; autonomous driving; fully convolutional network (FCN); point voxel fusion; sliding frustum;
D O I
10.1109/JIOT.2022.3231369
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Current 3-D object detection technology for autonomous driving usually cannot efficiently utilize local sensitive points. Meanwhile, contextual feature extracted from a object is not sufficient, which easily leads to deteriorated detection accuracy of the final object estimation. For the problems, a point-voxel-based 3-D dynamic object detection algorithm is proposed. First, local points are grouped with a camera frustum. Then, the global feature extracted by the submanifold 3-D voxel CNNs is aggregated into frustum key points. Second, a module of vector pool with feature aggregation is used to aggregate multiscale features of the point cloud. Moreover, the frustum raw feature and BEV feature are used for feature extension. Subsequently, the fine multiscale feature extracted from the point cloud is used as input to a subsequent fully convolutional network for final classification and continuous estimation of oriented 3-D boxes. The proposed method was compared with other state-of-the-art algorithms on the KITTI, Waymo, and nuScenes data sets. Experimental results showed that the proposed algorithm was better in accuracy, robustness, and generalization capabilities in 3-D dynamic object detection. Experiments on a real scenario and extensive ablation studies also demonstrated that the proposed algorithm not only effectively controls computational cost but also achieved more efficient results in 3-D object detection.
引用
收藏
页码:8031 / 8045
页数:15
相关论文
共 59 条
  • [1] Bijelic Mario, 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Proceedings, P11679, DOI 10.1109/CVPR42600.2020.01170
  • [2] Caesar H, 2020, PROC CVPR IEEE, P11618, DOI 10.1109/CVPR42600.2020.01164
  • [3] Cascading Scene and Viewpoint Feature Learning for Pedestrian Gender Recognition
    Cai, Lei
    Zeng, Huanqiang
    Zhu, Jianqing
    Cao, Jiuwen
    Wang, Yongtao
    Ma, Kai-Kuang
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (04) : 3014 - 3026
  • [4] You Only Look One-level Feature
    Chen, Qiang
    Wang, Yingming
    Yang, Tong
    Zhang, Xiangyu
    Cheng, Jian
    Sun, Jian
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 13034 - 13043
  • [5] Multi-View 3D Object Detection Network for Autonomous Driving
    Chen, Xiaozhi
    Ma, Huimin
    Wan, Ji
    Li, Bo
    Xia, Tian
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 6526 - 6534
  • [6] Focal Sparse Convolutional Networks for 3D Object Detection
    Chen, Yukang
    Li, Yanwei
    Zhang, Xiangyu
    Sun, Jian
    Jia, Jiaya
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 5418 - 5427
  • [7] Shape Completion using 3D-Encoder-Predictor CNNs and Shape Synthesis
    Dai, Angela
    Qi, Charles Ruizhongtai
    Niessner, Matthias
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 6545 - 6554
  • [8] Geiger A, 2012, PROC CVPR IEEE, P3354, DOI 10.1109/CVPR.2012.6248074
  • [9] Fast R-CNN
    Girshick, Ross
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 1440 - 1448
  • [10] CoFF: Cooperative Spatial Feature Fusion for 3-D Object Detection on Autonomous Vehicles
    Guo, Jingda
    Carrillo, Dominic
    Tang, Sihai
    Chen, Qi
    Yang, Qing
    Fu, Song
    Wang, Xi
    Wang, Nannan
    Palacharla, Paparao
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (14) : 11078 - 11087