SASAN: Shape-Adaptive Set Abstraction Network for Point-Voxel 3D Object Detection

被引:6
|
作者
Zhang, Hui [1 ,2 ]
Luo, Guiyang [3 ]
Wang, Xiao [4 ]
Li, Yidong [1 ,2 ]
Ding, Weiping [5 ]
Wang, Fei-Yue [6 ]
机构
[1] Beijing Jiaotong Univ, Sch Comp & Informat Technol, Beijing 100044, Peoples R China
[2] Beijing Jiaotong Univ, Key Lab Big Data & Artificial Intelligence Transp, Minist Educ, Beijing 100044, Peoples R China
[3] Beijing Univ Posts & Telecommun, State Key Lab Networking & Switching Technol, Beijing 100876, Peoples R China
[4] Anhui Univ, Engn Res Ctr Autonomous Unmanned Syst Technol, Minist Educ, Hefei 230031, Peoples R China
[5] Nantong Univ, Sch Informat Sci & Technol, Nantong 226019, Peoples R China
[6] Chinese Acad Sci, Inst Automat, State Key Lab Management & Control Complex Syst, Beijing 100190, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature extraction; Three-dimensional displays; Detectors; Object detection; Proposals; Point cloud compression; Shape; 3D object detection; autonomous driving; point-voxel detectors; CNN;
D O I
10.1109/TNNLS.2023.3339889
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Point-voxel 3D object detectors have achieved impressive performance in complex traffic scenes. However, they utilize the 3D sparse convolution (spconv) layers with fixed receptive fields, such as voxel-based detectors, and inherit the fixed sphere radius from point-based methods for generating the features of keypoints, which make them weak in adaptively modeling various geometrical deformations and sizes of real objects. To tackle this issue, we propose a shape-adaptive set abstraction network (SASAN) for point-voxel 3D object detection. First, the proposal and offset generation module is adopted to learn the coordinates and confidences of 3D proposals and shape-adaptive offsets of the certain number of offset points for each voxel. Meanwhile, an extra offset supervision task is employed to guide the learning of shifting values of offset points, aiming at motivating the predicted offsets to preferably adapt to the various shapes of objects. Then, the shape-adaptive set abstraction module is proposed to extract multiscale keypoints features by grouping the neighboring offset points' features, as well as features learned from adjacent raw points and the 2-D bird-view map. Finally, the region of interest (RoI)-grid proposal refinement module is used to aggregate the keypoints features for further proposal refinement and confidence prediction. Extensive experiments on the competitive KITTI 3D detection benchmark demonstrate that the proposed SASAN gains superior performance as compared with state-of-the-art methods.
引用
收藏
页码:1 / 15
页数:15
相关论文
共 50 条
  • [41] PVF-NET: Point & Voxel Fusion 3D Object Detection Framework for Point Cloud
    Cui, Zhihao
    Zhang, Zhenhua
    2020 17TH CONFERENCE ON COMPUTER AND ROBOT VISION (CRV 2020), 2020, : 125 - 133
  • [42] IPVNet: Learning implicit point-voxel features for open-surface 3D reconstruction
    Arshad, Mohammad Samiul
    Beksi, William J.
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2023, 97
  • [43] 改进Point-Voxel特征提取的3D小目标检测
    李宇轩
    陈壹华
    温兴
    严彬彬
    张航
    微电子学与计算机, 2023, 40 (02) : 50 - 58
  • [44] OA-Net: outlier weakening and adaptive voxel encoding-based 3d object detection network
    Chuanxu Wang
    Jianwei Qin
    Xiaoshan Fu
    Multimedia Tools and Applications, 2024, 83 : 36433 - 36453
  • [45] OA-Net: outlier weakening and adaptive voxel encoding-based 3d object detection network
    Wang, Chuanxu
    Qin, Jianwei
    Fu, Xiaoshan
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (12) : 36433 - 36453
  • [46] PP-RCNN: Point-Pillars Feature Set Abstraction for 3D Real-time Object Detection
    Tu, Jiayin
    Wang, Ping
    Liu, Fuqiang
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [47] 3D object detection network based on symmetric shape generation
    Tu X.
    Zheng S.
    Yu S.
    Li W.
    Yi Qi Yi Biao Xue Bao/Chinese Journal of Scientific Instrument, 2023, 44 (06): : 252 - 263
  • [48] Adaptive learning point cloud and image diversity feature fusion network for 3D object detection
    Yan, Weiqing
    Liu, Shile
    Liu, Hao
    Yue, Guanghui
    Wang, Xuan
    Song, Yongchao
    Xu, Jindong
    COMPLEX & INTELLIGENT SYSTEMS, 2024, 10 (02) : 2825 - 2837
  • [49] Adaptive learning point cloud and image diversity feature fusion network for 3D object detection
    Weiqing Yan
    Shile Liu
    Hao Liu
    Guanghui Yue
    Xuan Wang
    Yongchao Song
    Jindong Xu
    Complex & Intelligent Systems, 2024, 10 : 2825 - 2837
  • [50] Planar object detection from 3D point clouds based on pyramid voxel representation
    Hu, Zhaozheng
    Bai, Dongfang
    MULTIMEDIA TOOLS AND APPLICATIONS, 2017, 76 (22) : 24343 - 24357