SASAN: Shape-Adaptive Set Abstraction Network for Point-Voxel 3D Object Detection

被引:6
|
作者
Zhang, Hui [1 ,2 ]
Luo, Guiyang [3 ]
Wang, Xiao [4 ]
Li, Yidong [1 ,2 ]
Ding, Weiping [5 ]
Wang, Fei-Yue [6 ]
机构
[1] Beijing Jiaotong Univ, Sch Comp & Informat Technol, Beijing 100044, Peoples R China
[2] Beijing Jiaotong Univ, Key Lab Big Data & Artificial Intelligence Transp, Minist Educ, Beijing 100044, Peoples R China
[3] Beijing Univ Posts & Telecommun, State Key Lab Networking & Switching Technol, Beijing 100876, Peoples R China
[4] Anhui Univ, Engn Res Ctr Autonomous Unmanned Syst Technol, Minist Educ, Hefei 230031, Peoples R China
[5] Nantong Univ, Sch Informat Sci & Technol, Nantong 226019, Peoples R China
[6] Chinese Acad Sci, Inst Automat, State Key Lab Management & Control Complex Syst, Beijing 100190, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature extraction; Three-dimensional displays; Detectors; Object detection; Proposals; Point cloud compression; Shape; 3D object detection; autonomous driving; point-voxel detectors; CNN;
D O I
10.1109/TNNLS.2023.3339889
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Point-voxel 3D object detectors have achieved impressive performance in complex traffic scenes. However, they utilize the 3D sparse convolution (spconv) layers with fixed receptive fields, such as voxel-based detectors, and inherit the fixed sphere radius from point-based methods for generating the features of keypoints, which make them weak in adaptively modeling various geometrical deformations and sizes of real objects. To tackle this issue, we propose a shape-adaptive set abstraction network (SASAN) for point-voxel 3D object detection. First, the proposal and offset generation module is adopted to learn the coordinates and confidences of 3D proposals and shape-adaptive offsets of the certain number of offset points for each voxel. Meanwhile, an extra offset supervision task is employed to guide the learning of shifting values of offset points, aiming at motivating the predicted offsets to preferably adapt to the various shapes of objects. Then, the shape-adaptive set abstraction module is proposed to extract multiscale keypoints features by grouping the neighboring offset points' features, as well as features learned from adjacent raw points and the 2-D bird-view map. Finally, the region of interest (RoI)-grid proposal refinement module is used to aggregate the keypoints features for further proposal refinement and confidence prediction. Extensive experiments on the competitive KITTI 3D detection benchmark demonstrate that the proposed SASAN gains superior performance as compared with state-of-the-art methods.
引用
收藏
页码:1 / 15
页数:15
相关论文
共 50 条
  • [21] 3D Point-Voxel Correlation Fields for Scene Flow Estimation
    Wang, Ziyi
    Wei, Yi
    Rao, Yongming
    Zhou, Jie
    Lu, Jiwen
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (11) : 13621 - 13635
  • [22] PVDECONV: POINT-VOXEL DECONVOLUTION FOR AUTOENCODING CAD CONSTRUCTION IN 3D
    Cherenkova, Kseniya
    Aouada, Djamila
    Gusev, Gleb
    2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 2741 - 2745
  • [23] Boundary-Aware Set Abstraction for 3D Object Detection
    Huang, Zhe
    Wang, Yongcai
    Tang, Xingui
    Sun, Hongyu
    2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [24] SI-RCNN: A Shape-Invariant Set-Abstraction for 3D Object detection
    Wu, Xiaoping
    Luo, Guiyang
    Yuan, Quan
    Li, Jinglin
    Yang, Fangchun
    2021 IEEE INTELLIGENT TRANSPORTATION SYSTEMS CONFERENCE (ITSC), 2021, : 2965 - 2971
  • [25] HPV-RCNN: Hybrid Point-Voxel Two-Stage Network for LiDAR Based 3-D Object Detection
    Feng, Chen
    Xiang, Chao
    Xie, Xiaopo
    Zhang, Yuan
    Yang, Mingchuan
    Li, Xuesong
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2023, 10 (06) : 3066 - 3076
  • [26] SASA: Semantics-Augmented Set Abstraction for Point-Based 3D Object Detection
    Chen, Chen
    Chen, Zhe
    Zhang, Jing
    Tao, Dacheng
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 221 - 229
  • [27] SAE3D: Set Abstraction Enhancement Network for 3D Object Detection Based Distance Features
    Zhang, Zheng
    Bao, Zhiping
    Tian, Qing
    Lyu, Zhuoyang
    SENSORS, 2024, 24 (01)
  • [28] PSA-Det3D: Pillar set abstraction for 3D object detection
    Huang, Zhicong
    Zheng, Zhijie
    Zhao, Jingwen
    Hu, Haifeng
    Wang, Zixin
    Chen, Dihu
    PATTERN RECOGNITION LETTERS, 2023, 168 : 138 - 145
  • [29] 3D point cloud object detection method in view of voxel based on graph convolution network
    Zhao Y.
    Arxidin A.
    Chen R.
    Zhou Y.
    Zhang Q.
    Hongwai yu Jiguang Gongcheng/Infrared and Laser Engineering, 2021, 50 (10):
  • [30] Voxel Transformer for 3D Object Detection
    Mao, Jiageng
    Xue, Yujing
    Niu, Minzhe
    Bai, Haoyue
    Feng, Jiashi
    Liang, Xiaodan
    Xu, Hang
    Xu, Chunjing
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 3144 - 3153