SASAN: Shape-Adaptive Set Abstraction Network for Point-Voxel 3D Object Detection

被引:6
|
作者
Zhang, Hui [1 ,2 ]
Luo, Guiyang [3 ]
Wang, Xiao [4 ]
Li, Yidong [1 ,2 ]
Ding, Weiping [5 ]
Wang, Fei-Yue [6 ]
机构
[1] Beijing Jiaotong Univ, Sch Comp & Informat Technol, Beijing 100044, Peoples R China
[2] Beijing Jiaotong Univ, Key Lab Big Data & Artificial Intelligence Transp, Minist Educ, Beijing 100044, Peoples R China
[3] Beijing Univ Posts & Telecommun, State Key Lab Networking & Switching Technol, Beijing 100876, Peoples R China
[4] Anhui Univ, Engn Res Ctr Autonomous Unmanned Syst Technol, Minist Educ, Hefei 230031, Peoples R China
[5] Nantong Univ, Sch Informat Sci & Technol, Nantong 226019, Peoples R China
[6] Chinese Acad Sci, Inst Automat, State Key Lab Management & Control Complex Syst, Beijing 100190, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature extraction; Three-dimensional displays; Detectors; Object detection; Proposals; Point cloud compression; Shape; 3D object detection; autonomous driving; point-voxel detectors; CNN;
D O I
10.1109/TNNLS.2023.3339889
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Point-voxel 3D object detectors have achieved impressive performance in complex traffic scenes. However, they utilize the 3D sparse convolution (spconv) layers with fixed receptive fields, such as voxel-based detectors, and inherit the fixed sphere radius from point-based methods for generating the features of keypoints, which make them weak in adaptively modeling various geometrical deformations and sizes of real objects. To tackle this issue, we propose a shape-adaptive set abstraction network (SASAN) for point-voxel 3D object detection. First, the proposal and offset generation module is adopted to learn the coordinates and confidences of 3D proposals and shape-adaptive offsets of the certain number of offset points for each voxel. Meanwhile, an extra offset supervision task is employed to guide the learning of shifting values of offset points, aiming at motivating the predicted offsets to preferably adapt to the various shapes of objects. Then, the shape-adaptive set abstraction module is proposed to extract multiscale keypoints features by grouping the neighboring offset points' features, as well as features learned from adjacent raw points and the 2-D bird-view map. Finally, the region of interest (RoI)-grid proposal refinement module is used to aggregate the keypoints features for further proposal refinement and confidence prediction. Extensive experiments on the competitive KITTI 3D detection benchmark demonstrate that the proposed SASAN gains superior performance as compared with state-of-the-art methods.
引用
收藏
页码:1 / 15
页数:15
相关论文
共 50 条
  • [1] PV-RCNN++: Point-Voxel Feature Set Abstraction With Local Vector Representation for 3D Object Detection
    Shaoshuai Shi
    Li Jiang
    Jiajun Deng
    Zhe Wang
    Chaoxu Guo
    Jianping Shi
    Xiaogang Wang
    Hongsheng Li
    International Journal of Computer Vision, 2023, 131 : 531 - 551
  • [2] ASPVNet: Attention Based Sparse Point-Voxel Network for 3D Object Detection
    Yu, Bingxin
    Wang, Lu
    He, Yuhong
    Wang, Xiaoyang
    Cheng, Jun
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT X, 2025, 15040 : 161 - 176
  • [3] PV-RCNN plus plus : Point-Voxel Feature Set Abstraction With Local Vector Representation for 3D Object Detection
    Shi, Shaoshuai
    Jiang, Li
    Deng, Jiajun
    Wang, Zhe
    Guo, Chaoxu
    Shi, Jianping
    Wang, Xiaogang
    Li, Hongsheng
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2023, 131 (02) : 531 - 551
  • [4] HCPVF: Hierarchical Cascaded Point-Voxel Fusion for 3D Object Detection
    Fan, Baojie
    Zhang, Kexin
    Tian, Jiandong
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (10) : 8997 - 9009
  • [5] Point-Voxel Fusion for Multimodal 3D Detection
    Wang, Ke
    Zhang, Zhichuang
    2022 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2022, : 1716 - 1719
  • [6] Point-Voxel Based Geometry-Adaptive Network for 3D Point Cloud Analysis
    Zhao, Tian-Meng
    Zeng, Hui
    Zhang, Bao-Qing
    Liu, Hong-Min
    Fan, Bin
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2024, 39 (05) : 1167 - 1179
  • [7] 3D Shape Generation and Completion through Point-Voxel Diffusion
    Zhou, Linqi
    Du, Yilun
    Wu, Jiajun
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 5806 - 5815
  • [8] PV-RCNN++: semantical point-voxel feature interaction for 3D object detection
    Peng Wu
    Lipeng Gu
    Xuefeng Yan
    Haoran Xie
    Fu Lee Wang
    Gary Cheng
    Mingqiang Wei
    The Visual Computer, 2023, 39 (6) : 2425 - 2440
  • [9] JPV-Net: Joint Point-Voxel Representations for Accurate 3D Object Detection
    Song, Nan
    Jiang, Tianyuan
    Yao, Jian
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 2271 - 2279
  • [10] Point-Voxel and Bird-Eye-View Representation Aggregation Network for Single Stage 3D Object Detection
    Ning, Kanglin
    Liu, Yanfei
    Su, Yanzhao
    Jiang, Ke
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (03) : 3223 - 3235