S2CNet: Semantic and Structure Completion Network for 3D Object Detection

Cited by: 0
Authors
Shi, Chao [1]
Zhang, Chongyang [1, 2]
Luo, Yan [1]
Qian, Zefeng [1]
Zhao, Muming [3]
Affiliations
[1] Shanghai Jiao Tong Univ, Dept Elect Engn, Shanghai 200240, Peoples R China
[2] Shanghai Jiao Tong Univ, AI Inst, MoE Key Lab Artificial Intelligence, Shanghai 200240, Peoples R China
[3] Beijing Forestry Univ, Sch Informat Sci & Technol, Beijing 200240, Peoples R China
Keywords
Feature extraction; Semantics; Proposals; Three-dimensional displays; Point cloud compression; Detectors; Object detection; 3D object detection; point cloud; feature completion; autonomous driving;
DOI
10.1109/TITS.2024.3429139
Chinese Library Classification number
TU [Architectural Science];
Discipline classification code
0813;
Abstract
LiDAR has become one of the primary sensors for 3D object detection in autonomous driving. However, due to the inherent sparsity of point clouds, certain objects exhibit structural incompleteness in occluded and distant areas, which hampers the accurate perception of objects in 3D space. To tackle this challenge, we propose the Semantic and Structure Completion Network (S2CNet) for 3D object detection. Concretely, we design the Semantic Completion (SeC) module to generate semantic features in Bird's-Eye-View (BEV) space, utilizing a teacher-student paradigm. Notably, we adopt a coarse-to-fine guidance strategy that encourages the student network to generate semantic features specifically within foreground regions, so that it focuses on foreground object features. In addition, we introduce an attention-based module to adaptively fuse the generated features and the raw features. The SeC module is limited when dealing with objects containing only a few points; in such cases, the network is prone to generating low-quality proposals with inaccurate localization. Complementary to the SeC module, we introduce the Structure Completion (StC) module, in which a group of structural proposals is obtained by traversing most structures in a structure-guided manner, so that at least one proposal with a structure similar to the ground truth can be guaranteed. Extensive experiments on the KITTI and nuScenes benchmarks demonstrate the effectiveness of our method, especially for hard-setting objects with fewer points.
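The abstract does not say how the attention-based fusion module is built. As a rough illustration only, the sketch below assumes the simplest form of the idea: each BEV cell gets a softmax weight for the raw features and one for the generated features, and the fused feature is their weighted sum. The function name `fuse_bev_features`, the projection matrix `w`, and the tensor shapes are hypothetical and are not taken from the paper.

```python
import numpy as np

def fuse_bev_features(raw, generated, w):
    """Fuse two BEV feature maps of shape (C, H, W) with per-cell attention.

    `w` is a hypothetical (2, 2C) projection that scores each source
    (raw vs. generated) at every BEV cell; it is an assumption of this
    sketch, not the paper's design.
    """
    stacked = np.concatenate([raw, generated], axis=0)    # (2C, H, W)
    logits = np.einsum("kc,chw->khw", w, stacked)         # (2, H, W)
    logits = logits - logits.max(axis=0, keepdims=True)   # numerical stability
    weights = np.exp(logits)
    weights /= weights.sum(axis=0, keepdims=True)         # softmax over the two sources
    fused = weights[0] * raw + weights[1] * generated     # (C, H, W)
    return fused, weights

# Usage with random placeholder data (4 channels on an 8 x 8 BEV grid).
rng = np.random.default_rng(0)
raw = rng.normal(size=(4, 8, 8))
generated = rng.normal(size=(4, 8, 8))
w = rng.normal(size=(2, 8))
fused, weights = fuse_bev_features(raw, generated, w)
```

With this formulation, cells where the raw features are already informative can keep most of their weight on the raw branch, while sparse cells can lean on the generated features. A trained network would learn `w` (or a richer scoring function) end to end.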
Pages: 17134 - 17146
Number of pages: 13
Related papers
50 in total
  • [21] FSD V2: Improving Fully Sparse 3D Object Detection With Virtual Voxels
    Fan, Lue
    Wang, Feng
    Wang, Naiyan
    Zhang, Zhaoxiang
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2025, 47 (02) : 1279 - 1292
  • [22] 3D Cascade RCNN: High Quality Object Detection in Point Clouds
    Cai, Qi
    Pan, Yingwei
    Yao, Ting
    Mei, Tao
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 5706 - 5719
  • [23] 3D MSSD: A multilayer spatial structure 3D object detection network for mobile LiDAR point clouds
    Wang, Zongyue
    Xia, Qiming
    Du, Jing
    Huang, Shangfeng
    Su, Jinhe
    Marcato Junior, Jose
    Li, Jonathan
    Cai, Guorong
    INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2021, 102
  • [24] SSA3D: Semantic Segmentation Assisted One-Stage Three-Dimensional Vehicle Object Detection
    Huang, Shangfeng
    Cai, Guorong
    Wang, Zongyue
    Xia, Qiming
    Wang, Ruisheng
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (09) : 14764 - 14778
  • [25] BiCo-Fusion: Bidirectional Complementary LiDAR-Camera Fusion for Semantic- and Spatial-Aware 3D Object Detection
    Song, Yang
    Wang, Lin
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2025, 10 (02) : 1457 - 1464
  • [26] RoIFusion: 3D Object Detection From LiDAR and Vision
    Chen, Can
    Fragonara, Luca Zanotti
    Tsourdos, Antonios
    IEEE ACCESS, 2021, 9 (09) : 51710 - 51721
  • [27] Shape-Aware Monocular 3D Object Detection
    Chen, Wei
    Zhao, Jie
    Zhao, Wan-Lei
    Wu, Song-Yuan
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (06) : 6416 - 6424
  • [28] VoPiFNet: Voxel-Pixel Fusion Network for Multi-Class 3D Object Detection
    Wang, Chia-Hung
    Chen, Hsueh-Wei
    Chen, Yi
    Hsiao, Pei-Yung
    Fu, Li-Chen
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (08) : 8527 - 8537
  • [29] CMAN: Leaning Global Structure Correlation for Monocular 3D Object Detection
    Cao, Yuanzhouhan
    Zhang, Hui
    Li, Yidong
    Ren, Chao
    Lang, Congyan
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (12) : 24727 - 24737
  • [30] AnchorPoint: Query Design for Transformer-Based 3D Object Detection and Tracking
    Liu, Hao
    Ma, Yanni
    Wang, Hanyun
    Zhang, Chaobo
    Guo, Yulan
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (10) : 10988 - 11000