S2CNet: Semantic and Structure Completion Network for 3D Object Detection

被引:0
|
作者
Shi, Chao [1 ]
Zhang, Chongyang [1 ,2 ]
Luo, Yan [1 ]
Qian, Zefeng [1 ]
Zhao, Muming [3 ]
机构
[1] Shanghai Jiao Tong Univ, Dept Elect Engn, Shanghai 200240, Peoples R China
[2] Shanghai Jiao Tong Univ, AI Inst, MoE Key Lab Artificial Intelligence, Shanghai 200240, Peoples R China
[3] Beijing Forestry Univ, Sch Informat Sci & Technol, Beijing 200240, Peoples R China
关键词
Feature extraction; Semantics; Proposals; Three-dimensional displays; Point cloud compression; Detectors; Object detection; 3D object detection; point cloud; feature completion; autonomous driving;
D O I
10.1109/TITS.2024.3429139
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
LiDAR has become one of the primary 3D object detection sensors in autonomous driving. However, due to the inherent sparsity of point clouds, certain objects exhibit structure incompleteness in occluded and distant areas, which hampers the accurate perception of objects in 3D space. To tackle this challenge, we propose Semantic and Structure Completion Network (S(2)CNet) for 3D object detection. Concretely, we design the Semantic Completion (SeC) module to generate semantic features in Bird's-Eye-View (BEV) space, utilizing a teacher-student paradigm. Notably, we adopt a coarse-to-fine guidance strategy to encourage student network to generate semantic features specifically within foreground regions. This ensures that the student network focuses on the generation of foreground object features. Besides, we introduce an attention-based module to adaptively fuse the generated features and raw features. SeC module faces particular limitation when dealing with objects containing only a few points, in such case, the network is prone to generating low quality proposals with inaccurate localization. Complementary to SeC module, we introduce the Structure Completion (StC) module, in which a group of structural proposals are obtained by traversing most structures in a structure-guided manner, and thus at least one proposal with ground truth similar structure can be guaranteed. Extensive experiments on the KITTI and nuScenes benchmarks demonstrate the effectiveness of our method, especially for the hard setting objects with fewer points.
引用
收藏
页码:17134 / 17146
页数:13
相关论文
共 50 条
  • [31] STFNET: Sparse Temporal Fusion for 3D Object Detection in LiDAR Point Cloud
    Meng, Xin
    Zhou, Yuan
    Ma, Jun
    Jiang, Fangdi
    Qi, Yongze
    Wang, Cui
    Kim, Jonghyuk
    Wang, Shifeng
    IEEE SENSORS JOURNAL, 2025, 25 (03) : 5866 - 5877
  • [32] Semantic Shape and Trajectory Reconstruction for Monocular Cooperative 3D Object Detection
    Cserni, Marton
    Rovid, Andras
    IEEE ACCESS, 2024, 12 : 167153 - 167167
  • [33] Transformer3D-Det: Improving 3D Object Detection by Vote Refinement
    Zhao, Lichen
    Guo, Jinyang
    Xu, Dong
    Sheng, Lu
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (12) : 4735 - 4746
  • [34] SVDnet: Singular Value Control and Distance Alignment Network for 3D Object Detection
    Chang, Ming-Jen
    Cheng, Chih-Jen
    Hsiao, Ching-Chun
    Li, Yung-Hui
    Huang, Ching-Chun
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (09) : 9281 - 9295
  • [35] Exploring Diversity-Based Active Learning for 3D Object Detection in Autonomous Driving
    Lin, Jinpeng
    Liang, Zhihao
    Deng, Shengheng
    Cai, Lile
    Jiang, Tao
    Li, Tianrui
    Jia, Kui
    Xu, Xun
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (11) : 15454 - 15466
  • [36] SOFW: A Synergistic Optimization Framework for Indoor 3D Object Detection
    Dai, Kun
    Jiang, Zhiqiang
    Xie, Tao
    Wang, Ke
    Liu, Dedong
    Fan, Zhendong
    Li, Ruifeng
    Zhao, Lijun
    Omar, Mohamed
    IEEE TRANSACTIONS ON MULTIMEDIA, 2025, 27 : 637 - 651
  • [37] D-S Augmentation: Density-Semantics Augmentation for 3-D Object Detection
    Liu, Zhiqiang
    Shi, Peicheng
    Qi, Heng
    Yang, Aixi
    IEEE SENSORS JOURNAL, 2023, 23 (03) : 2760 - 2772
  • [38] Geometry-Guided Point Generation for 3D Object Detection
    Wang, Kai
    Zhou, Mingliang
    Lin, Qing
    Niu, Guanglin
    Zhang, Xiaowei
    IEEE SIGNAL PROCESSING LETTERS, 2025, 32 : 136 - 140
  • [39] P2V-RCNN: Point to Voxel Feature Learning for 3D Object Detection From Point Clouds
    Li, Jiale
    Sun, Yu
    Luo, Shujie
    Zhu, Ziqi
    Dai, Hang
    Krylov, Andrey S.
    Ding, Yong
    Shao, Ling
    IEEE ACCESS, 2021, 9 : 98249 - 98260
  • [40] Multi-Sem Fusion: Multimodal Semantic Fusion for 3-D Object Detection
    Xu, Shaoqing
    Li, Fang
    Song, Ziying
    Fang, Jin
    Wang, Sifen
    Yang, Zhi-Xin
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 14