S2CNet: Semantic and Structure Completion Network for 3D Object Detection

Citations: 0
Authors
Shi, Chao [1]
Zhang, Chongyang [1, 2]
Luo, Yan [1]
Qian, Zefeng [1]
Zhao, Muming [3]
Affiliations
[1] Shanghai Jiao Tong Univ, Dept Elect Engn, Shanghai 200240, Peoples R China
[2] Shanghai Jiao Tong Univ, AI Inst, MoE Key Lab Artificial Intelligence, Shanghai 200240, Peoples R China
[3] Beijing Forestry Univ, Sch Informat Sci & Technol, Beijing 100083, Peoples R China
Keywords
Feature extraction; Semantics; Proposals; Three-dimensional displays; Point cloud compression; Detectors; Object detection; 3D object detection; point cloud; feature completion; autonomous driving
DOI
10.1109/TITS.2024.3429139
Chinese Library Classification
TU [Building Science]
Discipline classification code
0813
Abstract
LiDAR has become one of the primary sensors for 3D object detection in autonomous driving. However, due to the inherent sparsity of point clouds, certain objects exhibit structural incompleteness in occluded and distant areas, which hampers accurate perception of objects in 3D space. To tackle this challenge, we propose the Semantic and Structure Completion Network (S2CNet) for 3D object detection. Concretely, we design the Semantic Completion (SeC) module to generate semantic features in Bird's-Eye-View (BEV) space using a teacher-student paradigm. Notably, we adopt a coarse-to-fine guidance strategy that encourages the student network to generate semantic features specifically within foreground regions, so that it focuses on foreground object features. In addition, we introduce an attention-based module to adaptively fuse the generated features with the raw features. The SeC module has a particular limitation when dealing with objects containing only a few points: in such cases, the network is prone to generating low-quality proposals with inaccurate localization. Complementary to the SeC module, we introduce the Structure Completion (StC) module, in which a group of structural proposals is obtained by traversing most structures in a structure-guided manner, so that at least one proposal whose structure is similar to the ground truth can be guaranteed. Extensive experiments on the KITTI and nuScenes benchmarks demonstrate the effectiveness of our method, especially for hard-setting objects with fewer points.
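As a rough illustration of two ideas named in the abstract (imitating teacher features only inside foreground regions, and adaptively fusing generated features with raw features), the following Python sketch may help. It is not the authors' implementation: the function names, the single-channel 2D BEV grids, the mean-squared-error loss, and the per-cell softmax scoring with `w_raw` and `w_gen` are all assumptions made for this example.

```python
import math


def foreground_imitation_loss(student, teacher, fg_mask):
    """Mean squared error between student and teacher BEV features,
    averaged only over cells whose mask value is truthy (foreground).
    All three arguments are 2D lists of the same shape (an assumption)."""
    total, count = 0.0, 0
    for s_row, t_row, m_row in zip(student, teacher, fg_mask):
        for s, t, m in zip(s_row, t_row, m_row):
            if m:
                total += (s - t) ** 2
                count += 1
    # Return 0.0 when the mask contains no foreground cells.
    return total / count if count else 0.0


def attention_fuse(raw, generated, w_raw=1.0, w_gen=1.0):
    """Blend raw and generated features cell by cell.
    A two-way softmax over hypothetical scores (w_raw * raw, w_gen * generated)
    yields a weight alpha in (0, 1); the output is a convex combination."""
    fused = []
    for r_row, g_row in zip(raw, generated):
        row = []
        for r, g in zip(r_row, g_row):
            a, b = w_raw * r, w_gen * g
            m = max(a, b)  # subtract the max for numerical stability
            ea, eb = math.exp(a - m), math.exp(b - m)
            alpha = ea / (ea + eb)
            row.append(alpha * r + (1.0 - alpha) * g)
        fused.append(row)
    return fused


if __name__ == "__main__":
    student = [[0.0, 1.0], [2.0, 3.0]]
    teacher = [[1.0, 1.0], [2.0, 5.0]]
    fg_mask = [[1, 0], [0, 1]]
    print(foreground_imitation_loss(student, teacher, fg_mask))
    print(attention_fuse(student, teacher))
```

In a real detector the scores would come from learned layers over multi-channel feature maps; the fixed scalar weights here only stand in for that.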
Pages: 17134 - 17146
Page count: 13
Related papers
50 in total
  • [1] SCDA-Net: Structure Completion and Density Awareness Network for LiDAR-Based 3D Object Detection
    Wu, Shuwen
    Yang, Jinfu
    Ma, Jiaqi
    Zhang, Shaochen
    Hao, Tianhao
    Li, Mingai
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2025, 10 (05) : 4268 - 4275
  • [2] Super Sparse 3D Object Detection
    Fan, Lue
    Yang, Yuxue
    Wang, Feng
    Wang, Naiyan
    Zhang, Zhaoxiang
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (10) : 12490 - 12505
  • [3] Relation Graph Network for 3D Object Detection in Point Clouds
    Feng, Mingtao
    Gilani, Syed Zulqarnain
    Wang, Yaonan
    Zhang, Liang
    Mian, Ajmal
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 92 - 107
  • [4] SACINet: Semantic-Aware Cross-Modal Interaction Network for Real-Time 3D Object Detection
    Yang, Ying
    Yin, Hui
    Chong, Ai-Xin
    Wan, Jin
    Liu, Qing-Yi
    IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2024, 9 (02) : 3917 - 3927
  • [5] Local-to-Global Semantic Learning for Multi-View 3D Object Detection From Point Cloud
    Qiao, Renzhong
    Ji, Hongbing
    Zhu, Zhigang
    Zhang, Wenbo
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (10) : 9371 - 9385
  • [6] SASAN: Shape-Adaptive Set Abstraction Network for Point-Voxel 3D Object Detection
    Zhang, Hui
    Luo, Guiyang
    Wang, Xiao
    Li, Yidong
    Ding, Weiping
    Wang, Fei-Yue
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, : 2465 - 2479
  • [7] PSNS-SSD: Pixel-Level Suppressed Nonsalient Semantic and Multicoupled Channel Enhancement Attention for 3D Object Detection
    Song, Xiaogang
    Zhou, Zhenhua
    Zhang, Lei
    Lu, Xiaofeng
    Hei, Xinhong
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (01) : 603 - 610
  • [8] Semantic Scene Completion With 2D and 3D Feature Fusion
    Park, Sang-Min
    Ha, Jong-Eun
    IEEE ACCESS, 2024, 12 : 141594 - 141603
  • [9] WSSIC-Net: Weakly-Supervised Semantic Instance Completion of 3D Point Cloud Scenes
    Fu, Zhiheng
    Guo, Yulan
    Chen, Minglin
    Hu, Qingyong
    Laga, Hamid
    Boussaid, Farid
    Bennamoun, Mohammed
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2025, 34 : 2008 - 2019
  • [10] ESGN: Efficient Stereo Geometry Network for Fast 3D Object Detection
    Gao, Aqi
    Pang, Yanwei
    Nie, Jing
    Shao, Zhuang
    Cao, Jiale
    Guo, Yishun
    Li, Xuelong
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (04) : 2000 - 2009