S2CNet: Semantic and Structure Completion Network for 3D Object Detection

被引:0
|
作者
Shi, Chao [1 ]
Zhang, Chongyang [1 ,2 ]
Luo, Yan [1 ]
Qian, Zefeng [1 ]
Zhao, Muming [3 ]
机构
[1] Shanghai Jiao Tong Univ, Dept Elect Engn, Shanghai 200240, Peoples R China
[2] Shanghai Jiao Tong Univ, AI Inst, MoE Key Lab Artificial Intelligence, Shanghai 200240, Peoples R China
[3] Beijing Forestry Univ, Sch Informat Sci & Technol, Beijing 200240, Peoples R China
关键词
Feature extraction; Semantics; Proposals; Three-dimensional displays; Point cloud compression; Detectors; Object detection; 3D object detection; point cloud; feature completion; autonomous driving;
D O I
10.1109/TITS.2024.3429139
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
LiDAR has become one of the primary 3D object detection sensors in autonomous driving. However, due to the inherent sparsity of point clouds, certain objects exhibit structure incompleteness in occluded and distant areas, which hampers the accurate perception of objects in 3D space. To tackle this challenge, we propose Semantic and Structure Completion Network (S(2)CNet) for 3D object detection. Concretely, we design the Semantic Completion (SeC) module to generate semantic features in Bird's-Eye-View (BEV) space, utilizing a teacher-student paradigm. Notably, we adopt a coarse-to-fine guidance strategy to encourage student network to generate semantic features specifically within foreground regions. This ensures that the student network focuses on the generation of foreground object features. Besides, we introduce an attention-based module to adaptively fuse the generated features and raw features. SeC module faces particular limitation when dealing with objects containing only a few points, in such case, the network is prone to generating low quality proposals with inaccurate localization. Complementary to SeC module, we introduce the Structure Completion (StC) module, in which a group of structural proposals are obtained by traversing most structures in a structure-guided manner, and thus at least one proposal with ground truth similar structure can be guaranteed. Extensive experiments on the KITTI and nuScenes benchmarks demonstrate the effectiveness of our method, especially for the hard setting objects with fewer points.
引用
收藏
页码:17134 / 17146
页数:13
相关论文
共 50 条
  • [41] ContextNet: Leveraging Comprehensive Contextual Information for Enhanced 3D Object Detection
    Pei, Caiyan
    Zhang, Shuai
    Cao, Lijun
    Zhao, Liqiang
    IEEE ACCESS, 2024, 12 : 106744 - 106756
  • [42] Multimodal 3D Object Detection Based on Sparse Interaction in Internet of Vehicles
    Li, Hui
    Ge, Tongao
    Bai, Keqiang
    Nie, Gaofeng
    Xu, Lingwei
    Ai, Xiaoxue
    Cao, Song
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2025, 74 (02) : 2174 - 2186
  • [43] Collaborative 3D Object Detection for Autonomous Vehicles via Learnable Communications
    Wang, J.
    Zeng, Y.
    Gong, Y.
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (09) : 9804 - 9816
  • [44] BEVFusion With Dual Hard Instance Probing for Multimodal 3D Object Detection
    Kim, Taeho
    Kim, Joohee
    IEEE ACCESS, 2025, 13 : 25546 - 25556
  • [45] DO-SA&R: Distant Object Augmented Set Abstraction and Regression for Point-Based 3D Object Detection
    He, Xuan
    Wang, Zian
    Lin, Jiacheng
    Nai, Ke
    Yuan, Jin
    Li, Zhiyong
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 5852 - 5864
  • [46] CoBEV: Elevating Roadside 3D Object Detection With Depth and Height Complementarity
    Shi, Hao
    Pang, Chengshan
    Zhang, Jiaming
    Yang, Kailun
    Wu, Yuhao
    Ni, Huajian
    Lin, Yining
    Stiefelhagen, Rainer
    Wang, Kaiwei
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 5424 - 5439
  • [47] MD3D: Mixture-Density-Based 3D Object Detection in Point Clouds
    Choi, Jaeseok
    Song, Yeji
    Kim, Yerim
    Yoo, Jaeyoung
    Kwak, Nojun
    IEEE ACCESS, 2022, 10 : 104011 - 104022
  • [48] Scalable 3D Object Detection Pipeline With Center-Based Sequential Feature Aggregation for Intelligent Vehicles
    Jiang, Qi
    Hu, Chuan
    Zhao, Baixuan
    Huang, Yonghui
    Zhang, Xi
    IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2024, 9 (01): : 1512 - 1523
  • [49] Monocular 3D Object Detection With Motion Feature Distillation
    Hu, Henan
    Li, Muyu
    Zhu, Ming
    Gao, Wen
    Liu, Peiyu
    Chan, Kwok-Leung
    IEEE ACCESS, 2023, 11 : 82933 - 82945
  • [50] MonoGRNet: A General Framework for Monocular 3D Object Detection
    Qin, Zengyi
    Wang, Jinglu
    Lu, Yan
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (09) : 5170 - 5184