S2CNet: Semantic and Structure Completion Network for 3D Object Detection

Cited by: 0

Authors:

Shi, Chao [1]

Zhang, Chongyang [1,2]

Luo, Yan [1]

Qian, Zefeng [1]

Zhao, Muming [3]

Affiliations:

[1] Shanghai Jiao Tong Univ, Dept Elect Engn, Shanghai 200240, Peoples R China

[2] Shanghai Jiao Tong Univ, AI Inst, MoE Key Lab Artificial Intelligence, Shanghai 200240, Peoples R China

[3] Beijing Forestry Univ, Sch Informat Sci & Technol, Beijing 200240, Peoples R China

Source:

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS | 2024, Vol. 25, No. 11

Keywords:

Feature extraction; Semantics; Proposals; Three-dimensional displays; Point cloud compression; Detectors; Object detection; 3D object detection; point cloud; feature completion; autonomous driving

DOI:

10.1109/TITS.2024.3429139

Chinese Library Classification (CLC) number:

TU [Building Science]

Discipline classification code:

0813

Abstract:

LiDAR has become one of the primary sensors for 3D object detection in autonomous driving. However, due to the inherent sparsity of point clouds, objects in occluded and distant areas often appear structurally incomplete, which hampers accurate perception of objects in 3D space. To tackle this challenge, we propose the Semantic and Structure Completion Network (S2CNet) for 3D object detection. Concretely, we design the Semantic Completion (SeC) module to generate semantic features in Bird's-Eye-View (BEV) space using a teacher-student paradigm. Notably, we adopt a coarse-to-fine guidance strategy that encourages the student network to generate semantic features specifically within foreground regions, so that it focuses on foreground object features. In addition, we introduce an attention-based module to adaptively fuse the generated features with the raw features. The SeC module is limited when dealing with objects that contain only a few points: in such cases, the network is prone to generating low-quality proposals with inaccurate localization. To complement the SeC module, we introduce the Structure Completion (StC) module, which obtains a group of structural proposals by traversing most structures in a structure-guided manner, so that at least one proposal with a structure similar to the ground truth can be guaranteed. Extensive experiments on the KITTI and nuScenes benchmarks demonstrate the effectiveness of our method, especially for hard-setting objects with fewer points.


Pages: 17134-17146

Number of pages: 13

