SEGANet: 3D object detection with shape-enhancement and geometry-aware network

Cited by: 5

Authors:

Zhou, Jing [1]

Hu, Yiyu [1]

Lai, Zhongyuan [1]

Wang, Tianjiang [2]

Affiliations:

[1] Jianghan Univ, Sch Artificial Intelligence, Wuhan 430056, Hubei, Peoples R China

[2] Huazhong Univ Sci & Technol, Sch Comp Sci & Technol, Wuhan 430074, Hubei, Peoples R China

Source:

COMPUTERS & ELECTRICAL ENGINEERING | 2023 / Vol. 110

Funding:

National Natural Science Foundation of China;

Keywords:

3D object detection; Weakly sensing objects; Point cloud completion; Sparse attention; Transformer;

DOI:

10.1016/j.compeleceng.2023.108888

Chinese Library Classification (CLC) number:

TP3 [Computing technology, computer technology];

Discipline classification code:

0812;

Abstract:

3D object detection from point clouds has developed rapidly. However, point clouds in real scenes are unevenly distributed, so distant or occluded objects often contain too few points to be perceived, which degrades overall detection accuracy. We therefore propose a novel two-stage 3D object detection framework, the Shape-Enhancement and Geometry-Aware Network (SEGANet), which aims to mitigate the negative impact of unbalanced point distribution and thereby boost detection performance. In stage 1, we capture fine-grained structural knowledge with the assistance of point-wise features from voxels to generate proposals. In stage 2, we construct a shape enhancement module that reconstructs complete surface points for objects within proposals, then establish a geometric relevance-aware Transformer module that aggregates highly correlated feature pairs of the reconstructed and known parts and decodes vital geometric relations of the aggregated features. Critical geometric clues are thus supplied for objects at both the data and feature levels, yielding enhanced features for box refinement. Extensive experiments on the KITTI and Waymo datasets show that SEGANet achieves low model complexity and excellent detection accuracy, outperforming the baseline method by 2.18% in overall detection accuracy and by 1.8% in average accuracy on weakly sensing objects. This verifies that SEGANet effectively alleviates the impact of point imbalance and significantly boosts detection performance.

Pages: 19
