SAT-GCN: Self-attention graph convolutional network-based 3D object detection for autonomous driving

被引：57

作者：

Wang, Li ^{[1
,2
]}

Song, Ziying ^{[3
]}

Zhang, Xinyu ^{[1
,2
]}

Wang, Chenfei ^{[1
,2
]}

Zhang, Guoxin ^{[4
]}

Zhu, Lei ^{[5
]}

Li, Jun ^{[1
,2
]}

Liu, Huaping ^{[6
,7
]}

机构：

[1] Tsinghua Univ, State Key Lab Automot Safety & Energy, Beijing 100084, Peoples R China

[2] Tsinghua Univ, Sch Vehicle & Mobil, Beijing 100084, Peoples R China

[3] Beijing Jiaotong Univ, Sch Comp & Informat Technol, Beijing 100044, Peoples R China

[4] Hebei Univ Sci & Technol, Sch Informat Sci & Engn, Shijiazhuang 050018, Peoples R China

[5] Mogo Auto Intelligence & Telemet Informat Technol, Beijing 100013, Peoples R China

[6] Tsinghua Univ, State Key Lab Intelligent Technol & Syst, Beijing 100084, Peoples R China

[7] Tsinghua Univ, Dept Comp Sci & Technol, Beijing 100084, Peoples R China

来源：

KNOWLEDGE-BASED SYSTEMS | 2023年 / 259卷

基金：

国家高技术研究发展计划(863计划); 中国国家自然科学基金;

关键词：

3D object detection; Graph convolutional network; Self-attention mechanism; VEHICLE DETECTION; POINT CLOUD; LIDAR;

D O I：

10.1016/j.knosys.2022.110080

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Accurate 3D object detection from point clouds is critical for autonomous vehicles. However, point cloud data collected by LiDAR sensors are inherently sparse, especially at long distances. In addition, most existing 3D object detectors extract local features and ignore interactions between features, producing weak semantic information that significantly limits detection performance. We propose a self-attention graph convolutional network (SAT-GCN), which utilizes a GCN and self-attention to enhance semantic representations by aggregating neighborhood information and focusing on vital relationships. SAT-GCN consists of three modules: vertex feature extraction (VFE), self-attention with dimension reduction (SADR), and far distance feature suppression (FDFS). VFE extracts neighboring relationships between features using GCN after encoding a raw point cloud. SADR performs further weight augmentation for crucial neighboring relationships through self-attention. FDFS suppresses meaningless edges formed by sparse point cloud distributions in remote areas and generates corre-sponding global features. Extensive experiments are conducted on the widely used KITTI and nuScenes 3D object detection benchmarks. The results demonstrate significant improvements in mainstream methods, PointPillars, SECOND, and PointRCNN, improving the mean of AP 3D by 4.88%, 5.02%, and 2.79% on KITTI test dataset. SAT-GCN can boost the detection accuracy of the point cloud, especially at medium and long distances. Furthermore, adding the SAT-GCN module has a limited impact on the real-time performance and model parameters.(c) 2022 Elsevier B.V. All rights reserved.

引用

页数：13

共 50 条

[31] GSAN: Graph Self-Attention Network for Learning Spatial-Temporal Interaction Representation in Autonomous Driving
Ye, Luyao
Wang, Zezhong
Chen, Xinhong
Wang, Jianping
Wu, Kui
Lu, Kejie
IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (12) : 9190 - 9204
[32] R-CNN Based 3D Object Detection for Autonomous Driving
Hu, Hongyu
Zhao, Tongtong
Wang, Qi
Gao, Fei
He, Lei
CICTP 2020: TRANSPORTATION EVOLUTION IMPACTING FUTURE MOBILITY, 2020, : 918 - 929
[33] Point-Level Fusion and Channel Attention for 3D Object Detection in Autonomous Driving
Shen, Juntao
Fang, Zheng
Huang, Jin
SENSORS, 2025, 25 (04)
[34] Monocular 3D Object Detection for Autonomous Driving Based on Contextual Transformer
She, Xiangyang
Yan, Weijia
Dong, Lihong
Computer Engineering and Applications, 2024, 60 (19) : 178 - 189
[35] SODA: Similar 3D Object Detection Accelerator at Network Edge for Autonomous Driving
Xu, Wenquan
Song, Haoyu
Hou, Linyang
Zheng, Hui
Zheng, Xinggong
Zhang, Chuwen
Hu, Wei
Wang, Yi
Liu, Bin
IEEE CONFERENCE ON COMPUTER COMMUNICATIONS (IEEE INFOCOM 2021), 2021,
[36] Stereo CenterNet-based 3D object detection for autonomous driving
Shi, Yuguang
Guo, Yu
Mi, Zhenqiang
Li, Xinjie
NEUROCOMPUTING, 2022, 471 : 219 - 229
[37] 3D Object Detection for Autonomous Driving: A Practical Survey
Ramajo-Ballester, Alvaro
de la Escalera Hueso, Arturo
Armingol Moreno, Jose Maria
PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON VEHICLE TECHNOLOGY AND INTELLIGENT TRANSPORT SYSTEMS, VEHITS 2023, 2023, : 64 - 73
[38] 3D Object Detection for Autonomous Driving: A Comprehensive Survey
Jiageng Mao
Shaoshuai Shi
Xiaogang Wang
Hongsheng Li
International Journal of Computer Vision, 2023, 131 : 1909 - 1963
[39] 3D Object Detection for Autonomous Driving: A Comprehensive Survey
Mao, Jiageng
Shi, Shaoshuai
Wang, Xiaogang
Li, Hongsheng
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2023, 131 (08) : 1909 - 1963
[40] On Offline Evaluation of 3D Object Detection for Autonomous Driving
Schreier, Tim
Renz, Katrin
Geiger, Andreas
Chitta, Kashyap
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW, 2023, : 4086 - 4091

← 1 2 3 4 5 →