Dynamic graph transformer for 3D object detection

被引：20

作者：

Ren, Siyuan ^{[1
]}

Pan, Xiao ^{[2
]}

Zhao, Wenjie ^{[1
]}

Nie, Binling ^{[3
]}

Han, Bo ^{[1
]}

机构：

[1] Zhejiang Univ, Sch Aeronaut & Astronaut, Hangzhou 310000, Zhejiang, Peoples R China

[2] Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou 310000, Zhejiang, Peoples R China

[3] Hangzhou Dianzi Univ, Hangzhou 310000, Zhejiang, Peoples R China

来源：

KNOWLEDGE-BASED SYSTEMS | 2023年 / 259卷

基金：

中国国家自然科学基金;

关键词：

3D object detection; Point cloud; Transformer; Graph structure learning; Automatic driving;

D O I：

10.1016/j.knosys.2022.110085

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

LiDAR-based 3D detection is critical in autonomous driving perception systems. However, point-based 3D object detection that directly learns from point clouds is challenging owing to the sparsity and irregularity of LiDAR point clouds. Existing point-based methods are limited by fixed local relationships and the sparsity of distant and occluded objects. To address these issues, we propose a dynamic graph transformer 3D object detection network (DGT-Det3D) based on a dynamic graph transformer (DGT) module and a proposal-aware fusion (PAF) module. The DGT module is built on a dynamic graph and graph-aware self-attention module, which adaptively concentrates on the foreground points and encodes the graph to capture long-range dependencies. With the DGT module, DGT-Det3D has better capability to detect distant and occluded objects. To further refine the proposals, our PAF module fully integrates the proposal-aware spatial information and combines it with the point-wise semantic features from the first stage. Extensive experiments on the KITTI dataset demonstrate that our approach achieves state-of-the-art accuracy for point-based methods. In addition, DGT brings significant improvements when combined with state-of-the-art methods on the Waymo open dataset.(c) 2022 Elsevier B.V. All rights reserved.

引用

页数：11

共 50 条

[1] Voxel Transformer for 3D Object Detection
Mao, Jiageng
Xue, Yujing
Niu, Minzhe
Bai, Haoyue
Feng, Jiashi
Liang, Xiaodan
Xu, Hang
Xu, Chunjing
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 3144 - 3153
[2] Efficient Transformer-based 3D Object Detection with Dynamic Token Halting
Ye, Mao
Meyer, Gregory P.
Chai, Yuning
Liu, Qiang
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 8404 - 8416
[3] Multimodal Transformer for Automatic 3D Annotation and Object Detection
Liu, Chang
Qian, Xiaoyan
Huang, Binxiao
Qi, Xiaojuan
Lam, Edmund
Tan, Siew-Chong
Wong, Ngai
COMPUTER VISION, ECCV 2022, PT XXXVIII, 2022, 13698 : 657 - 673
[4] SEFormer: Structure Embedding Transformer for 3D Object Detection
Feng, Xiaoyu
Du, Heming
Fan, Hehe
Duan, Yueqi
Liu, Yongpan
THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 1, 2023, : 632 - 640
[5] Long-Short Range Adaptive Transformer With Dynamic Sampling for 3D Object Detection
Wang, Chuxin
Deng, Jiacheng
He, Jianfeng
Zhang, Tianzhu
Zhang, Zhe
Zhang, Yongdong
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (12) : 7616 - 7629
[6] DGFormer: Dynamic graph transformer for 3D human pose estimation
Chen, Zhangmeng
Dai, Ju
Bai, Junxuan
Pan, Junjun
PATTERN RECOGNITION, 2024, 152
[7] PointGAT: Graph attention networks for 3D object detection
Zhou H.
Wang W.
Liu G.
Zhou Q.
Intelligent and Converged Networks, 2022, 3 (02): : 204 - 216
[8] 3D Object Detection Method Combining on Graph Sampling and Graph Attention
Li, Wenju
Chu, Wanghui
Cui, Liu
Su, Pan
Zhang, Gan
Computer Engineering and Applications, 2023, 59 (09) : 237 - 244
[9] Graph Neural Network and Spatiotemporal Transformer Attention for 3D Video Object Detection From Point Clouds
Yin, Junbo
Shen, Jianbing
Gao, Xin
Crandall, David J.
Yang, Ruigang
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (08) : 9822 - 9835
[10] Object DGCNN: 3D Object Detection using Dynamic Graphs
Wang, Yue
Solomon, Justin
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34

← 1 2 3 4 5 →