Dynamic graph transformer for 3D object detection

Cited by: 20

Authors:

Ren, Siyuan [1]

Pan, Xiao [2]

Zhao, Wenjie [1]

Nie, Binling [3]

Han, Bo [1]

Affiliations:

[1] Zhejiang Univ, Sch Aeronaut & Astronaut, Hangzhou 310000, Zhejiang, Peoples R China

[2] Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou 310000, Zhejiang, Peoples R China

[3] Hangzhou Dianzi Univ, Hangzhou 310000, Zhejiang, Peoples R China

Source:

KNOWLEDGE-BASED SYSTEMS | 2023 / Vol. 259

Funding:

National Natural Science Foundation of China;

Keywords:

3D object detection; Point cloud; Transformer; Graph structure learning; Automatic driving;

DOI:

10.1016/j.knosys.2022.110085

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

LiDAR-based 3D detection is critical in autonomous driving perception systems. However, point-based 3D object detection that directly learns from point clouds is challenging owing to the sparsity and irregularity of LiDAR point clouds. Existing point-based methods are limited by fixed local relationships and the sparsity of distant and occluded objects. To address these issues, we propose a dynamic graph transformer 3D object detection network (DGT-Det3D) based on a dynamic graph transformer (DGT) module and a proposal-aware fusion (PAF) module. The DGT module is built on a dynamic graph and graph-aware self-attention module, which adaptively concentrates on the foreground points and encodes the graph to capture long-range dependencies. With the DGT module, DGT-Det3D has better capability to detect distant and occluded objects. To further refine the proposals, our PAF module fully integrates the proposal-aware spatial information and combines it with the point-wise semantic features from the first stage. Extensive experiments on the KITTI dataset demonstrate that our approach achieves state-of-the-art accuracy for point-based methods. In addition, DGT brings significant improvements when combined with state-of-the-art methods on the Waymo open dataset. © 2022 Elsevier B.V. All rights reserved.
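
To make the phrase "dynamic graph and graph-aware self-attention" more concrete, the following is a minimal illustrative sketch and not the authors' DGT module. It assumes (these are assumptions, not details from the abstract) that the graph is a k-nearest-neighbour graph rebuilt from the current point features at every layer, and that "graph-aware" means attention scores are masked to graph edges. The function names `knn_graph` and `graph_masked_attention` are hypothetical, and no learned projections are included.

```python
import numpy as np

def knn_graph(feats, k):
    """Boolean adjacency: adj[i, j] is True if j is one of the k nearest
    rows to row i in feature space. Self-loops are always enabled."""
    # Pairwise squared Euclidean distances, shape (N, N).
    d2 = ((feats[:, None, :] - feats[None, :, :]) ** 2).sum(axis=-1)
    idx = np.argsort(d2, axis=1)[:, :k]
    adj = np.zeros(d2.shape, dtype=bool)
    np.put_along_axis(adj, idx, True, axis=1)
    np.fill_diagonal(adj, True)  # guarantees every row has a valid entry
    return adj

def graph_masked_attention(feats, adj):
    """Scaled dot-product self-attention restricted to graph edges.
    Assumption: queries, keys and values are the raw features."""
    d = feats.shape[1]
    scores = feats @ feats.T / np.sqrt(d)
    scores = np.where(adj, scores, -np.inf)       # drop non-edges
    scores -= scores.max(axis=1, keepdims=True)   # numerical stability
    weights = np.exp(scores)
    weights /= weights.sum(axis=1, keepdims=True)
    return weights @ feats, weights

# "Dynamic" here means the graph is recomputed from the updated features.
rng = np.random.default_rng(0)
x = rng.normal(size=(16, 4))   # 16 hypothetical points, 4-dim features
for _ in range(2):             # two stacked layers
    adj = knn_graph(x, k=4)
    x, attn = graph_masked_attention(x, adj)
```

Because the neighbours are chosen in feature space rather than by a fixed spatial radius, two points far apart in the scene can still be connected in later layers, which is one common way such designs reach long-range dependencies.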


Pages: 11
