Dynamic graph transformer for 3D object detection

被引:20
|
作者
Ren, Siyuan [1 ]
Pan, Xiao [2 ]
Zhao, Wenjie [1 ]
Nie, Binling [3 ]
Han, Bo [1 ]
机构
[1] Zhejiang Univ, Sch Aeronaut & Astronaut, Hangzhou 310000, Zhejiang, Peoples R China
[2] Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou 310000, Zhejiang, Peoples R China
[3] Hangzhou Dianzi Univ, Hangzhou 310000, Zhejiang, Peoples R China
基金
中国国家自然科学基金;
关键词
3D object detection; Point cloud; Transformer; Graph structure learning; Automatic driving;
D O I
10.1016/j.knosys.2022.110085
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
LiDAR-based 3D detection is critical in autonomous driving perception systems. However, point-based 3D object detection that directly learns from point clouds is challenging owing to the sparsity and irregularity of LiDAR point clouds. Existing point-based methods are limited by fixed local relationships and the sparsity of distant and occluded objects. To address these issues, we propose a dynamic graph transformer 3D object detection network (DGT-Det3D) based on a dynamic graph transformer (DGT) module and a proposal-aware fusion (PAF) module. The DGT module is built on a dynamic graph and graph-aware self-attention module, which adaptively concentrates on the foreground points and encodes the graph to capture long-range dependencies. With the DGT module, DGT-Det3D has better capability to detect distant and occluded objects. To further refine the proposals, our PAF module fully integrates the proposal-aware spatial information and combines it with the point-wise semantic features from the first stage. Extensive experiments on the KITTI dataset demonstrate that our approach achieves state-of-the-art accuracy for point-based methods. In addition, DGT brings significant improvements when combined with state-of-the-art methods on the Waymo open dataset.(c) 2022 Elsevier B.V. All rights reserved.
引用
收藏
页数:11
相关论文
共 50 条
  • [21] 3DMOTFormer: Graph Transformer for Online 3D Multi-Object Tracking
    Ding, Shuxiao
    Rehder, Eike
    Schneider, Lukas
    Cordts, Marius
    Gall, Juergen
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 9750 - 9760
  • [22] Transformer3D-Det: Improving 3D Object Detection by Vote Refinement
    Zhao, Lichen
    Guo, Jinyang
    Xu, Dong
    Sheng, Lu
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (12) : 4735 - 4746
  • [23] Graph Convolutional Networks for 3D Object Detection on Radar Data
    Meyer, Michael
    Kuschk, Georg
    Tomforde, Sven
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2021), 2021, : 3053 - 3062
  • [24] A Hierarchical Graph Network for 3D Object Detection on Point Clouds
    Chen, Jintai
    Lei, Biwen
    Song, Qingyu
    Ying, Haochao
    Chen, Danny Z.
    Wu, Jian
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 389 - 398
  • [25] GRAPH-BASED APPROACH FOR 3D OBJECT DUPLICATE DETECTION
    Vajda, Peter
    Dufaux, Frederic
    Minh, Thien Ha
    Ebrahimi, Touradj
    2009 10TH INTERNATIONAL WORKSHOP ON IMAGE ANALYSIS FOR MULTIMEDIA INTERACTIVE SERVICES, 2009, : 254 - 257
  • [26] F-Transformer: Point Cloud Fusion Transformer for Cooperative 3D Object Detection
    Wang, Jie
    Luo, Guiyang
    Yuan, Quan
    Li, Jinglin
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2022, PT I, 2022, 13529 : 171 - 182
  • [27] Relation Graph Network for 3D Object Detection in Point Clouds
    Feng, Mingtao
    Gilani, Syed Zulqarnain
    Wang, Yaonan
    Zhang, Liang
    Mian, Ajmal
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 92 - 107
  • [28] CasFormer: Cascaded Transformer Based on Dynamic Voxel Pyramid for 3D Object Detection from Point Clouds
    Li, Xinglong
    Zhang, Xiaowei
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT III, 2024, 14427 : 299 - 311
  • [29] ReAGFormer: Reaggregation Transformer with Affine Group Features for 3D Object Detection
    Lu, Chenguang
    Yue, Kang
    Liu, Yue
    COMPUTER VISION - ACCV 2022, PT I, 2023, 13841 : 262 - 279
  • [30] BEV transformer for visual 3D object detection applied with retentive mechanism
    Pan, Jincheng
    Huang, Xiaoci
    Luo, Suyun
    Ma, Fang
    TRANSACTIONS OF THE INSTITUTE OF MEASUREMENT AND CONTROL, 2025,