Dynamic graph transformer for 3D object detection

被引:20
作者
Ren, Siyuan [1 ]
Pan, Xiao [2 ]
Zhao, Wenjie [1 ]
Nie, Binling [3 ]
Han, Bo [1 ]
机构
[1] Zhejiang Univ, Sch Aeronaut & Astronaut, Hangzhou 310000, Zhejiang, Peoples R China
[2] Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou 310000, Zhejiang, Peoples R China
[3] Hangzhou Dianzi Univ, Hangzhou 310000, Zhejiang, Peoples R China
基金
中国国家自然科学基金;
关键词
3D object detection; Point cloud; Transformer; Graph structure learning; Automatic driving;
D O I
10.1016/j.knosys.2022.110085
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
LiDAR-based 3D detection is critical in autonomous driving perception systems. However, point-based 3D object detection that directly learns from point clouds is challenging owing to the sparsity and irregularity of LiDAR point clouds. Existing point-based methods are limited by fixed local relationships and the sparsity of distant and occluded objects. To address these issues, we propose a dynamic graph transformer 3D object detection network (DGT-Det3D) based on a dynamic graph transformer (DGT) module and a proposal-aware fusion (PAF) module. The DGT module is built on a dynamic graph and graph-aware self-attention module, which adaptively concentrates on the foreground points and encodes the graph to capture long-range dependencies. With the DGT module, DGT-Det3D has better capability to detect distant and occluded objects. To further refine the proposals, our PAF module fully integrates the proposal-aware spatial information and combines it with the point-wise semantic features from the first stage. Extensive experiments on the KITTI dataset demonstrate that our approach achieves state-of-the-art accuracy for point-based methods. In addition, DGT brings significant improvements when combined with state-of-the-art methods on the Waymo open dataset.(c) 2022 Elsevier B.V. All rights reserved.
引用
收藏
页数:11
相关论文
共 50 条
  • [31] RVT: Robotic View Transformer for 3D Object Manipulation
    Goyal, Ankit
    Xu, Jie
    Guo, Yijie
    Blukis, Valts
    Chao, Yu-Wei
    Fox, Dieter
    CONFERENCE ON ROBOT LEARNING, VOL 229, 2023, 229
  • [32] Graph-DETR3D: Rethinking Overlapping Regions for Multi-View 3D Object Detection
    Chen, Zehui
    Li, Zhenyu
    Zhang, Shiquan
    Fang, Liangji
    Jiang, Qinhong
    Zhao, Feng
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 5999 - 6008
  • [33] MonoPSTR: Monocular 3-D Object Detection With Dynamic Position and Scale-Aware Transformer
    Yang, Fan
    He, Xuan
    Chen, Wenrui
    Zhou, Pengjie
    Li, Zhiyong
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73
  • [34] TBFNT3D: Two-Branch Fusion Network With Transformer for Multimodal Indoor 3D Object Detection
    Cheng, Jun
    Zhang, Sheng
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (10) : 6523 - 6530
  • [35] CT3D++: Improving 3D Object Detection with Keypoint-Induced Channel-wise Transformer
    Sheng, Hualian
    Cai, Sijia
    Zhao, Na
    Deng, Bing
    Liang, Qiao
    Zhao, Min-Jian
    Ye, Jieping
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2025, : 4817 - 4836
  • [36] Graph-DETR4D: Spatio-Temporal Graph Modeling for Multi-View 3D Object Detection
    Chen, Zehui
    Chen, Zheng
    Li, Zhenyu
    Zhang, Shiquan
    Fang, Liangji
    Jiang, Qinhong
    Wu, Feng
    Zhao, Feng
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 4488 - 4500
  • [37] Transformer-Based Stereo-Aware 3D Object Detection From Binocular Images
    Sun, Hanqing
    Pang, Yanwei
    Cao, Jiale
    Xie, Jin
    Li, Xuelong
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (12) : 19675 - 19687
  • [38] Graph Transformer for 3D point clouds classification and semantic segmentation
    Zhou, Wei
    Wang, Qian
    Jin, Weiwei
    Shi, Xinzhe
    He, Ying
    COMPUTERS & GRAPHICS-UK, 2024, 124
  • [39] Real-Time 3D Single Object Tracking With Transformer
    Shan, Jiayao
    Zhou, Sifan
    Cui, Yubo
    Fang, Zheng
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 2339 - 2353
  • [40] A Systematic Survey of Transformer-Based 3D Object Detection for Autonomous Driving: Methods, Challenges and Trends
    Zhu, Minling
    Gong, Yadong
    Tian, Chunwei
    Zhu, Zuyuan
    DRONES, 2024, 8 (08)