Dynamic graph transformer for 3D object detection

被引:21
作者
Ren, Siyuan [1 ]
Pan, Xiao [2 ]
Zhao, Wenjie [1 ]
Nie, Binling [3 ]
Han, Bo [1 ]
机构
[1] Zhejiang Univ, Sch Aeronaut & Astronaut, Hangzhou 310000, Zhejiang, Peoples R China
[2] Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou 310000, Zhejiang, Peoples R China
[3] Hangzhou Dianzi Univ, Hangzhou 310000, Zhejiang, Peoples R China
基金
中国国家自然科学基金;
关键词
3D object detection; Point cloud; Transformer; Graph structure learning; Automatic driving;
D O I
10.1016/j.knosys.2022.110085
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
LiDAR-based 3D detection is critical in autonomous driving perception systems. However, point-based 3D object detection that directly learns from point clouds is challenging owing to the sparsity and irregularity of LiDAR point clouds. Existing point-based methods are limited by fixed local relationships and the sparsity of distant and occluded objects. To address these issues, we propose a dynamic graph transformer 3D object detection network (DGT-Det3D) based on a dynamic graph transformer (DGT) module and a proposal-aware fusion (PAF) module. The DGT module is built on a dynamic graph and graph-aware self-attention module, which adaptively concentrates on the foreground points and encodes the graph to capture long-range dependencies. With the DGT module, DGT-Det3D has better capability to detect distant and occluded objects. To further refine the proposals, our PAF module fully integrates the proposal-aware spatial information and combines it with the point-wise semantic features from the first stage. Extensive experiments on the KITTI dataset demonstrate that our approach achieves state-of-the-art accuracy for point-based methods. In addition, DGT brings significant improvements when combined with state-of-the-art methods on the Waymo open dataset.(c) 2022 Elsevier B.V. All rights reserved.
引用
收藏
页数:11
相关论文
共 50 条
  • [41] Temporal-Channel Transformer for 3D Lidar-Based Video Object Detection for Autonomous Driving
    Yuan, Zhenxun
    Song, Xiao
    Bai, Lei
    Wang, Zhe
    Ouyang, Wanli
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (04) : 2068 - 2078
  • [42] DS-Trans: A 3D Object Detection Method Based on a Deformable Spatiotemporal Transformer for Autonomous Vehicles
    Zhu, Yuan
    Xu, Ruidong
    Tao, Chongben
    An, Hao
    Wang, Huaide
    Sun, Zhipeng
    Lu, Ke
    REMOTE SENSING, 2024, 16 (09)
  • [43] 3D Object Detection Based on LiDAR Data
    Sahba, Ramin
    Sahba, Amin
    Jamshidi, Mo
    Rad, Paul
    2019 IEEE 10TH ANNUAL UBIQUITOUS COMPUTING, ELECTRONICS & MOBILE COMMUNICATION CONFERENCE (UEMCON), 2019, : 511 - 514
  • [44] Improved Two-Stage 3D Object Detection Algorithm for Roadside Scenes with Enhanced PointPillars and Transformer
    Wang Liangzi
    Huang Miaohua
    Liu Ruoying
    Bi Chengcheng
    Hu Yongkang
    LASER & OPTOELECTRONICS PROGRESS, 2024, 61 (18)
  • [45] Improved 3D Object Detection Based on PointPillars
    Kong, Weiwei
    Du, Yusheng
    He, Leilei
    Li, Zejiang
    ELECTRONICS, 2024, 13 (15)
  • [46] 3D Fast Object Detection Based on Discriminant Images and Dynamic Distance Threshold Clustering
    Chen, Baifan
    Chen, Hong
    Yuan, Dian
    Yu, Lingli
    SENSORS, 2020, 20 (24) : 1 - 19
  • [47] Towards Accurate Microstructure Estimation via 3D Hybrid Graph Transformer
    Yang, Junqing
    Jiang, Haotian
    Tassew, Tewodros
    Sun, Peng
    Ma, Jiquan
    Xia, Yong
    Yap, Pew-Thian
    Chen, Geng
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT VIII, 2023, 14227 : 25 - 34
  • [48] 3D Siamese Transformer Network for Single Object Tracking on Point Clouds
    Hui, Le
    Wang, Lingpeng
    Tang, Linghua
    Lan, Kaihao
    Xie, Jin
    Yang, Jian
    COMPUTER VISION - ECCV 2022, PT II, 2022, 13662 : 293 - 310
  • [49] Multi-Correlation Siamese Transformer Network With Dense Connection for 3D Single Object Tracking
    Feng, Shihao
    Liang, Pengpeng
    Gao, Jin
    Cheng, Erkang
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (12) : 8066 - 8073
  • [50] DTSSD: Dual-Channel Transformer-Based Network for Point-Based 3D Object Detection
    Zheng, Zhijie
    Huang, Zhicong
    Zhao, Jingwen
    Hu, Haifeng
    Chen, Dihu
    IEEE SIGNAL PROCESSING LETTERS, 2023, 30 : 798 - 802