Dynamic graph transformer for 3D object detection

被引：21

作者：

Ren, Siyuan ^{[1
]}

Pan, Xiao ^{[2
]}

Zhao, Wenjie ^{[1
]}

Nie, Binling ^{[3
]}

Han, Bo ^{[1
]}

机构：

[1] Zhejiang Univ, Sch Aeronaut & Astronaut, Hangzhou 310000, Zhejiang, Peoples R China

[2] Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou 310000, Zhejiang, Peoples R China

[3] Hangzhou Dianzi Univ, Hangzhou 310000, Zhejiang, Peoples R China

来源：

KNOWLEDGE-BASED SYSTEMS | 2023年 / 259卷

基金：

中国国家自然科学基金;

关键词：

3D object detection; Point cloud; Transformer; Graph structure learning; Automatic driving;

D O I：

10.1016/j.knosys.2022.110085

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

LiDAR-based 3D detection is critical in autonomous driving perception systems. However, point-based 3D object detection that directly learns from point clouds is challenging owing to the sparsity and irregularity of LiDAR point clouds. Existing point-based methods are limited by fixed local relationships and the sparsity of distant and occluded objects. To address these issues, we propose a dynamic graph transformer 3D object detection network (DGT-Det3D) based on a dynamic graph transformer (DGT) module and a proposal-aware fusion (PAF) module. The DGT module is built on a dynamic graph and graph-aware self-attention module, which adaptively concentrates on the foreground points and encodes the graph to capture long-range dependencies. With the DGT module, DGT-Det3D has better capability to detect distant and occluded objects. To further refine the proposals, our PAF module fully integrates the proposal-aware spatial information and combines it with the point-wise semantic features from the first stage. Extensive experiments on the KITTI dataset demonstrate that our approach achieves state-of-the-art accuracy for point-based methods. In addition, DGT brings significant improvements when combined with state-of-the-art methods on the Waymo open dataset.(c) 2022 Elsevier B.V. All rights reserved.

引用

页数：11

共 50 条

[41] Temporal-Channel Transformer for 3D Lidar-Based Video Object Detection for Autonomous Driving
Yuan, Zhenxun
Song, Xiao
Bai, Lei
Wang, Zhe
Ouyang, Wanli
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (04) : 2068 - 2078
[42] DS-Trans: A 3D Object Detection Method Based on a Deformable Spatiotemporal Transformer for Autonomous Vehicles
Zhu, Yuan
Xu, Ruidong
Tao, Chongben
An, Hao
Wang, Huaide
Sun, Zhipeng
Lu, Ke
REMOTE SENSING, 2024, 16 (09)
[43] 3D Object Detection Based on LiDAR Data
Sahba, Ramin
Sahba, Amin
Jamshidi, Mo
Rad, Paul
2019 IEEE 10TH ANNUAL UBIQUITOUS COMPUTING, ELECTRONICS & MOBILE COMMUNICATION CONFERENCE (UEMCON), 2019, : 511 - 514
[44] Improved Two-Stage 3D Object Detection Algorithm for Roadside Scenes with Enhanced PointPillars and Transformer
Wang Liangzi
Huang Miaohua
Liu Ruoying
Bi Chengcheng
Hu Yongkang
LASER & OPTOELECTRONICS PROGRESS, 2024, 61 (18)
[45] Improved 3D Object Detection Based on PointPillars
Kong, Weiwei
Du, Yusheng
He, Leilei
Li, Zejiang
ELECTRONICS, 2024, 13 (15)
[46] 3D Fast Object Detection Based on Discriminant Images and Dynamic Distance Threshold Clustering
Chen, Baifan
Chen, Hong
Yuan, Dian
Yu, Lingli
SENSORS, 2020, 20 (24) : 1 - 19
[47] Towards Accurate Microstructure Estimation via 3D Hybrid Graph Transformer
Yang, Junqing
Jiang, Haotian
Tassew, Tewodros
Sun, Peng
Ma, Jiquan
Xia, Yong
Yap, Pew-Thian
Chen, Geng
MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT VIII, 2023, 14227 : 25 - 34
[48] 3D Siamese Transformer Network for Single Object Tracking on Point Clouds
Hui, Le
Wang, Lingpeng
Tang, Linghua
Lan, Kaihao
Xie, Jin
Yang, Jian
COMPUTER VISION - ECCV 2022, PT II, 2022, 13662 : 293 - 310
[49] Multi-Correlation Siamese Transformer Network With Dense Connection for 3D Single Object Tracking
Feng, Shihao
Liang, Pengpeng
Gao, Jin
Cheng, Erkang
IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (12) : 8066 - 8073
[50] DTSSD: Dual-Channel Transformer-Based Network for Point-Based 3D Object Detection
Zheng, Zhijie
Huang, Zhicong
Zhao, Jingwen
Hu, Haifeng
Chen, Dihu
IEEE SIGNAL PROCESSING LETTERS, 2023, 30 : 798 - 802

← 1 2 3 4 5 →