3D Multi-Object Tracking Using Graph Neural Networks With Cross-Edge Modality Attention

被引：7

作者：

Buechner, Martin ^{[1
]}

Valada, Abhinav ^{[1
]}

机构：

[1] Univ Freiburg, Dept Comp Sci, D-79100 Freiburg, Germany

来源：

IEEE ROBOTICS AND AUTOMATION LETTERS | 2022年 / 7卷 / 04期

基金：

欧盟地平线“2020”;

关键词：

Computer vision; graph neural networks; autonomous driving;

D O I：

10.1109/LRA.2022.3191558

中图分类号：

TP24 [机器人技术];

学科分类号：

080202 ; 1405 ;

摘要：

Online 3D multi-object tracking (MOT) has witnessed significant research interest in recent years, largely driven by demand from the autonomous systems community. However, 3D offline MOT is relatively less explored. Labeling 3D trajectory scene data at a large scale while not relying on high-cost human experts is still an open research question. In this work, we propose Batch3DMOT which follows the tracking-by-detection paradigm and represents real-world scenes as directed, acyclic, and category-disjoint tracking graphs that are attributed using various modalities such as camera, LiDAR, and radar. We present a multi-modal graph neural network that uses a cross-edge attention mechanism mitigating modality intermittence, which translates into sparsity in the graph domain. Additionally, we present attention-weighted convolutions over frame-wise k-NN neighborhoods as suitable means to allow information exchange across disconnected graph components. We evaluate our approach using various sensor modalities and model configurations on the challenging nuScenes and KITTI datasets. Extensive experiments demonstrate that our proposed approach yields an overall improvement of 3.3% in the AMOTA score on nuScenes thereby setting the new state-of-the-art for 3D tracking and further enhancing false positive filtering.

引用

页码：9707 / 9714

页数：8

共 50 条

[31] 2D Histology Meets 3D Topology: Cytoarchitectonic Brain Mapping with Graph Neural Networks
Schiffer, Christian
Harmeling, Stefan
Amunts, Katrin
Dickscheid, Timo
MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT VIII, 2021, 12908 : 395 - 404
[32] Spatial-Temporal Graph Enhanced DETR Towards Multi-Frame 3D Object Detection
Zhang, Yifan
Zhu, Zhiyu
Hou, Junhui
Wu, Dapeng
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (12) : 10614 - 10628
[33] CWGA-Net: Center-Weighted Graph Attention Network for 3D object detection from point clouds
Shu, Jun
Wu, Qi
Tan, Liang
Shu, Xinyi
Wan, Fengchun
IMAGE AND VISION COMPUTING, 2024, 152
[34] Recognition of Holoscopic 3D Video Hand Gesture Using Convolutional Neural Networks
Alnaim, Norah
Abbod, Maysam
Swash, Rafiq
TECHNOLOGIES, 2020, 8 (02)
[35] MLF-DET: Multi-Level Fusion for Cross-Modal 3D Object Detection
Lin, Zewei
Shen, Yanqing
Zhou, Sanping
Chen, Shitao
Zheng, Nanning
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT VII, 2023, 14260 : 136 - 149
[36] Deciphering the contribution of 3D interactions between cis-regulatory elements and promoters to regulate gene expression using graph neural networks
Chen, Yang
Lei, Elissa P.
14TH ACM CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY, AND HEALTH INFORMATICS, BCB 2023, 2023,
[37] SpatialDETR: Robust Scalable Transformer-Based 3D Object Detection From Multi-view Camera Images With Global Cross-Sensor Attention
Doll, Simon
Schulz, Richard
Schneider, Lukas
Benzin, Viviane
Enzweiler, Markus
Lensch, Hendrik P. A.
COMPUTER VISION, ECCV 2022, PT XXXIX, 2022, 13699 : 230 - 245
[38] Generalisable 3D printing error detection and correction via multi-head neural networks
Brion, Douglas A. J.
Pattinson, Sebastian W.
NATURE COMMUNICATIONS, 2022, 13 (01)
[39] Quasi/Periodic Noise Reduction in Images Using Modified Multiresolution-Convolutional Neural Networks for 3D Object Reconstructions and Comparison with Other Convolutional Neural Network Models
Espinosa-Bernal, Osmar Antonio
Pedraza-Ortega, Jesus Carlos
Aceves-Fernandez, Marco Antonio
Martinez-Suarez, Victor Manuel
Tovar-Arriaga, Saul
Ramos-Arreguin, Juan Manuel
Gorrostieta-Hurtado, Efren
COMPUTERS, 2024, 13 (06)
[40] Bi-Att3DDet: Attention-Based Bi-Directional Fusion for Multi-Modal 3D Object Detection
Gao, Xu
Zhao, Yaqian
Wang, Yanan
Shang, Jiandong
Zhang, Chunmin
Wu, Gang
SENSORS, 2025, 25 (03)

← 1 2 3 4 5 →