3D Multi-Object Tracking Using Graph Neural Networks With Cross-Edge Modality Attention

被引:7
作者
Buechner, Martin [1 ]
Valada, Abhinav [1 ]
机构
[1] Univ Freiburg, Dept Comp Sci, D-79100 Freiburg, Germany
基金
欧盟地平线“2020”;
关键词
Computer vision; graph neural networks; autonomous driving;
D O I
10.1109/LRA.2022.3191558
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Online 3D multi-object tracking (MOT) has witnessed significant research interest in recent years, largely driven by demand from the autonomous systems community. However, 3D offline MOT is relatively less explored. Labeling 3D trajectory scene data at a large scale while not relying on high-cost human experts is still an open research question. In this work, we propose Batch3DMOT which follows the tracking-by-detection paradigm and represents real-world scenes as directed, acyclic, and category-disjoint tracking graphs that are attributed using various modalities such as camera, LiDAR, and radar. We present a multi-modal graph neural network that uses a cross-edge attention mechanism mitigating modality intermittence, which translates into sparsity in the graph domain. Additionally, we present attention-weighted convolutions over frame-wise k-NN neighborhoods as suitable means to allow information exchange across disconnected graph components. We evaluate our approach using various sensor modalities and model configurations on the challenging nuScenes and KITTI datasets. Extensive experiments demonstrate that our proposed approach yields an overall improvement of 3.3% in the AMOTA score on nuScenes thereby setting the new state-of-the-art for 3D tracking and further enhancing false positive filtering.
引用
收藏
页码:9707 / 9714
页数:8
相关论文
共 50 条
  • [31] 2D Histology Meets 3D Topology: Cytoarchitectonic Brain Mapping with Graph Neural Networks
    Schiffer, Christian
    Harmeling, Stefan
    Amunts, Katrin
    Dickscheid, Timo
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT VIII, 2021, 12908 : 395 - 404
  • [32] Spatial-Temporal Graph Enhanced DETR Towards Multi-Frame 3D Object Detection
    Zhang, Yifan
    Zhu, Zhiyu
    Hou, Junhui
    Wu, Dapeng
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (12) : 10614 - 10628
  • [33] CWGA-Net: Center-Weighted Graph Attention Network for 3D object detection from point clouds
    Shu, Jun
    Wu, Qi
    Tan, Liang
    Shu, Xinyi
    Wan, Fengchun
    IMAGE AND VISION COMPUTING, 2024, 152
  • [34] Recognition of Holoscopic 3D Video Hand Gesture Using Convolutional Neural Networks
    Alnaim, Norah
    Abbod, Maysam
    Swash, Rafiq
    TECHNOLOGIES, 2020, 8 (02)
  • [35] MLF-DET: Multi-Level Fusion for Cross-Modal 3D Object Detection
    Lin, Zewei
    Shen, Yanqing
    Zhou, Sanping
    Chen, Shitao
    Zheng, Nanning
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT VII, 2023, 14260 : 136 - 149
  • [36] Deciphering the contribution of 3D interactions between cis-regulatory elements and promoters to regulate gene expression using graph neural networks
    Chen, Yang
    Lei, Elissa P.
    14TH ACM CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY, AND HEALTH INFORMATICS, BCB 2023, 2023,
  • [37] SpatialDETR: Robust Scalable Transformer-Based 3D Object Detection From Multi-view Camera Images With Global Cross-Sensor Attention
    Doll, Simon
    Schulz, Richard
    Schneider, Lukas
    Benzin, Viviane
    Enzweiler, Markus
    Lensch, Hendrik P. A.
    COMPUTER VISION, ECCV 2022, PT XXXIX, 2022, 13699 : 230 - 245
  • [38] Generalisable 3D printing error detection and correction via multi-head neural networks
    Brion, Douglas A. J.
    Pattinson, Sebastian W.
    NATURE COMMUNICATIONS, 2022, 13 (01)
  • [39] Quasi/Periodic Noise Reduction in Images Using Modified Multiresolution-Convolutional Neural Networks for 3D Object Reconstructions and Comparison with Other Convolutional Neural Network Models
    Espinosa-Bernal, Osmar Antonio
    Pedraza-Ortega, Jesus Carlos
    Aceves-Fernandez, Marco Antonio
    Martinez-Suarez, Victor Manuel
    Tovar-Arriaga, Saul
    Ramos-Arreguin, Juan Manuel
    Gorrostieta-Hurtado, Efren
    COMPUTERS, 2024, 13 (06)
  • [40] Bi-Att3DDet: Attention-Based Bi-Directional Fusion for Multi-Modal 3D Object Detection
    Gao, Xu
    Zhao, Yaqian
    Wang, Yanan
    Shang, Jiandong
    Zhang, Chunmin
    Wu, Gang
    SENSORS, 2025, 25 (03)