Deep Association: End-to-end Graph-Based Learning for Multiple Object Tracking with Conv-Graph Neural Network

被引:28
|
作者
Ma, Cong [1 ]
Li, Yuan [1 ]
Yang, Fan [1 ]
Zhang, Ziwei [1 ]
Zhuang, Yueqing [1 ]
Jia, Huizhu [1 ]
Xie, Xiaodong [1 ]
机构
[1] Peking Univ, Natl Engn Lab Video Technol, Beijing, Peoples R China
来源
ICMR'19: PROCEEDINGS OF THE 2019 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL | 2019年
关键词
Surveillance Retrieval; Computer Vision; Multiple Object Tracking; Deep Association; Graph Neural Network Deep Learning;
D O I
10.1145/3323873.3325010
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Multiple Object Tracking (MOT) has a wide range of applications in surveillance retrieval and autonomous driving. The majority of existing methods focus on extracting features by deep learning and hand-crafted optimizing bipartite graph or network flow. In this paper, we proposed an efficient end-to-end model, Deep Association Network (DAN), to learn the graph-based training data, which are constructed by spatial-temporal interaction of objects. DAN combines Convolutional Neural Network (CNN), Motion Encoder (ME) and Graph Neural Network (GNN). The CNNs and Motion Encoders extract appearance features from bounding box images and motion features from positions respectively, and then the GNN optimizes graph structure to associate the same object among frames together. In addition, we presented a novel end-to-end training strategy for Deep Association Network. Our experimental results demonstrate the effectiveness of DAN up to the state-of-the-art methods without extra-dataset on MOT16 and DukeMTMCT.
引用
收藏
页码:253 / 261
页数:9
相关论文
共 50 条
  • [31] End-to-End Single Shot Detector Using Graph-Based Learnable Duplicate Removal
    Ding, Shuxiao
    Rehder, Eike
    Schneider, Lukas
    Cordts, Marius
    Gall, Juergen
    PATTERN RECOGNITION, DAGM GCPR 2022, 2022, 13485 : 375 - 389
  • [32] End-to-end masked graph-based CRF for joint slot filling and intent detection
    Tang, Hao
    Ji, Donghong
    Zhou, Qiji
    NEUROCOMPUTING, 2020, 413 (413) : 348 - 359
  • [33] Robust Object Tracking With Discrete Graph-Based Multiple Experts
    Li, Jiatong
    Deng, Chenwei
    Da Xu, Richard Yi
    Tao, Dacheng
    Zhao, Baojun
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2017, 26 (06) : 2736 - 2750
  • [34] A Survey of Intelligent End-to-End Networking Solutions: Integrating Graph Neural Networks and Deep Reinforcement Learning Approaches
    Tam, Prohim
    Ros, Seyha
    Song, Inseok
    Kang, Seungwoo
    Kim, Seokhoon
    ELECTRONICS, 2024, 13 (05)
  • [35] MPNET: An End-to-End Deep Neural Network for Object Detection in Surveillance Video
    Wang, Hanyu
    Wang, Ping
    Qian, Xueming
    IEEE ACCESS, 2018, 6 : 30296 - 30308
  • [36] End-To-End Deep Neural Network for Salient Object Detection in Complex Environments
    Wang, Yu
    Wang, Zhiteng
    JOVE-JOURNAL OF VISUALIZED EXPERIMENTS, 2023, (202):
  • [37] DeepChess :End-to-End Deep Neural Network for Automatic Learning in Chess
    David, Omid E.
    Netanyahu, Nathan S.
    Wolf, Lior
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2016, PT II, 2016, 9887 : 88 - 96
  • [38] End-to-end heterogeneous graph neural networks for traffic assignment
    Liu, Tong
    Meidani, Hadi
    TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2024, 165
  • [39] A deep learning network based end-to-end image composition
    Zhu, Xiaoyu
    Wang, Haodi
    Zhang, Zhiyi
    Wu, Xiuping
    Guo, Junqi
    Wu, Hao
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2022, 101
  • [40] End-to-end deep speaker embedding learning using multi-scale attentional fusion and graph neural networks
    Kashani, Hamidreza Baradaran
    Jazmi, Siyavash
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 222