Deep Association: End-to-end Graph-Based Learning for Multiple Object Tracking with Conv-Graph Neural Network

被引：28

作者：

Ma, Cong ^{[1
]}

Li, Yuan ^{[1
]}

Yang, Fan ^{[1
]}

Zhang, Ziwei ^{[1
]}

Zhuang, Yueqing ^{[1
]}

Jia, Huizhu ^{[1
]}

Xie, Xiaodong ^{[1
]}

机构：

[1] Peking Univ, Natl Engn Lab Video Technol, Beijing, Peoples R China

来源：

ICMR'19: PROCEEDINGS OF THE 2019 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL | 2019年

关键词：

Surveillance Retrieval; Computer Vision; Multiple Object Tracking; Deep Association; Graph Neural Network Deep Learning;

D O I：

10.1145/3323873.3325010

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

Multiple Object Tracking (MOT) has a wide range of applications in surveillance retrieval and autonomous driving. The majority of existing methods focus on extracting features by deep learning and hand-crafted optimizing bipartite graph or network flow. In this paper, we proposed an efficient end-to-end model, Deep Association Network (DAN), to learn the graph-based training data, which are constructed by spatial-temporal interaction of objects. DAN combines Convolutional Neural Network (CNN), Motion Encoder (ME) and Graph Neural Network (GNN). The CNNs and Motion Encoders extract appearance features from bounding box images and motion features from positions respectively, and then the GNN optimizes graph structure to associate the same object among frames together. In addition, we presented a novel end-to-end training strategy for Deep Association Network. Our experimental results demonstrate the effectiveness of DAN up to the state-of-the-art methods without extra-dataset on MOT16 and DukeMTMCT.

引用

页码：253 / 261

页数：9

共 50 条

[31] End-to-End Single Shot Detector Using Graph-Based Learnable Duplicate Removal
Ding, Shuxiao
Rehder, Eike
Schneider, Lukas
Cordts, Marius
Gall, Juergen
PATTERN RECOGNITION, DAGM GCPR 2022, 2022, 13485 : 375 - 389
[32] End-to-end masked graph-based CRF for joint slot filling and intent detection
Tang, Hao
Ji, Donghong
Zhou, Qiji
NEUROCOMPUTING, 2020, 413 (413) : 348 - 359
[33] Robust Object Tracking With Discrete Graph-Based Multiple Experts
Li, Jiatong
Deng, Chenwei
Da Xu, Richard Yi
Tao, Dacheng
Zhao, Baojun
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2017, 26 (06) : 2736 - 2750
[34] A Survey of Intelligent End-to-End Networking Solutions: Integrating Graph Neural Networks and Deep Reinforcement Learning Approaches
Tam, Prohim
Ros, Seyha
Song, Inseok
Kang, Seungwoo
Kim, Seokhoon
ELECTRONICS, 2024, 13 (05)
[35] MPNET: An End-to-End Deep Neural Network for Object Detection in Surveillance Video
Wang, Hanyu
Wang, Ping
Qian, Xueming
IEEE ACCESS, 2018, 6 : 30296 - 30308
[36] End-To-End Deep Neural Network for Salient Object Detection in Complex Environments
Wang, Yu
Wang, Zhiteng
JOVE-JOURNAL OF VISUALIZED EXPERIMENTS, 2023, (202):
[37] DeepChess :End-to-End Deep Neural Network for Automatic Learning in Chess
David, Omid E.
Netanyahu, Nathan S.
Wolf, Lior
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2016, PT II, 2016, 9887 : 88 - 96
[38] End-to-end heterogeneous graph neural networks for traffic assignment
Liu, Tong
Meidani, Hadi
TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2024, 165
[39] A deep learning network based end-to-end image composition
Zhu, Xiaoyu
Wang, Haodi
Zhang, Zhiyi
Wu, Xiuping
Guo, Junqi
Wu, Hao
SIGNAL PROCESSING-IMAGE COMMUNICATION, 2022, 101
[40] End-to-end deep speaker embedding learning using multi-scale attentional fusion and graph neural networks
Kashani, Hamidreza Baradaran
Jazmi, Siyavash
EXPERT SYSTEMS WITH APPLICATIONS, 2023, 222

← 1 2 3 4 5 →