Deep learning for multiple object tracking: a survey

被引:75
作者
Xu, Yingkun [1 ]
Zhou, Xiaolong [1 ]
Chen, Shengyong [1 ,2 ]
Li, Fenfen [3 ]
机构
[1] Zhejiang Univ Technol, Coll Comp Sci & Technol, Hangzhou, Zhejiang, Peoples R China
[2] Tianjin Univ Technol, Sch Comp Sci & Engn, Tianjin 300384, Peoples R China
[3] Huaiyin Inst Technol, Comp & Software Engn, Huaian, Peoples R China
基金
中国国家自然科学基金;
关键词
learning (artificial intelligence); object detection; object tracking; neural nets; multiple object tracking; representation learning; network modelling; deep learning theory; multiobject tracking methods; end-to-end deep network construction; deep network structures; tracking conditions; public benchmark test; description enhancement; VISUAL TRACKING; MULTITARGET TRACKING; NETWORK; MODEL;
D O I
10.1049/iet-cvi.2018.5598
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep learning has been proved effective in multiple object tracking, which confronts the difficulties of frequent occlusions, confusing appearance, in-and-out objects, and lack of enough labelled data. Recently, deep learning based multi-object tracking methods make a rapid progress from representation learning to network modelling due to the development of deep learning theory and benchmark setup. In this study, the authors summarise and analyse deep learning based multi-object tracking methods which are top-ranked in the public benchmark test. First, they investigate functionality of deep networks in these methods, and classify the methods into three categories as description enhancement using deep features, deep network embedding, and end-to-end deep network construction. Second, they review deep network structures in these methods, and detail the usage and training of these networks for multi-object tracking problem. Through experimental comparison of tracking results in the benchmarks in total and by group, they finally show the effectiveness of deep networks for tracking employed in different manners, and compare the advantages of these networks and their robustness under different tracking conditions. Moreover, they analyse the limitations of current methods, and draw some useful conclusions to facilitate the exploration of new directions for multi-object tracking.
引用
收藏
页码:355 / 368
页数:14
相关论文
共 109 条
[1]   People-tracking-by-detection and people-detection-by-tracking [J].
Andriluka, Mykhaylo ;
Roth, Stefan ;
Schiele, Bernt .
2008 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-12, 2008, :1873-1880
[2]  
[Anonymous], P 3 INT C LEARNING R
[3]  
[Anonymous], 2017, P IEEE C COMP VIS PA
[4]  
[Anonymous], 2016, ARXIV PREPRINT ARXIV
[5]  
[Anonymous], PROC CVPR IEEE
[6]  
[Anonymous], 2015, ARXIV150401942
[7]  
[Anonymous], 2017, COMMUN ACM, DOI DOI 10.1145/3065386
[8]  
[Anonymous], J COMPUTATIONAL VISU
[9]  
[Anonymous], 2018, ARXIV180203252
[10]  
[Anonymous], PROC CVPR IEEE