Online object tracking via motion-guided convolutional neural network (MGNet)

被引:11
作者
Gan, Weihao [1 ]
Lee, Ming-Sui [2 ]
Wu, Chi-hao [1 ]
Kuo, C. -C. [1 ]
机构
[1] Univ Southern Calif, Los Angeles, CA 90007 USA
[2] Natl Taiwan Univ, Taipei, Taiwan
关键词
Object tracking; Online tracking; Convolutional neural network; Optical flow; Multi-domain learning; ROBUST VISUAL TRACKING;
D O I
10.1016/j.jvcir.2018.03.016
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Tracking-by-detection (TBD) is widely used in visual object tracking. However, many TBD-based methods ignore the strong motion correlation between current and previous frames. In this work, a motion-guided convolutional neural network (MGNet) solution to online object tracking is proposed. The MGNet tracker is built upon the multi-domain convolutional neural network with two innovations: (1) a motion-guided candidate selection (MCS) scheme based on a dynamic prediction model is proposed to accurately and efficiently generate the candidate regions and (2) the spatial RGB and temporal optical flow are combined as inputs and processed in an unified end-to-end trained network, rather than a two-branch processing network. We compare the performance of the MGNet, the MDNet and several state-of-the-art online object trackers on the OTB and the VOT benchmark datasets, and demonstrate that the temporal correlation between any two consecutive frames in videos can be more effectively captured by the MGNet via extensive performance evaluation.
引用
收藏
页码:180 / 191
页数:12
相关论文
共 59 条
[1]   Deformable part-based tracking by coupled global and local correlation filters [J].
Akin, Osman ;
Erdem, Erkut ;
Erdem, Aykut ;
Mikolajczyk, Krystian .
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2016, 38 :763-774
[2]  
[Anonymous], 2016, ARXIV PREPRINT ARXIV
[3]  
[Anonymous], 2016, CVPR CVPR
[4]  
[Anonymous], 2017, ARXIV170406036
[5]  
[Anonymous], 2014, P BRIT MACH VIS C
[6]   Ensemble tracking [J].
Avidan, Shai .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2007, 29 (02) :261-271
[7]  
Babenko B, 2009, PROC CVPR IEEE, P983, DOI 10.1109/CVPRW.2009.5206737
[8]  
Blaschko MB, 2008, LECT NOTES COMPUT SC, V5302, P2, DOI 10.1007/978-3-540-88682-2_2
[9]  
Danelljan M., 2016, ARXIV161109224
[10]   Learning Spatially Regularized Correlation Filters for Visual Tracking [J].
Danelljan, Martin ;
Hager, Gustav ;
Khan, Fahad Shahbaz ;
Felsberg, Michael .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :4310-4318