A motion model based on recurrent neural networks for visual object tracking

被引:14
作者
Shahbazi, Mohammad [2 ]
Bayat, Mohammad Hosein [1 ]
Tarvirdizadeh, Bahram [1 ]
机构
[1] Univ Tehran, Fac New Sci & Technol, Dept Mechatron Engn, Adv Serv Robots ASR Lab, Tehran, Iran
[2] Iran Univ Sci & Technol, Sch Mech Engn, Tehran, Iran
关键词
Single -object tracking; Motion model; Long short-term memory; Recurrent neural network; SIAMESE;
D O I
10.1016/j.imavis.2022.104533
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Object tracking algorithms typically leverage on either or both appearance and motion features of target(s). It is common in multi-object tracking to use both features, whereas the role of motion features in single-object trackers has less been explored. Based on the Long Short-Term Memory (LSTM) architecture of recurrent neural networks, we train a novel motion model to be incorporated into the off-the-shelf single-object trackers. The developed model predicts the target location in each frame based on the history of processed motion features in a few prior frames. This aids the tracking algorithm in dynamically updating the search region location, as apposed to static or probabilistic region settings. We incorporate the model into three state-of-the-art CNN-based trackers, namely GOTURN, SiamFC, and DiMP and illustrate the tracking performance enhancements on popular benchmarks. Significant improvements are achieved specially on the sequences rendering challeng-ing situations such as Low Resolution, Out-of-Plane Rotation, Motion Blur, Fast Motion, and Occlusion. The motion model has a low computational cost and complies with the real-time execution of the base trackers.(c) 2022 Elsevier B.V. All rights reserved.
引用
收藏
页数:12
相关论文
共 31 条
[1]  
Babaee M, 2018, IEEE IMAGE PROC, P2715, DOI 10.1109/ICIP.2018.8451140
[2]  
Babenko B, 2009, PROC CVPR IEEE, P983, DOI 10.1109/CVPRW.2009.5206737
[3]   Fully-Convolutional Siamese Networks for Object Tracking [J].
Bertinetto, Luca ;
Valmadre, Jack ;
Henriques, Joao F. ;
Vedaldi, Andrea ;
Torr, Philip H. S. .
COMPUTER VISION - ECCV 2016 WORKSHOPS, PT II, 2016, 9914 :850-865
[4]  
Bewley A, 2016, IEEE IMAGE PROC, P3464, DOI 10.1109/ICIP.2016.7533003
[5]   Know Your Surroundings: Exploiting Scene Information for Object Tracking [J].
Bhat, Goutam ;
Danelljan, Martin ;
Van Gool, Luc ;
Timofte, Radu .
COMPUTER VISION - ECCV 2020, PT XXIII, 2020, 12368 :205-221
[6]   Learning Discriminative Model Prediction for Tracking [J].
Bhat, Goutam ;
Danelljan, Martin ;
Van Gool, Luc ;
Timofte, Radu .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :6181-6190
[7]   Online Multi-Object Tracking Using CNN-based Single Object Tracker with Spatial-Temporal Attention Mechanism [J].
Chu, Qi ;
Ouyang, Wanli ;
Li, Hongsheng ;
Wang, Xiaogang ;
Liu, Bin ;
Yu, Nenghai .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :4846-4855
[8]   ATOM: Accurate Tracking by Overlap Maximization [J].
Danelljan, Martin ;
Bhat, Goutam ;
Khan, Fahad Shahbaz ;
Felsberg, Michael .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :4655-4664
[9]   Learning to Track at 100 FPS with Deep Regression Networks [J].
Held, David ;
Thrun, Sebastian ;
Savarese, Silvio .
COMPUTER VISION - ECCV 2016, PT I, 2016, 9905 :749-765
[10]   Exploiting the Circulant Structure of Tracking-by-Detection with Kernels [J].
Henriques, Joao F. ;
Caseiro, Rui ;
Martins, Pedro ;
Batista, Jorge .
COMPUTER VISION - ECCV 2012, PT IV, 2012, 7575 :702-715