A motion model based on recurrent neural networks for visual object tracking

被引：14

作者：

Shahbazi, Mohammad ^{[2
]}

Bayat, Mohammad Hosein ^{[1
]}

Tarvirdizadeh, Bahram ^{[1
]}

机构：

[1] Univ Tehran, Fac New Sci & Technol, Dept Mechatron Engn, Adv Serv Robots ASR Lab, Tehran, Iran

[2] Iran Univ Sci & Technol, Sch Mech Engn, Tehran, Iran

来源：

IMAGE AND VISION COMPUTING | 2022年 / 126卷

关键词：

Single -object tracking; Motion model; Long short-term memory; Recurrent neural network; SIAMESE;

D O I：

10.1016/j.imavis.2022.104533

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Object tracking algorithms typically leverage on either or both appearance and motion features of target(s). It is common in multi-object tracking to use both features, whereas the role of motion features in single-object trackers has less been explored. Based on the Long Short-Term Memory (LSTM) architecture of recurrent neural networks, we train a novel motion model to be incorporated into the off-the-shelf single-object trackers. The developed model predicts the target location in each frame based on the history of processed motion features in a few prior frames. This aids the tracking algorithm in dynamically updating the search region location, as apposed to static or probabilistic region settings. We incorporate the model into three state-of-the-art CNN-based trackers, namely GOTURN, SiamFC, and DiMP and illustrate the tracking performance enhancements on popular benchmarks. Significant improvements are achieved specially on the sequences rendering challeng-ing situations such as Low Resolution, Out-of-Plane Rotation, Motion Blur, Fast Motion, and Occlusion. The motion model has a low computational cost and complies with the real-time execution of the base trackers.(c) 2022 Elsevier B.V. All rights reserved.

引用

页数：12

共 31 条

[1]

Babaee M, 2018, IEEE IMAGE PROC, P2715, DOI 10.1109/ICIP.2018.8451140

[2]

Babenko B, 2009, PROC CVPR IEEE, P983, DOI 10.1109/CVPRW.2009.5206737

[3] Fully-Convolutional Siamese Networks for Object Tracking [J].

Bertinetto, Luca ;

Valmadre, Jack ;

Henriques, Joao F. ;

Vedaldi, Andrea ;

Torr, Philip H. S. .

COMPUTER VISION - ECCV 2016 WORKSHOPS, PT II, 2016, 9914 :850-865

[4]

Bewley A, 2016, IEEE IMAGE PROC, P3464, DOI 10.1109/ICIP.2016.7533003

[5] Know Your Surroundings: Exploiting Scene Information for Object Tracking [J].

Bhat, Goutam ;

Danelljan, Martin ;

Van Gool, Luc ;

Timofte, Radu .

COMPUTER VISION - ECCV 2020, PT XXIII, 2020, 12368 :205-221

[6] Learning Discriminative Model Prediction for Tracking [J].

Bhat, Goutam ;

Danelljan, Martin ;

Van Gool, Luc ;

Timofte, Radu .

2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :6181-6190

[7] Online Multi-Object Tracking Using CNN-based Single Object Tracker with Spatial-Temporal Attention Mechanism [J].

Chu, Qi ;

Ouyang, Wanli ;

Li, Hongsheng ;

Wang, Xiaogang ;

Liu, Bin ;

Yu, Nenghai .

2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :4846-4855

[8] ATOM: Accurate Tracking by Overlap Maximization [J].

Danelljan, Martin ;

Bhat, Goutam ;

Khan, Fahad Shahbaz ;

Felsberg, Michael .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :4655-4664

[9] Learning to Track at 100 FPS with Deep Regression Networks [J].

Held, David ;

Thrun, Sebastian ;

Savarese, Silvio .

COMPUTER VISION - ECCV 2016, PT I, 2016, 9905 :749-765

[10] Exploiting the Circulant Structure of Tracking-by-Detection with Kernels [J].

Henriques, Joao F. ;

Caseiro, Rui ;

Martins, Pedro ;

Batista, Jorge .

COMPUTER VISION - ECCV 2012, PT IV, 2012, 7575 :702-715

← 1 2 3 4 →