Variable scale learning for visual object tracking

被引：0

作者：

Xuedong He

Lu Zhao

Calvin Yu-Chian Chen

机构：

[1] Sun Yat-Sen University,Artificial Intelligence Medical Center, School of Intelligent Systems Engineering

[2] The Sixth Affiliated Hospital,Department of Clinical Laboratory

[3] Sun Yat-Sen University,Department of Medical Research

[4] China Medical University Hospital,Department of Bioinformatics and Medical Engineering

[5] Asia University,undefined

来源：

Journal of Ambient Intelligence and Humanized Computing | 2023年 / 14卷

关键词：

Object tracking; Deep learning; Correlation filter; Scale estimation; Variable scale learning;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Recently, deep learning achieves competitive accuracy and robustness and dramatically improves the performance of target scale estimation through pre-trained special network branches. Yet, a fast and robust scale estimation method is still a challenging problem for visual object tracking. Early correlation filter tracking algorithm uses a multiscale search method to estimate the scale with the constant number of scale factors and invariant aspect ratio, which is redundant for the video frames with little or no scale change. Also, an independent network branch for target scale state is proposed, but the training network needs an abundance of datasets, and the effect is not very stable for the unseen target object. Aiming at the problems of existing scale estimation solutions, several variable scale learning methods are proposed to explore the scale change of the target. Firstly, we proposed a variable scale factor learning method, which makes us rid of the commonly used multiscale search with the flaws of fixed scale factors. Secondly, we used a multiscale aspect ratio solution to make up for invariant aspect ratio. Thirdly, the first and second scale methods were combined to propose a variable scale aspect ratio estimation method. Finally, the proposed scale estimation methods were embedded into the state-of-the-art ECO (Efficient Convolution Operators) and ATOM (Accurate Tracking by Overlap Maximization) trackers to replace the original scale methods for verifying the effectiveness of our proposed method. Extensive experiments on OTB100, UAV123, TC128 and LaSOT datasets demonstrate that the tracking performance can be improved effectively by using the proposed scale methods.

引用

页码：3315 / 3330

页数：15

共 71 条

[1]

Danelljan M(2017)Discriminative scale space tracking IEEE Trans Pattern Anal Mach Intell 39 1561-1575

[2]

Häger G(2015)High-speed tracking with kernelized correlation filters IEEE Trans Pattern Anal Mach Intell 37 583-596

[3]

Khan FS(2020)Multiple faces tracking using feature fusion and neural network in video Intell Autom Soft Comput 26 1549-1560

[4]

Felsberg M(2020)Robust visual tracking models designs through kernelized correlation filters Intell Autom Soft Comput 26 313-322

[5]

Henriques JF(2015)Encoding color information for visual tracking: Algorithms and benchmark IEEE Trans Image Process 24 5630-5644

[6]

Caseiro R(2019)Implementation system of human eye tracking algorithm based on fpga CMC-Comput Mat Contin 58 653-664

[7]

Martins P(2020)Fast: Fast and accurate scale estimation for tracking IEEE Signal Process Lett 27 161-165

[8]

Batista J(2021)Deep learning for visual tracking: A comprehensive survey IEEE Trans Intell Transp Syst 39 1137-1149

[9]

Hu B(2016)Faster r-cnn: Towards real-time object detection with region proposal networks IEEE Trans Pattern Anal Mach Intell 58 625-639

[10]

Zhao H(2019)An automated player detection and tracking in basketball game CMC-Comput Mat Contin 37 1834-1848

← 1 2 3 4 5 6 7 8 →