End-to-end multitask Siamese network with residual hierarchical attention for real-time object tracking

被引:0
作者
Wenhui Huang
Jason Gu
Xin Ma
Yibin Li
机构
[1] School of Information Science and Engineering at Shandong Normal University,
[2] Department of Electrical and Computer Engineering at Dalhousie University,undefined
[3] School of Control Science and Engineering at Shandong University,undefined
来源
Applied Intelligence | 2020年 / 50卷
关键词
Real-time object tracking; Deep networks; Attention mechanism; Correlation filters;
D O I
暂无
中图分类号
学科分类号
摘要
Object tracking with deep networks has recently achieved substantial improvement in terms of tracking performance. In this paper, we propose a multitask Siamese neural network that uses a residual hierarchical attention mechanism to achieve high-performance object tracking. This network is trained offline in an end-to-end manner, and it is capable of real-time tracking. To produce more efficient and generative attention-aware features, we propose residual hierarchical attention learning using residual skip connections in the attention module to receive hierarchical attention. Moreover, we formulate a multitask correlation filter layer to exploit the missing link between context awareness and regression target adaptation, and we insert this differentiable layer into a neural network to improve the discriminatory capability of the network. The results of experimental analyses conducted on the OTB, VOT and TColor-128 datasets, which contain various tracking scenarios, demonstrate the efficiency of our proposed real-time object-tracking network.
引用
收藏
页码:1908 / 1921
页数:13
相关论文
共 53 条
[1]  
Jiang H(2019)Effective use of convolutional neural networks and diverse deep supervision for better crowd counting Appl Intell 49 2415-2433
[2]  
Jin W(2019)Learning multi-temporal-scale deep information for action recognition Appl Intell 49 2017-2029
[3]  
Yao G(2017)Online learning of dynamic multi-view gallery for person re-identification Multimed Tools Appl 76 217-241
[4]  
Lei T(2018)Unsupervised multi-view feature extraction with dynamic graph learning J Vis Commun Image Represent 56 256-264
[5]  
Zhong J(2018)Classifying advertising video by topicalizing high-level semantic concepts Multimed Tools Appl 77 25475-25511
[6]  
Jiang P(2018)Correlation-filter based scale-adaptive visual tracking with hybrid-scheme sample learning IEEE Access 6 125-137
[7]  
Zhao Y(2018)Cross-media retrieval with collective deep semantic learning Multimed Tools Appl 77 22247-22266
[8]  
Xu Z(2017)Choroid segmentation from optical coherence tomography with graph edge weights learned from deep convolutional neural networks Neurocomputing 237 332-341
[9]  
Xiang Z(2016)Compressive sensing with weighted local classifiers for robot visual tracking Int J Robot Autom 31 416-427
[10]  
Liu Y(2015)High-speed tracking with kernelized correlation filters IEEE Trans Pattern Anal Mach Intell 37 583-596