Visual Tracking with Attentional Convolutional Siamese Networks

Cited by: 0
Authors
Tan, Ke [1, 2]
Wei, Zhenzhong [1, 2]
Affiliations
[1] Beihang Univ, Sch Instrumentat & Optoelect Engn, Beijing, Peoples R China
[2] Minist Educ, Key Lab Precis Optomechatron Technol, Beijing, Peoples R China
Source
IMAGE AND GRAPHICS, ICIG 2019, PT I | 2019 / Vol. 11901
Keywords
Visual tracking; Siamese networks; Visual attention; Object tracking
DOI
10.1007/978-3-030-34120-6_30
Chinese Library Classification
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
Recently, Siamese trackers have drawn great attention for their accuracy and speed. To further improve the discriminability of Siamese networks for visual tracking, deeper networks such as VGG and ResNet have been exploited as backbones. However, high-level semantic information reduces localization discrimination. In this paper, we propose novel Attentional Convolutional Siamese Networks for visual tracking (ACST), which improve the classical AlexNet backbone by fusing spatial and channel attention during feature learning. Moreover, we propose a response-based weighted sampling strategy for training that strengthens the tracker's ability to distinguish between objects with similar attributes. Owing to the efficiency of the cross-correlation operator, our tracker can be trained end-to-end and runs in real time at inference. We validate our tracker through extensive experiments on OTB2013 and OTB2015; the results show that it obtains great improvements over other Siamese trackers.
Pages: 369-380
Page count: 12