Learning Attentions: Residual Attentional Siamese Network for High Performance Online Visual Tracking

Cited by: 493
Authors
Wang, Qiang
Teng, Zhu
Xing, Junliang
Gao, Jin
Hu, Weiming
Maybank, Stephen
Source
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2018
DOI
10.1109/CVPR.2018.00510
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Offline training for object tracking has recently shown great potential in balancing tracking accuracy and speed. However, it is still difficult to adapt an offline-trained model to a target tracked online. This work presents a Residual Attentional Siamese Network (RASNet) for high-performance object tracking. The RASNet model reformulates the correlation filter within a Siamese tracking framework and introduces different kinds of attention mechanisms to adapt the model without updating it online. In particular, by exploiting the offline-trained general attention, the target-adapted residual attention, and the channel-favored feature attention, RASNet not only mitigates the over-fitting problem in deep network training, but also enhances its discriminative capacity and adaptability through the separation of representation learning and discriminator learning. The proposed deep architecture is trained end to end and takes full advantage of the rich spatial-temporal information to achieve robust visual tracking. Experimental results on two recent benchmarks, OTB-2015 and VOT2017, show that the RASNet tracker achieves state-of-the-art tracking accuracy while running at more than 80 frames per second.
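The abstract names three attention terms but gives no formulas, so the following is only a minimal NumPy sketch of how such terms could weight a Siamese cross-correlation. The function name, the array shapes, and the rule for combining the terms (general and residual spatial maps summed, then multiplied with per-channel weights) are illustrative assumptions, not the authors' implementation.

```python
import numpy as np


def weighted_cross_correlation(z_feat, x_feat, general_attn, residual_attn, channel_attn):
    """Attention-weighted cross-correlation (hypothetical sketch).

    z_feat:        (C, h, w) template (exemplar) features
    x_feat:        (C, H, W) search-region features, with H >= h and W >= w
    general_attn:  (h, w) spatial map shared across targets (assumed learned offline)
    residual_attn: (h, w) target-specific correction to the general map
    channel_attn:  (C,)   per-channel weights
    Returns a response map of shape (H - h + 1, W - w + 1).
    """
    C, h, w = z_feat.shape
    _, H, W = x_feat.shape

    # Assumption: the residual form is a simple sum of the shared and
    # target-specific spatial maps.
    spatial = general_attn + residual_attn

    # Weight the template spatially and per channel via broadcasting.
    weighted = z_feat * spatial[None, :, :] * channel_attn[:, None, None]

    # Slide the weighted template over the search features (valid positions only).
    response = np.zeros((H - h + 1, W - w + 1))
    for i in range(response.shape[0]):
        for j in range(response.shape[1]):
            response[i, j] = np.sum(weighted * x_feat[:, i:i + h, j:j + w])
    return response
```

With `general_attn` set to ones, `residual_attn` to zeros, and `channel_attn` to ones, the function reduces to the plain cross-correlation used by Siamese trackers. In this sketch only the template-side weights would change per target, which is one way to read the abstract's claim of adapting the model without updating it online.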
Pages: 4854-4863
Page count: 10