IMPROVED REAL-TIME VISUAL TRACKING VIA ADVERSARIAL LEARNING

被引:0
作者
Zhong, Haoxiang [1 ]
Yan, Xiyu [1 ]
Jiang, Yong [1 ,2 ]
Xia, Shu-Tao [1 ,2 ]
机构
[1] Tsinghua Univ, Tsinghua Shenzhen Int Grad Sch, Shenzhen, Peoples R China
[2] Peng Cheng Lab, PCL Res Ctr Networks & Commun, Shenzhen, Peoples R China
来源
2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING | 2020年
基金
中国国家自然科学基金;
关键词
Visual tracking; Deep learning; Adversarial learning; Real-time tracking;
D O I
10.1109/icassp40776.2020.9053709
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The high accuracy and fast running speed are two goals of tracking algorithms. In recent years, many trackers based on deep learning have improved in both aspects respectively, but it is difficult to balance these two indicators. For example, a real-time tracking algorithm named RT-MDNet has greatly increased speed on the MDNet, but the accuracy is still limited to some extent. On the contrary, a state-of-the-art visual tracker based on adversarial learning named VITAL achieves a significant improvement in performance by alleviating the imbalance between positive and negative samples of tracking data. However, its running speed is seriously limited. In this paper, we attempt to combine the advantages from both methods and propose an improved real-time visual tracking algorithm via adversarial learning to get a more balanced result in accuracy and tracking speed. Specifically, we base on the framework of RT-MDNet and introduce a random feature map masking with adversarial learning to improve the quality of feature maps. Experiments on OTB2015 show our algorithm runs at 17 FPS with a precision of 87.6%.
引用
收藏
页码:1853 / 1857
页数:5
相关论文
共 20 条
[1]  
[Anonymous], 2017, P INT C LEARN REPR
[2]   Fully-Convolutional Siamese Networks for Object Tracking [J].
Bertinetto, Luca ;
Valmadre, Jack ;
Henriques, Joao F. ;
Vedaldi, Andrea ;
Torr, Philip H. S. .
COMPUTER VISION - ECCV 2016 WORKSHOPS, PT II, 2016, 9914 :850-865
[3]  
Brock Andrew, 2018, P 7 ACM INT S PERV D, DOI DOI 10.1145/3205873.3205877
[4]   StarGAN: Unified Generative Adversarial Networks for Multi-Domain Image-to-Image Translation [J].
Choi, Yunjey ;
Choi, Minje ;
Kim, Munyoung ;
Ha, Jung-Woo ;
Kim, Sunghun ;
Choo, Jaegul .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :8789-8797
[5]   SANet: Structure-Aware Network for Visual Tracking [J].
Fan, Heng ;
Ling, Haibin .
2017 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2017, :2217-2224
[6]  
Goodfellow IJ, 2014, ADV NEUR IN, V27, P2672
[7]   High-Speed Tracking with Kernelized Correlation Filters [J].
Henriques, Joao F. ;
Caseiro, Rui ;
Martins, Pedro ;
Batista, Jorge .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2015, 37 (03) :583-596
[8]  
Jung Ilchae, 2018, EUR C COMP VIS ECCV
[9]   A Novel Performance Evaluation Methodology for Single-Target Trackers [J].
Kristan, Matej ;
Matas, Jiri ;
Leonardis, Ales ;
Vojir, Tomas ;
Pflugfelder, Roman ;
Fernandez, Gustavo ;
Nebehay, Georg ;
Porikli, Fatih ;
Cehovin, Luka .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2016, 38 (11) :2137-2155
[10]  
Mirza, 2014, ARXIV