Spatial Reliability Enhanced Correlation Filter: An Efficient Approach for Real-Time UAV Tracking

被引:19
作者
Fu, Changhong [1 ]
Jin, Jin [2 ]
Ding, Fangqiang [3 ]
Li, Yiming [4 ]
Lu, Geng [5 ]
机构
[1] Tongji Univ, Sch Mech Engn, Shanghai 201804, Peoples R China
[2] Swiss Fed Inst Technol, Dept Management Technol & Econ, Zurich, Switzerland
[3] Univ Edinburgh, Sch Informat, Edinburgh EH8 9YL, Scotland
[4] NYU, Tandon Sch Engn, New York, NY 11201 USA
[5] Tsinghua Univ, Dept Automat, Beijing 100084, Peoples R China
基金
上海市自然科学基金; 中国国家自然科学基金;
关键词
Target tracking; Reliability; Correlation; Real-time systems; Unmanned aerial vehicles; Training; Robustness; UAV tracking; spatial reliability map; response pool; label adjustment; correlation filter; appearance variation; VISUAL TRACKING; OBJECT TRACKING; ROBUST;
D O I
10.1109/TMM.2021.3118891
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Traditional discriminative correlation filter (DCF) has received great popularity due to its high computational efficiency. However, the lightweight framework of DCF cannot promise robust performance when the tracker faces appearance variations within the background. These unpredictable appearance variations always distract the filter. Most existing DCF-based trackers either utilize deep convolutional features or incorporate additional constraints to elevate tracking robustness. Despite some improvements, both of them hamper the tracking speed and can only roughly alleviate the distractions of appearance variations. In this paper, a novel spatial reliability enhanced learning strategy is proposed to handle the problems aforementioned. By monitoring the variation of response produced in detection phase, a dynamic reliability map is generated to indicate the reliability of each background subregion. Then, label adjustment is conducted to repress the distractions of these unreliable areas. Compared with the conventional way of constraint where a new term is always added to realize the desired goal, label adjustment is simultaneously more efficient and effective. Moreover, to promise the accuracy and dependability of the reliability map, an adaptively updated response pool recording reliable historical response values is proposed. Extensive and exhaustive experiments on three challenging unmanned aerial vehicle (UAV) benchmarks, i.e., UAV123@10fps, DTB70 and UAVDT, which totally include 243 video sequences, validate the superiority of the proposed method against other state-of-the-art trackers and exhibit a remarkable generality in a variety of scenarios. Meanwhile, the tracking speed of 65.2FPS on a cheap CPU makes it suitable for real-time UAV applications.
引用
收藏
页码:4123 / 4137
页数:15
相关论文
共 52 条
[1]   2D Pose-Based Real-Time Human Action Recognition With Occlusion-Handling [J].
Angelini, Federico ;
Fu, Zeyu ;
Long, Yang ;
Shao, Ling ;
Naqvi, Syed Mohsen .
IEEE TRANSACTIONS ON MULTIMEDIA, 2020, 22 (06) :1433-1446
[2]   Robust Object Tracking with Online Multiple Instance Learning [J].
Babenko, Boris ;
Yang, Ming-Hsuan ;
Belongie, Serge .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2011, 33 (08) :1619-1632
[3]  
Bao CL, 2012, PROC CVPR IEEE, P1830, DOI 10.1109/CVPR.2012.6247881
[4]   Pose-Guided Tracking-by-Detection: Robust Multi-Person Pose Tracking [J].
Bao, Qian ;
Liu, Wu ;
Cheng, Yuhao ;
Zhou, Boyan ;
Mei, Tao .
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 :161-175
[5]   Staple: Complementary Learners for Real-Time Tracking [J].
Bertinetto, Luca ;
Valmadre, Jack ;
Golodetz, Stuart ;
Miksik, Ondrej ;
Torr, Philip H. S. .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :1401-1409
[6]   Fully-Convolutional Siamese Networks for Object Tracking [J].
Bertinetto, Luca ;
Valmadre, Jack ;
Henriques, Joao F. ;
Vedaldi, Andrea ;
Torr, Philip H. S. .
COMPUTER VISION - ECCV 2016 WORKSHOPS, PT II, 2016, 9914 :850-865
[7]  
Bolme DS, 2010, PROC CVPR IEEE, P2544, DOI 10.1109/CVPR.2010.5539960
[8]   Visual Tracking via Adaptive Spatially-Regularized Correlation Filters [J].
Dai, Kenan ;
Wang, Dong ;
Lu, Huchuan ;
Sun, Chong ;
Li, Jianhua .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :4665-4674
[9]   Histograms of oriented gradients for human detection [J].
Dalal, N ;
Triggs, B .
2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, :886-893
[10]  
Danelljan, 2014, BRIT MACH VIS C NOTT, DOI [10.5244/C.28.65, DOI 10.5244/C.28.65]