End-to-end wavelet block feature purification network for efficient and effective UAV object tracking

被引:1
作者
Wang, Haijun [1 ]
Qi, Lihua [1 ]
Qu, Haoyu
Ma, Wenlai [1 ,2 ]
Yuan, Wei [1 ]
Hao, Wei [1 ]
机构
[1] Binzhou Univ, Aviat Informat Technol Res & Dev, Binzhou, Peoples R China
[2] Nanjing Univ Aeronaut & Astronaut, Coll Civil Aviat, Nanjing, Peoples R China
基金
中国国家自然科学基金;
关键词
Wavelet; Transformer; Unmanned aerial vehicle; Self-attention learning; Downsampling-upsampling strategy; CORRELATION FILTER;
D O I
10.1016/j.jvcir.2023.103950
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Recently, unmanned aerial vehicle (UAV) object tracking tasks have significantly improved with the emergence of deep learning. However, owing to the object feature pollution caused by motion blur, illumination variation, and occlusion, most of the existing trackers often fail to precisely localize the target in the complex real -world circumstances. To overcome this challenge, we present a novel wavelet block feature purification network (WFPN) for efficient and effective UAV tracking. WFPN is mainly composed of downsampling network through wavelet transforms and upsampling network through inverse wavelet transforms. To be specific, the downsampling network performs discrete wavelet transform (DWT) to reduce interference information and preserve original feature details, while the upsampling network applies inverse DWT (IDWT) to reconstruct decontaminated feature information. Additionally, a novel sequential encoder is introduced to achieve a better purification effect. Finally, a pooling distance loss is devised to improve the purification effect of DWT downsampling network. Extensive experiments show that our WFPN achieves promising tracking performance on three well-known UAV benchmarks, especially on sequences with feature pollution. Moreover, our method runs at 33.2 frames per second on the edge platform of Nvidia Jetson AGX Orin, which is suitable for UAVs with limited onboard payload and computing capability.
引用
收藏
页数:14
相关论文
共 71 条
[31]  
Li SY, 2017, AAAI CONF ARTIF INTE, P4140
[32]   Target-Aware Deep Tracking [J].
Li, Xin ;
Ma, Chao ;
Wu, Baoyuan ;
He, Zhenyu ;
Yang, Ming-Hsuan .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :1369-1378
[33]   Augmented Memory for Correlation Filters in Real-Time UAV Tracking [J].
Li, Yiming ;
Fu, Changhong ;
Ding, Fangqiang ;
Huang, Ziyuan ;
Pan, Jia .
2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, :1559-1566
[34]   AutoTrack: Towards High-Performance Visual Tracking for UAV with Automatic Spatio-Temporal Regularization [J].
Li, Yiming ;
Fu, Changhong ;
Ding, Fangqiang ;
Huang, Ziyuan ;
Lu, Geng .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :11920-11929
[35]  
Lin FL, 2020, IEEE INT CONF ROBOT, P2365, DOI [10.1109/icra40945.2020.9196530, 10.1109/ICRA40945.2020.9196530]
[36]   ReCF: Exploiting Response Reasoning for Correlation Filters in Real-Time UAV Tracking [J].
Lin, Fuling ;
Fu, Changhong ;
He, Yujie ;
Xiong, Weijiang ;
Li, Fan .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (08) :10469-10480
[37]   Microsoft COCO: Common Objects in Context [J].
Lin, Tsung-Yi ;
Maire, Michael ;
Belongie, Serge ;
Hays, James ;
Perona, Pietro ;
Ramanan, Deva ;
Dollar, Piotr ;
Zitnick, C. Lawrence .
COMPUTER VISION - ECCV 2014, PT V, 2014, 8693 :740-755
[38]   Slimmable Dataset Condensation [J].
Liu, Songhua ;
Ye, Jingwen ;
Yu, Runpeng ;
Wang, Xinchao .
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, :3759-3768
[39]   Invertible Denoising Network: A Light Solution for Real Noise Removal [J].
Liu, Yang ;
Qin, Zhenyue ;
Anwar, Saeed ;
Ji, Pan ;
Kim, Dongwoo ;
Caldwell, Sabrina ;
Gedeon, Tom .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :13360-13369
[40]   A Benchmark and Simulator for UAV Tracking [J].
Mueller, Matthias ;
Smith, Neil ;
Ghanem, Bernard .
COMPUTER VISION - ECCV 2016, PT I, 2016, 9905 :445-461