Transformer Tracking for Satellite Video: Matching, Propagation, and Prediction

被引:0
|
作者
Zhao, Manqi [1 ,2 ]
Li, Shengyang [1 ,3 ]
Yang, Jian [1 ,3 ]
机构
[1] Chinese Acad Sci, Technol & Engn Ctr Space Utilizat, Key Lab Space Utilizat, Beijing 100094, Peoples R China
[2] Univ Chinese Acad Sci, Sch Comp Sci & Technol, Beijing 100049, Peoples R China
[3] Univ Chinese Acad Sci, Sch Aeronaut & Astronaut, Beijing 100049, Peoples R China
来源
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2024年 / 62卷
关键词
Target tracking; Satellites; Transformers; Training; Object tracking; Predictive models; Pipelines; Adaptation models; Feature extraction; Accuracy; Satellite video object tracking; sequence prediction; static matching; temporal propagation; transformer; OBJECT TRACKING; CORRELATION FILTER;
D O I
10.1109/TGRS.2024.3501380
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Recently, transformer-based trackers have brought overwhelming advantages in general video. However, their performance in satellite video has been hindered by insufficient satellite-specific training and a lack of designs tailored to satellite targets and scene characteristics. To tackle these challenges, we propose a novel transformer-based tracking framework for satellite video object tracking: Transformer Matching, Propagation, and Prediction (TransMPP). TransMPP combines three stages: static matching, dynamic propagation, and prediction, to ensure accurate tracking in satellite videos. Specifically, the Matching model uses a one-stream pipeline for simultaneous feature extraction and relationship modeling across extensive search and template areas, thereby improving foreground and background discrimination capabilities. In addition, the Propagation and Prediction models enhance temporal modeling capabilities through local long-term and short-term feature propagation and global sequence prediction, respectively, boosting tracking robustness. Moreover, to ensure a fair comparison and evaluation, we also developed SatSOT-train, a large-scale training dataset for the SatSOT benchmark. After comprehensive training, TransMPP demonstrates state-of-the-art (SOTA) performance on the SatSOT dataset, achieving an area under the curve (AUC) score of 59.9% and a precision score of 71.5%, bringing improvements of 6.3% and 5.3%, respectively. The code will be available at https://github.com/DonDominic/TransMPP.
引用
收藏
页数:16
相关论文
共 50 条
  • [21] Aircraft Tracking Based on an Antidrift Multifilter Tracker in Satellite Video Data
    Pang, Ran
    Gao, Fang
    Zhang, Peng
    Li, Xiangkun
    Zhai, Yuwei
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2023, 16 : 4439 - 4456
  • [22] Deep Learning-Based Object Tracking in Satellite Videos: A Comprehensive Survey With a New Dataset
    Li, Yuxuan
    Jiao, Licheng
    Huang, Zhongjian
    Zhang, Xin
    Zhang, Ruohan
    Song, Xue
    Tian, Chenxi
    Zhang, Zixiao
    Liu, Fang
    Shuyuan, Yang
    Hou, Biao
    Ma, Wenping
    Liu, Xu
    Li, Lingling
    IEEE GEOSCIENCE AND REMOTE SENSING MAGAZINE, 2022, 10 (04) : 181 - 212
  • [23] HRSiam: High-Resolution Siamese Network, Towards Space-Borne Satellite Video Tracking
    Shao, Jia
    Du, Bo
    Wu, Chen
    Gong, Mingming
    Liu, Tongliang
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 3056 - 3068
  • [24] Sparse Transformer-Based Sequence Generation for Visual Object Tracking
    Tian, Dan
    Liu, Dong-Xin
    Wang, Xiao
    Hao, Ying
    IEEE ACCESS, 2024, 12 : 154418 - 154425
  • [25] Small Target Tracking in Satellite Videos Using Background Compensation
    Wang, Yunming
    Wang, Taoyang
    Zhang, Guo
    Cheng, Qian
    Wu, Jia-qi
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2020, 58 (10): : 7010 - 7021
  • [26] MBLT: Learning Motion and Background for Vehicle Tracking in Satellite Videos
    Zhang, Wenhua
    Jiao, Licheng
    Liu, Fang
    Li, Lingling
    Liu, Xu
    Liu, Jia
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [27] AFMtrack: Attention-Based Feature Matching for Multiple Object Tracking
    Cuong Bui, Duy
    Anh Hoang, Hiep
    Yoo, Myungsik
    IEEE ACCESS, 2024, 12 : 82897 - 82910
  • [28] Video tracking using block matching
    Hariharakrishnan, K
    Schonfeld, D
    Raffy, P
    Yassa, F
    2003 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL 3, PROCEEDINGS, 2003, : 945 - 948
  • [29] Unsupervised Nighttime Object Tracking Based on Transformer and Domain Adaptation Fusion Network
    Wei, Haoran
    Fu, Yanyun
    Wang, Deyong
    Guo, Rui
    Zhao, Xueyi
    Fang, Jian
    IEEE ACCESS, 2024, 12 : 130896 - 130913
  • [30] VTST: Efficient Visual Tracking With a Stereoscopic Transformer
    Gu, Fengwei
    Lu, Jun
    Cai, Chengtao
    Zhu, Qidan
    Ju, Zhaojie
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2024, 8 (03): : 2401 - 2416