Transformer Tracking for Satellite Video: Matching, Propagation, and Prediction

Authors
Zhao, Manqi [1,2]
Li, Shengyang [1,3]
Yang, Jian [1,3]
Affiliations
[1] Chinese Acad Sci, Technol & Engn Ctr Space Utilizat, Key Lab Space Utilizat, Beijing 100094, Peoples R China
[2] Univ Chinese Acad Sci, Sch Comp Sci & Technol, Beijing 100049, Peoples R China
[3] Univ Chinese Acad Sci, Sch Aeronaut & Astronaut, Beijing 100049, Peoples R China
Source
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2024 / Vol. 62
Keywords
Target tracking; Satellites; Transformers; Training; Object tracking; Predictive models; Pipelines; Adaptation models; Feature extraction; Accuracy; Satellite video object tracking; sequence prediction; static matching; temporal propagation; transformer; OBJECT TRACKING; CORRELATION FILTER;
DOI
10.1109/TGRS.2024.3501380
Chinese Library Classification (CLC) number
P3 [Geophysics]; P59 [Geochemistry];
Discipline classification code
0708; 070902;
Abstract
Recently, transformer-based trackers have brought overwhelming advantages in general video. However, their performance in satellite video has been hindered by insufficient satellite-specific training and a lack of designs tailored to satellite targets and scene characteristics. To tackle these challenges, we propose a novel transformer-based tracking framework for satellite video object tracking: Transformer Matching, Propagation, and Prediction (TransMPP). TransMPP combines three stages: static matching, dynamic propagation, and prediction, to ensure accurate tracking in satellite videos. Specifically, the Matching model uses a one-stream pipeline for simultaneous feature extraction and relationship modeling across extensive search and template areas, thereby improving foreground and background discrimination capabilities. In addition, the Propagation and Prediction models enhance temporal modeling capabilities through local long-term and short-term feature propagation and global sequence prediction, respectively, boosting tracking robustness. Moreover, to ensure a fair comparison and evaluation, we also developed SatSOT-train, a large-scale training dataset for the SatSOT benchmark. After comprehensive training, TransMPP demonstrates state-of-the-art (SOTA) performance on the SatSOT dataset, achieving an area under the curve (AUC) score of 59.9% and a precision score of 71.5%, bringing improvements of 6.3% and 5.3%, respectively. The code will be available at https://github.com/DonDominic/TransMPP.
Pages: 16