Transformer Tracking for Satellite Video: Matching, Propagation, and Prediction

被引：0

作者：

Zhao, Manqi ^{[1
,2
]}

Li, Shengyang ^{[1
,3
]}

Yang, Jian ^{[1
,3
]}

机构：

[1] Chinese Acad Sci, Technol & Engn Ctr Space Utilizat, Key Lab Space Utilizat, Beijing 100094, Peoples R China

[2] Univ Chinese Acad Sci, Sch Comp Sci & Technol, Beijing 100049, Peoples R China

[3] Univ Chinese Acad Sci, Sch Aeronaut & Astronaut, Beijing 100049, Peoples R China

来源：

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2024年 / 62卷

关键词：

Target tracking; Satellites; Transformers; Training; Object tracking; Predictive models; Pipelines; Adaptation models; Feature extraction; Accuracy; Satellite video object tracking; sequence prediction; static matching; temporal propagation; transformer; OBJECT TRACKING; CORRELATION FILTER;

D O I：

10.1109/TGRS.2024.3501380

中图分类号：

P3 [地球物理学]; P59 [地球化学];

学科分类号：

0708 ; 070902 ;

摘要：

Recently, transformer-based trackers have brought overwhelming advantages in general video. However, their performance in satellite video has been hindered by insufficient satellite-specific training and a lack of designs tailored to satellite targets and scene characteristics. To tackle these challenges, we propose a novel transformer-based tracking framework for satellite video object tracking: Transformer Matching, Propagation, and Prediction (TransMPP). TransMPP combines three stages: static matching, dynamic propagation, and prediction, to ensure accurate tracking in satellite videos. Specifically, the Matching model uses a one-stream pipeline for simultaneous feature extraction and relationship modeling across extensive search and template areas, thereby improving foreground and background discrimination capabilities. In addition, the Propagation and Prediction models enhance temporal modeling capabilities through local long-term and short-term feature propagation and global sequence prediction, respectively, boosting tracking robustness. Moreover, to ensure a fair comparison and evaluation, we also developed SatSOT-train, a large-scale training dataset for the SatSOT benchmark. After comprehensive training, TransMPP demonstrates state-of-the-art (SOTA) performance on the SatSOT dataset, achieving an area under the curve (AUC) score of 59.9% and a precision score of 71.5%, bringing improvements of 6.3% and 5.3%, respectively. The code will be available at https://github.com/DonDominic/TransMPP.

引用

页数：16

共 50 条

[41] Siamese Transformer Network for Real-Time Aerial Object Tracking
Wang, Haijun
Zhang, Shengyan
IEEE ACCESS, 2022, 10 : 105201 - 105213
[42] MP2Net: Mask Propagation and Motion Prediction Network for Multiobject Tracking in Satellite Videos
Zhao, Manqi
Li, Shengyang
Wang, Han
Yang, Jian
Sun, Yuhan
Gu, Yanfeng
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 15
[43] Video Tracking Based on Template Matching and Particle Filter
Lin, Shinfeng D.
Chen, Ting-Yi
2018 11TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, BIOMEDICAL ENGINEERING AND INFORMATICS (CISP-BMEI 2018), 2018,
[44] TFTN: A Transformer-Based Fusion Tracking Framework of Hyperspectral and RGB
Zhao, Chunhui
Liu, Hongjiao
Su, Nan
Yan, Yiming
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[45] Revisiting Monocular Satellite Pose Estimation With Transformer
Wang, Zi
Zhang, Zhuo
Sun, Xiaoliang
Li, Zhang
Yu, Qifeng
IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 2022, 58 (05) : 4279 - 4294
[46] Satellite Video Tracking by Multi-Feature Correlation Filters with Motion Estimation
Zhang, Yan
Chen, Deng
Zheng, Yuhui
REMOTE SENSING, 2022, 14 (11)
[47] A Spatiotemporal Fusion Transformer Model for Chlorophyll-a Concentrations Prediction Over Large Areas With Satellite Time Series Data
Zhou, Gaoxiang
Liu, Ming
Li, Liangzhi
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
[48] Relation Learning Reasoning Meets Tiny Object Tracking in Satellite Videos
Yang, Xiaoyan
Jiao, Licheng
Li, Yangyang
Liu, Xu
Liu, Fang
Li, Lingling
Chen, Puhua
Yang, Shuyuan
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 16
[49] CTsynther: Contrastive Transformer Model for End-to-End Retrosynthesis Prediction
Lu, Hao
Wei, Zhiqiang
Zhang, Kun
Wang, Xuze
Ali, Liaqat
Liu, Hao
IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2024, 21 (06) : 2235 - 2245
[50] MR-Transformer: Multiresolution Transformer for Multivariate Time Series Prediction
Zhu, Siying
Zheng, Jiawei
Ma, Qianli
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, : 1 - 13

← 1 2 3 4 5 →