TrackingMamba: Visual State Space Model for Object Tracking

Cited by: 2
Authors
Wang, Qingwang [1, 2]
Zhou, Liyao [1, 2]
Jin, Pengcheng [1, 2]
Xin, Qu [1, 2]
Zhong, Hangwei [1, 2]
Song, Haochen [1, 2]
Shen, Tao [1, 2]
Affiliations
[1] Kunming Univ Sci & Technol, Fac Informat Engn & Automat, Kunming 650500, Peoples R China
[2] Kunming Univ Sci & Technol, Yunnan Key Lab Comp Technol Applicat, Kunming 650500, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Object tracking; Autonomous aerial vehicles; Transformers; Feature extraction; Computational modeling; Accuracy; Visualization; Jungle scenes; Mamba; object tracking; UAV remote sensing;
DOI
10.1109/JSTARS.2024.3458938
Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification code
0808; 0809
Abstract
In recent years, UAV object tracking has provided technical support across various fields. Most existing work relies on convolutional neural networks (CNNs) or visual transformers. However, CNNs have limited receptive fields, which leads to suboptimal performance, while transformers require substantial computational resources, which makes training and inference challenging. Mountainous and jungle environments, which are critical components of the Earth's surface and key scenarios for UAV object tracking, present unique challenges: steep terrain, dense vegetation, and rapidly changing weather conditions all complicate tracking. The lack of relevant datasets further reduces tracking accuracy. This article introduces TrackingMamba, a new tracking framework based on a state-space model, which uses a single-stream tracking architecture with Vision Mamba as its backbone. TrackingMamba matches transformer-based trackers in global feature extraction and long-range dependency modeling while keeping computational cost growing only linearly. Compared to other advanced trackers, TrackingMamba delivers higher accuracy with a simpler model framework, fewer parameters, and fewer FLOPs. Specifically, on the UAV123 benchmark, TrackingMamba outperforms the baseline model OSTrack-256, improving AUC by 2.59% and Precision by 4.42%, while reducing parameters by 95.52% and FLOPs by 95.02%. The article also evaluates the performance and shortcomings of TrackingMamba and other advanced trackers in jungle environments, and it explores potential future research directions in UAV jungle object tracking.
Pages: 16744-16754
Page count: 11
    2016 IEEE INTERNATIONAL CONFERENCE ON INFORMATION AND AUTOMATION (ICIA), 2016, : 1846 - 1849