FFTransMOT: Feature-Fused Transformer for Enhanced Multi-Object Tracking

Cited by: 2
Authors
Hu, Xufeng [1]
Jeon, Younghoon [2]
Gwak, Jeonghwan [1, 2, 3, 4]
Affiliations
[1] Korea Natl Univ Transportat, Dept IT Energy Convergence, Chungju 27469, South Korea
[2] Korea Natl Univ Transportat, Dept Software, Chungju 27469, South Korea
[3] Korea Natl Univ Transportat, Dept Biomed Engn, Chungju 27469, South Korea
[4] Korea Natl Univ Transportat, Dept AI Robot Engn, Chungju 27469, South Korea
Keywords
Feature extraction; Transformers; Trajectory; Videos; Tracking; Decoding; Data models; Computer vision; Object tracking; feature fusion; multi-object tracking; object identification
DOI
10.1109/ACCESS.2023.3327262
Chinese Library Classification
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Multi-object tracking (MOT) is a crucial task in computer vision. It involves identifying, tracking, and classifying multiple objects in videos and connecting their trajectories into complete motion sequences. MOT comprises two core components: object detection and data association. This entails detecting objects in each frame, determining which objects to track, associating them with the next frame, and predicting their future trajectories. In this paper, we propose the Feature-Fused Transformer for Enhanced Multi-Object Tracking (FFTransMOT). In the FFTransMOT framework, a feature fusion module combines information from the current and previous frames into a robust representation of object features. This fusion strengthens the feature set and makes it more reliable for the decoder's subsequent data association: the decoder performs data association matching between $\text{frame}_{t}$ and the newly fused features, which improves how accurately objects are matched across frames over time. Additionally, we employ a self-attention mechanism to capture dependencies between input features, thereby enhancing the accuracy and stability of object detection. To validate FFTransMOT, we evaluated it on four datasets (MOT16, MOT17, DanceTrack, and BDD100K). The experimental results show that FFTransMOT outperforms other trackers in tracking accuracy and robustness on MOT tasks.
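The fuse-then-associate idea in the abstract can be illustrated with a toy sketch. This is not the FFTransMOT fusion module or its transformer decoder, whose details the abstract does not give. As stand-ins, it assumes a plain weighted average for fusion and greedy cosine-similarity matching for association. The names `fuse`, `associate`, `alpha`, and `min_sim` are hypothetical and chosen only for illustration.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length feature vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def fuse(prev_feat, curr_feat, alpha=0.5):
    """Assumed stand-in for feature fusion: a weighted average of the
    previous-frame and current-frame features of one object."""
    return [alpha * p + (1.0 - alpha) * c for p, c in zip(prev_feat, curr_feat)]


def associate(track_feats, det_feats, min_sim=0.5):
    """Assumed stand-in for data association: greedy one-to-one matching
    of track IDs to detection indices, highest similarity first."""
    pairs = [
        (cosine(t, d), track_id, det_idx)
        for track_id, t in track_feats.items()
        for det_idx, d in enumerate(det_feats)
    ]
    pairs.sort(key=lambda p: p[0], reverse=True)

    matches, used_dets = {}, set()
    for sim, track_id, det_idx in pairs:
        if sim < min_sim:
            break  # remaining pairs are even less similar
        if track_id in matches or det_idx in used_dets:
            continue  # keep the matching one-to-one
        matches[track_id] = det_idx
        used_dets.add(det_idx)
    return matches


# Fused features carried over from earlier frames, keyed by track ID.
tracks = {1: [1.0, 0.0], 2: [0.0, 1.0]}
# Features of the objects detected in the current frame.
detections = [[0.1, 0.9], [0.9, 0.1]]

matches = associate(tracks, detections)
# Refresh each matched track by fusing in its new detection feature.
for track_id, det_idx in matches.items():
    tracks[track_id] = fuse(tracks[track_id], detections[det_idx])
```

In this toy run, track 1 is paired with the second detection and track 2 with the first, and each track's stored feature then moves toward its matched detection. Greedy matching is used here only to keep the sketch dependency-free; trackers often use the Hungarian algorithm for this step instead.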
Pages: 130060-130071
Page count: 12