FFTransMOT: Feature-Fused Transformer for Enhanced Multi-Object Tracking

Cited by: 2
Authors
Hu, Xufeng [1]
Jeon, Younghoon [2]
Gwak, Jeonghwan [1, 2, 3, 4]
Affiliations
[1] Korea Natl Univ Transportat, Dept IT Energy Convergence, Chungju 27469, South Korea
[2] Korea Natl Univ Transportat, Dept Software, Chungju 27469, South Korea
[3] Korea Natl Univ Transportat, Dept Biomed Engn, Chungju 27469, South Korea
[4] Korea Natl Univ Transportat, Dept AI Robot Engn, Chungju 27469, South Korea
Keywords
Feature extraction; Transformers; Trajectory; Videos; Tracking; Decoding; Data models; Computer vision; Object tracking; feature fusion; multi-object tracking; object identification
DOI
10.1109/ACCESS.2023.3327262
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Multi-object tracking (MOT) is a core task in computer vision: it identifies, tracks, and classifies multiple objects in a video and links their per-frame positions into complete trajectories. MOT has two core components, object detection and data association. In practice this means detecting objects in each frame, deciding which objects to track, associating them with objects in the next frame, and predicting their future trajectories. In this paper, we propose the Feature-Fused Transformer for Enhanced Multi-Object Tracking (FFTransMOT). Its feature fusion module combines information from the current and previous frames into a more robust representation of object features, which makes the features more reliable for the decoder's data association. The decoder then performs data association matching between $\text{frame}_{t}$ and the newly fused features, matching objects across frames and improving tracking over time. We also employ a self-attention mechanism to capture dependencies between input features, which improves the accuracy and stability of object detection. We evaluated FFTransMOT on four datasets (MOT16, MOT17, DanceTrack, and BDD100K). The experimental results show that FFTransMOT outperforms the compared trackers in tracking accuracy and robustness on MOT tasks.
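The fuse-then-associate idea in the abstract can be illustrated with a minimal sketch. This is not the FFTransMOT architecture: the abstract does not specify the fusion operator or the decoder, so the weighted average in `fuse`, the cosine-similarity score, the greedy matcher in `associate`, and the parameters `alpha` and `min_sim` are all assumptions chosen only to make the data flow concrete.

```python
import math

def fuse(older_feat, recent_feat, alpha=0.5):
    """Blend two per-object feature vectors from consecutive frames.

    A plain weighted average is an assumption; the paper's fusion
    module is a learned component whose details are not given here.
    """
    return [alpha * r + (1.0 - alpha) * o for o, r in zip(older_feat, recent_feat)]

def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm > 0 else 0.0

def associate(track_feats, det_feats, min_sim=0.5):
    """Greedy one-to-one matching of fused track features to detections.

    Stands in for the transformer decoder's association step; returns
    a dict mapping track index -> detection index.
    """
    scored = sorted(
        ((cosine(t, d), ti, di)
         for ti, t in enumerate(track_feats)
         for di, d in enumerate(det_feats)),
        reverse=True,
    )
    matches, used_tracks, used_dets = {}, set(), set()
    for sim, ti, di in scored:
        if sim < min_sim:
            break  # remaining pairs are even less similar
        if ti in used_tracks or di in used_dets:
            continue  # each track and detection is matched at most once
        matches[ti] = di
        used_tracks.add(ti)
        used_dets.add(di)
    return matches

# Toy data: two tracked objects with 2-D features from two earlier frames.
older = [[1.0, 0.0], [0.0, 1.0]]
recent = [[0.9, 0.1], [0.1, 0.9]]
fused = [fuse(o, r) for o, r in zip(older, recent)]

# Detections in frame t, listed in the opposite order to the tracks.
detections_t = [[0.0, 1.0], [1.0, 0.1]]
matches = associate(fused, detections_t)
print(matches)
```

In this toy case track 0 is matched to detection 1 and track 1 to detection 0. Greedy matching is used only to keep the sketch dependency-free; a globally optimal assignment solver could replace it without changing the surrounding flow.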
Pages: 130060-130071
Number of pages: 12