FFTransMOT: Feature-Fused Transformer for Enhanced Multi-Object Tracking

被引：2

作者：

Hu, Xufeng ^{[1
]}

Jeon, Younghoon ^{[2
]}

Gwak, Jeonghwan ^{[1
,2
,3
,4
]}

机构：

[1] Korea Natl Univ Transportat, Dept IT Energy Convergence, Chungju 27469, South Korea

[2] Korea Natl Univ Transportat, Dept Software, Chungju 27469, South Korea

[3] Korea Natl Univ Transportat, Dept Biomed Engn, Chungju 27469, South Korea

[4] Korea Natl Univ Transportat, Dept AI Robot Engn, Chungju 27469, South Korea

来源：

IEEE ACCESS | 2023年 / 11卷

关键词：

Feature extraction; Transformers; Trajectory; Videos; Tracking; Decoding; Data models; Computer vision; Object tracking; feature fusion; multi-object tracking; object identification; OBJECT TRACKING;

D O I：

10.1109/ACCESS.2023.3327262

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In the field of computer vision, multi-object tracking (MOT) is a crucial task. It involves the identification, tracking, and classification of multiple objects in videos, connecting their trajectories to form a complete motion sequence. MOT comprises two core components: object detection and data association. This entails detecting objects in each frame, determining the objects to be tracked, performing data association with the next frame, and predicting the future trajectories of the objects. In this paper, we propose a model named Feature-Fused Transformer for Enhanced Multi-object Tracking (FFTransMOT). In the FFTransMOT framework, a feature fusion module is integral to synthesizing a robust representation of object features by combining information from the current and previous frames. This fusion process strengthens the feature set, enhancing its reliability for the decoder's subsequent data association tasks. The decoder leverages these improved features to accurately match objects across frames, significantly enhancing the model's tracking capabilities over time. Subsequently, the decoder conducts data association matching between $\text{frame}_{t}$ and the newly fused features. Additionally, we employ a self-attention mechanism to capture dependencies between input features, thereby enhancing the accuracy and stability of object detection. To validate the performance of our proposed FFTransMOT model, we conducted rigorous evaluations on four datasets (MOT16, MOT17, DanceTrack, BDD 100k). The experimental results demonstrate that the FFTransMOT model outperforms other trackers in terms of tracking accuracy and robustness in MOT tasks.

引用

页码：130060 / 130071

页数：12

共 50 条

[21] Multi-object tracking by multi-feature fusion to associate all detected boxes
Bilakeri, Shavantrevva
Karunakar, A. K.
COGENT ENGINEERING, 2022, 9 (01):
[22] FETrack: Feature-Enhanced Transformer Network for Visual Object Tracking
Liu, Hang
Huang, Detian
Lin, Mingxin
APPLIED SCIENCES-BASEL, 2024, 14 (22):
[23] Identity-Quantity Harmonic Multi-Object Tracking
He, Yuhang
Wei, Xing
Hong, Xiaopeng
Ke, Wei
Gong, Yihong
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 2201 - 2215
[24] Multi-camera multi-object tracking: A review of current trends and future advances
Amosa, Temitope Ibrahim
Sebastian, Patrick
Izhar, Lila Iznita
Ibrahim, Oladimeji
Ayinla, Lukman Shehu
Bahashwan, Abdulrahman Abdullah
Bala, Abubakar
Samaila, Yau Alhaji
NEUROCOMPUTING, 2023, 552
[25] An Object Point Set Inductive Tracker for Multi-Object Tracking and Segmentation
Gao, Yan
Xu, Haojun
Zheng, Yu
Li, Jie
Gao, Xinbo
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 6083 - 6096
[26] Transformer-based two-source motion model for multi-object tracking
Jieming Yang
Hongwei Ge
Shuzhi Su
Guoqing Liu
Applied Intelligence, 2022, 52 : 9967 - 9979
[27] Trajectories as Topics: Multi-Object Tracking by Topic Discovery
Luo, Wenhan
Stenger, Bjorn
Zhao, Xiaowei
Kim, Tae-Kyun
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (01) : 240 - 252
[28] Aggregate Tracklet Appearance Features for Multi-Object Tracking
Chen, Long
Ai, Haizhou
Chen, Rui
Zhuang, Zijie
IEEE SIGNAL PROCESSING LETTERS, 2019, 26 (11) : 1613 - 1617
[29] Transformer-based two-source motion model for multi-object tracking
Yang, Jieming
Ge, Hongwei
Su, Shuzhi
Liu, Guoqing
APPLIED INTELLIGENCE, 2022, 52 (09) : 9967 - 9979
[30] DETrack: Multi-Object Tracking Algorithm Based on Feature Decomposition and Feature Enhancement
Wen, Feng
Huang, Haixin
Yin, Xiangyang
Ma, Junguang
Hu, Xiaojie
IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2024, E107A (09) : 1522 - 1533

← 1 2 3 4 5 →