FFTransMOT: Feature-Fused Transformer for Enhanced Multi-Object Tracking

被引：2

作者：

Hu, Xufeng ^{[1
]}

Jeon, Younghoon ^{[2
]}

Gwak, Jeonghwan ^{[1
,2
,3
,4
]}

机构：

[1] Korea Natl Univ Transportat, Dept IT Energy Convergence, Chungju 27469, South Korea

[2] Korea Natl Univ Transportat, Dept Software, Chungju 27469, South Korea

[3] Korea Natl Univ Transportat, Dept Biomed Engn, Chungju 27469, South Korea

[4] Korea Natl Univ Transportat, Dept AI Robot Engn, Chungju 27469, South Korea

来源：

IEEE ACCESS | 2023年 / 11卷

关键词：

Feature extraction; Transformers; Trajectory; Videos; Tracking; Decoding; Data models; Computer vision; Object tracking; feature fusion; multi-object tracking; object identification; OBJECT TRACKING;

D O I：

10.1109/ACCESS.2023.3327262

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In the field of computer vision, multi-object tracking (MOT) is a crucial task. It involves the identification, tracking, and classification of multiple objects in videos, connecting their trajectories to form a complete motion sequence. MOT comprises two core components: object detection and data association. This entails detecting objects in each frame, determining the objects to be tracked, performing data association with the next frame, and predicting the future trajectories of the objects. In this paper, we propose a model named Feature-Fused Transformer for Enhanced Multi-object Tracking (FFTransMOT). In the FFTransMOT framework, a feature fusion module is integral to synthesizing a robust representation of object features by combining information from the current and previous frames. This fusion process strengthens the feature set, enhancing its reliability for the decoder's subsequent data association tasks. The decoder leverages these improved features to accurately match objects across frames, significantly enhancing the model's tracking capabilities over time. Subsequently, the decoder conducts data association matching between $\text{frame}_{t}$ and the newly fused features. Additionally, we employ a self-attention mechanism to capture dependencies between input features, thereby enhancing the accuracy and stability of object detection. To validate the performance of our proposed FFTransMOT model, we conducted rigorous evaluations on four datasets (MOT16, MOT17, DanceTrack, BDD 100k). The experimental results demonstrate that the FFTransMOT model outperforms other trackers in terms of tracking accuracy and robustness in MOT tasks.

引用

页码：130060 / 130071

页数：12

共 50 条

[41] CFTracker: Multi-Object Tracking With Cross-Frame Connections in Satellite Videos
Kong, Lingyu
Yan, Zhiyuan
Zhang, Yidan
Diao, Wenhui
Zhu, Zining
Wang, Lei
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
[42] Visible and Infrared Object Tracking via Convolution-Transformer Network With Joint Multimodal Feature Learning
Qiu, Jiazhu
Yao, Rui
Zhou, Yong
Wang, Peng
Zhang, Yanning
Zhu, Hancheng
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2023, 20
[43] Multi-object tracking using context-sensitive enhancement via feature fusion
Zhou, Yan
Chen, Junyu
Wang, Dongli
Zhu, Xiaolin
MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (07) : 19465 - 19484
[44] Multi-Object Tracking in Video Sequences Based on Background Subtraction and SIFT Feature Matching
Rahman, Md. Saidur
Saha, Aparna
Khanum, Snigdha
ICCIT: 2009 FOURTH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCES AND CONVERGENCE INFORMATION TECHNOLOGY, VOLS 1 AND 2, 2009, : 457 - 462
[45] Multi-object tracking using context-sensitive enhancement via feature fusion
Yan Zhou
Junyu Chen
Dongli Wang
Xiaolin Zhu
Multimedia Tools and Applications, 2024, 83 : 19465 - 19484
[46] Multi-object tracking based on network flow model and ORB feature
Chen, Jieyu
Xi, Zhenghao
Lu, Junxin
Ji, Jingjing
APPLIED INTELLIGENCE, 2022, 52 (11) : 12282 - 12300
[47] Multi-object tracking based on network flow model and ORB feature
Jieyu Chen
Zhenghao Xi
Junxin Lu
Jingjing Ji
Applied Intelligence, 2022, 52 : 12282 - 12300
[48] Pedestrian Multi-object Tracking Algorithm Based on Attention Feature Fusion
Zhou, Yan
Du, Zhennan
Wang, Dongli
ADVANCES IN COMPUTATIONAL INTELLIGENCE, IWANN 2023, PT I, 2023, 14134 : 105 - 118
[49] SFFSORT Multi-Object Tracking by Shallow Feature Fusion for Vehicle Counting
Zhonglin, Tian
Wahab, Mohd Nadhir Ab
Akbar, Muhammad Firdaus
Mohamed, Ahmad Sufril Azlan
Noor, Mohd Halim Mohd
Rosdi, Bakhtiar Affendi
IEEE ACCESS, 2023, 11 : 76827 - 76841
[50] Multi-object tracking in UAVs with feature fusion distribution and occlusion awareness
Wang, Yuchen
Zhao, Wei
Zhang, Rufei
Li, Nannan
Li, Dongjin
Lv, Jianwei
Xu, Jingyu
SIGNAL IMAGE AND VIDEO PROCESSING, 2025, 19 (01)

← 1 2 3 4 5 →