Progressive spatial-temporal transfer model for unsupervised person re-identification

Cited by: 0

Authors:

Zhou, Shuren [1]

Li, Zhixiong [1]

Liu, Jie [1]

Zhou, Jiarui [1]

Zhang, Jianming [1]

Affiliation:

[1] Changsha Univ Sci & Technol, Sch Comp & Commun Engn, Changsha 410114, Hunan, Peoples R China

Source:

INTERNATIONAL JOURNAL OF MULTIMEDIA INFORMATION RETRIEVAL | 2024 / Vol. 13 / Issue 02

Keywords:

Person re-identification; Transfer learning; Spatial-temporal; Feature fusion; Neural network; NETWORK;

DOI:

10.1007/s13735-024-00324-w

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Discipline classification code:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Over the past decade, person re-identification (P-Reid) has become an increasingly active area of computer vision research, with applications in pedestrian tracking, security, and video surveillance. Person re-identification currently performs well when trained with supervision on labeled data, but accuracy frequently drops when models learn without supervision from unlabeled samples. Improving model performance on unlabeled samples therefore remains challenging. To address this problem, we propose a progressive spatial-temporal transfer model (PSTT) that consists of three stages: incremental tuning, spatial-temporal fusion, and target domain learning. In the first stage, a high-performance multi-scale network that can initially cluster samples is obtained by training with a triplet loss function. In the second stage, to mine spatial-temporal and visual semantic information, we introduce a fusion model that combines the visual information extracted by the trained network from the labeled and unlabeled datasets with the corresponding spatial-temporal information. In the final stage, with the assistance of the fusion model, we employ a strategy that extends learning from labeled to unlabeled samples. During training, the fusion model is used to select labeled and unlabeled samples, and a multiple meta loss function is used for transfer learning. During testing, the fusion model is employed to enhance the accuracy of the network. We evaluate our method on five standard P-Reid benchmarks: Market1501, DukeMTMC-ReID, CUHK03, MSMT17, and Occluded-DukeMTMC. Extensive experiments show that the proposed PSTT achieves state-of-the-art performance, outperforming previous methods. The source code is available at https://github.com/LiZX12/PSTT.
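Illustrative note: the abstract states that the first stage trains the network with a triplet loss function but gives no formula. The sketch below shows the standard triplet loss formulation only, not the authors' implementation; the Euclidean distance, the margin of 0.3, and the function names `euclidean` and `triplet_loss` are assumptions chosen for illustration.

```python
import math


def euclidean(a, b):
    """Euclidean distance between two equal-length feature vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def triplet_loss(anchor, positive, negative, margin=0.3):
    """Standard triplet loss: max(0, d(a, p) - d(a, n) + margin).

    The margin value is an assumption, not taken from the paper.
    """
    d_ap = euclidean(anchor, positive)  # distance to same-identity sample
    d_an = euclidean(anchor, negative)  # distance to different-identity sample
    return max(0.0, d_ap - d_an + margin)


# Toy 2-D features (hypothetical values, for illustration only).
anchor = [0.0, 0.0]
same_person = [0.0, 1.0]    # distance 1 from the anchor
other_person = [3.0, 4.0]   # distance 5 from the anchor

# Well-separated triplet: the negative is already far enough away.
easy = triplet_loss(anchor, same_person, other_person)

# Roles swapped: the "positive" is farther than the "negative".
hard = triplet_loss(anchor, other_person, same_person)
```

The loss is zero once the negative is farther from the anchor than the positive by at least the margin, so only triplets that violate this condition contribute to training.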


Pages: 16
