Progressive spatial-temporal transfer model for unsupervised person re-identification

Cited by: 0
Authors
Zhou, Shuren [1]
Li, Zhixiong [1]
Liu, Jie [1]
Zhou, Jiarui [1]
Zhang, Jianming [1]
Affiliations
[1] Changsha Univ Sci & Technol, Sch Comp & Commun Engn, Changsha 410114, Hunan, Peoples R China
Keywords
Person re-identification; Transfer learning; Spatial-temporal; Feature fusion; Neural network
DOI
10.1007/s13735-024-00324-w
Chinese Library Classification (CLC)
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Over the past decade, person re-identification (P-Reid) has become an increasingly active area of computer vision research, with applications in pedestrian tracking, security, and video surveillance. Person re-identification currently performs well when trained with supervision on labeled data, but accuracy frequently suffers when learning without supervision on unlabeled samples. Improving model performance on unlabeled samples therefore remains challenging. To address this problem, we propose a progressive spatial-temporal transfer model (PSTT) that consists of three stages: incremental tuning, spatial-temporal fusion, and target domain learning. In the first stage, a triplet loss function is used to obtain a high-performance multi-scale network that can initially cluster samples. In the second stage, to mine spatial-temporal and visual semantic information, we introduce a fusion model that combines the visual information extracted by the trained network from the labeled and unlabeled datasets with the corresponding spatial-temporal information. In the final stage, with the assistance of the fusion model, we employ a strategy that extends learning from labeled to unlabeled samples. During training, the fusion model is used to select labeled and unlabeled samples, and a multiple meta loss function is used for transfer learning. During testing, the fusion model is employed to improve the accuracy of the network. We evaluate our method on five standard P-Reid benchmarks: Market1501, DukeMTMC-ReID, CUHK03, MSMT17, and Occluded-DukeMTMC. Extensive experiments show that the proposed PSTT achieves state-of-the-art performance, exceeding previous methods. The source code is available at https://github.com/LiZX12/PSTT.
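The abstract names a triplet loss as the objective of the first stage but does not give its form. As a rough illustration only, the standard triplet margin loss can be sketched as below. This is not taken from the authors' code: the function names, the Euclidean distance, the margin value of 0.3, and the toy 2-D feature vectors are all assumptions made for the example.

```python
import math


def euclidean(a, b):
    """Euclidean distance between two equal-length feature vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def triplet_loss(anchor, positive, negative, margin=0.3):
    """Standard triplet margin loss for one (anchor, positive, negative) triple.

    The loss is zero once the negative is farther from the anchor than the
    positive by at least `margin`. The margin of 0.3 is an assumed value,
    not one stated in the abstract.
    """
    d_ap = euclidean(anchor, positive)  # distance to a same-identity sample
    d_an = euclidean(anchor, negative)  # distance to a different-identity sample
    return max(0.0, d_ap - d_an + margin)


# Toy 2-D features (hypothetical, for illustration only).
anchor = [0.0, 0.0]
positive = [0.0, 1.0]   # distance 1 from the anchor
negative = [3.0, 4.0]   # distance 5 from the anchor

# Well-separated triple: the margin is already satisfied, so the loss is zero.
easy = triplet_loss(anchor, positive, negative)

# Swapping the roles gives a violated triple with a positive loss.
hard = triplet_loss(anchor, negative, positive)
```

Minimizing this quantity pulls features of the same identity together and pushes features of different identities apart, which is the property that makes an initial clustering of samples possible.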
Pages: 16