Dual representation modeling and progressive contrastive learning for unsupervised video person re-identification

被引:0
作者
Zhang, Cong [1 ]
Su, Yanzhao [1 ]
Wang, Nian [1 ]
Lan, Yunwei [1 ]
Wang, Tao [1 ]
Li, Aihua [1 ]
机构
[1] Xian Res Inst High Tech, Tong Xin Rd, Xian 710025, Shaanxi, Peoples R China
关键词
Unsupervised learning; Video person re-identification; Clustering; Contrastive learning;
D O I
10.1016/j.neucom.2025.130467
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Unsupervised video person re-identification (USL-VReID) aims to identify the same person across cameras without annotation. The key to this challenging task is to take advantage of the discriminative features of frames and tracklets. Most existing methods model tracklet features by mean average pooling and address the USL-VReID problem by using cluster centroid contrastive learning. However, the mean average pooling strategy and cluster centers primarily focus on commonality, overlooking divergencies. In this paper, we propose a Dual Representation Modeling and Progressive Contrastive Learning (DRMPCL) method to learn fine-grained features of the diversity distribution. Specifically, our approach explores the discriminant features of diversity distribution in both frame-level tracklet modeling and tracklet-level discrepant contrastive learning. Then a multi-level progressive learning strategy effectively optimizes the model from reliable to informative, and from commonality to divergence. Comprehensive experiments on two widely-used datasets demonstrate that DRMPCL outperforms the state-of-the-art methods.
引用
收藏
页数:12
相关论文
共 54 条
[1]   Spatio-Temporal Representation Factorization for Video-based Person Re-Identification [J].
Aich, Abhishek ;
Zheng, Meng ;
Karanam, Srikrishna ;
Chen, Terrence ;
Roy-Chowdhury, Amit K. ;
Wu, Ziyan .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :152-162
[2]   Incorporating texture and silhouette for video-based person re-identification [J].
Bai, Shutao ;
Chang, Hong ;
Ma, Bingpeng .
PATTERN RECOGNITION, 2024, 156
[3]  
Chen Y., 2018, P BRIT MACH VIS C
[4]  
Cheng D., 2022, IEEE T IMAGE PROCESS, V31, P3334, DOI DOI 10.1109/TIP.2022.3169693
[5]  
Dai Z., 2022, P AS C COMP VIS, P1142, DOI DOI 10.1007/978-3-031-26351-4_20
[6]  
Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
[7]  
Ding G., 2019, 30 BRIT MACH VIS C 2
[8]  
Ester M., 1996, KDD, DOI DOI 10.5555/3001460.3001507
[9]  
Fan H., 2017, ACM Trans. Multimed. Comput. Commun. Appl., V14
[10]  
Fu Y, 2019, AAAI CONF ARTIF INTE, P8287