共 50 条
Unsupervised Domain Adaptation Via Dynamic Clustering and Co-Segment Attentive Learning for Video-Based Person Re-Identification
被引:1
|作者:
Zhang, Fuping
[1
]
Chen, Fengjun
[1
]
Su, Zhonggen
[2
]
Wei, Jianming
[3
]
机构:
[1] Wenzhou Univ Technol, Sch Intelligent Mfg & Elect Engn, Wenzhou 325001, Peoples R China
[2] Wenzhou Univ Technol, Taishun Res Inst, Wenzhou 325011, Peoples R China
[3] Chinese Acad Sci, Shanghai Adv Res Inst, Shanghai 201210, Peoples R China
来源:
关键词:
Adaptation models;
Pedestrians;
Training;
Generative adversarial networks;
Data models;
Cameras;
Supervised learning;
Identification of persons;
Unsupervised learning;
Clustering algorithms;
Learning systems;
Person re-identification;
unsupervised domain adaptation;
dynamic clustering;
co-segment attentive learning;
D O I:
10.1109/ACCESS.2024.3365583
中图分类号:
TP [自动化技术、计算机技术];
学科分类号:
0812 ;
摘要:
Currently, supervised person re-identification (Re-ID) models trained on labeled datasets can achieve high recognition performance in the same data domain. However, accuracy drops dramatically when these models are directly applied to other unlabeled datasets or natural environments, due to a significant sample distribution gap between the two domains. Unsupervised Domain Adaptation (UDA) methods can solve this problem by fine-tuning the model on the target dataset with pseudo-labels generated by the clustering method. Yet, these methods are primarily aimed at the image-based person Re-ID domain. This is because the background noise and interference information are complex and changeable in the video scenarios, resulting in large intra-class distances and small inter-class spaces, which easily lead to noisy labels. Huge domain gap and noisy labels hinder clustering and training processes heavily in the video-based person Re-ID. To address the problem, we propose a novel UDA method via Dynamic Clustering and Co-segment Attentive Learning (DCCAL) for it. DCCAL includes a Dynamic Clustering (DC) module and a Co-segment Attentive Learning (CAL) module. The DC module is responsible for adaptively clustering pedestrians within different generation processes to alleviate noisy labels. On the other hand, the CAL module reduces the domain gap using a co-segmentation-based attention mechanism. Additionally, we introduce Kullback-Leibler (KL) divergence loss to reduce the distribution of features between two domains for better performance. Experimental results on two large-scale video-based person Re-ID datasets, MARS and DukeMTMC-VideoReID (DukeV), demonstrate exceptional precision performance. Our method outperforms state-of-the-art semi-supervised and unsupervised approaches by 1.1% in Rank-1 and 1.5% in mAP on DukeV, as well as 3.1% and 2.1% in Rank-1 and mAP on MARS, respectively.
引用
收藏
页码:29583 / 29595
页数:13
相关论文