Unsupervised Domain Adaptation Via Dynamic Clustering and Co-Segment Attentive Learning for Video-Based Person Re-Identification

Cited by: 1
Authors
Zhang, Fuping [1]
Chen, Fengjun [1]
Su, Zhonggen [2]
Wei, Jianming [3]
Affiliations
[1] Wenzhou Univ Technol, Sch Intelligent Mfg & Elect Engn, Wenzhou 325001, Peoples R China
[2] Wenzhou Univ Technol, Taishun Res Inst, Wenzhou 325011, Peoples R China
[3] Chinese Acad Sci, Shanghai Adv Res Inst, Shanghai 201210, Peoples R China
Keywords
Adaptation models; Pedestrians; Training; Generative adversarial networks; Data models; Cameras; Supervised learning; Identification of persons; Unsupervised learning; Clustering algorithms; Learning systems; Person re-identification; unsupervised domain adaptation; dynamic clustering; co-segment attentive learning;
DOI
10.1109/ACCESS.2024.3365583
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
Supervised person re-identification (Re-ID) models trained on labeled datasets achieve high recognition performance within the same data domain. However, accuracy drops dramatically when these models are applied directly to other unlabeled datasets or to natural environments, because of a significant gap in sample distribution between the two domains. Unsupervised Domain Adaptation (UDA) methods address this problem by fine-tuning the model on the target dataset with pseudo-labels generated by a clustering method. Yet these methods are aimed primarily at image-based person Re-ID, because in video scenarios the background noise and interfering information are complex and changeable, resulting in large intra-class distances and small inter-class distances, which easily lead to noisy labels. The large domain gap and the noisy labels heavily hinder clustering and training in video-based person Re-ID. To address this problem, we propose a novel UDA method for video-based person Re-ID via Dynamic Clustering and Co-segment Attentive Learning (DCCAL). DCCAL comprises a Dynamic Clustering (DC) module and a Co-segment Attentive Learning (CAL) module. The DC module adaptively clusters pedestrians within different generation processes to alleviate noisy labels, while the CAL module reduces the domain gap using a co-segmentation-based attention mechanism. Additionally, we introduce a Kullback-Leibler (KL) divergence loss to reduce the discrepancy between the feature distributions of the two domains for better performance. Experimental results on two large-scale video-based person Re-ID datasets, MARS and DukeMTMC-VideoReID (DukeV), demonstrate high accuracy. Our method outperforms state-of-the-art semi-supervised and unsupervised approaches by 1.1% in Rank-1 and 1.5% in mAP on DukeV, and by 3.1% in Rank-1 and 2.1% in mAP on MARS.
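The KL divergence term mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not say how the two feature distributions are formed, so this example makes an assumption: each domain is summarised by the softmax of its batch-mean feature vector. The function names (`softmax`, `mean_feature`, `kl_divergence`, `domain_kl_loss`) are hypothetical and are not taken from the paper.

```python
import math


def softmax(values):
    """Turn a list of real numbers into a probability distribution."""
    peak = max(values)  # subtract the max for numerical stability
    exps = [math.exp(v - peak) for v in values]
    total = sum(exps)
    return [e / total for e in exps]


def mean_feature(batch):
    """Average a batch of equal-length feature vectors, dimension by dimension."""
    count = len(batch)
    return [sum(column) / count for column in zip(*batch)]


def kl_divergence(p, q, eps=1e-12):
    """KL(p || q) for two discrete distributions of the same length."""
    return sum(pi * math.log((pi + eps) / (qi + eps)) for pi, qi in zip(p, q))


def domain_kl_loss(source_batch, target_batch):
    """Assumed form of the loss: KL between softmax-normalised batch-mean features."""
    p = softmax(mean_feature(source_batch))
    q = softmax(mean_feature(target_batch))
    return kl_divergence(p, q)


# Toy 2-dimensional features for a source batch and a target batch.
source = [[1.0, 0.0], [1.0, 0.0]]
target = [[0.0, 1.0], [0.0, 1.0]]
same_loss = domain_kl_loss(source, source)
shifted_loss = domain_kl_loss(source, target)
```

Under these assumptions the loss is zero when both batches share the same mean feature and grows as the means drift apart, which is the behaviour a distribution-alignment term is meant to penalise. A real training setup would more likely compute this on tensors with an automatic-differentiation framework.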
Pages: 29583-29595
Page count: 13