Multi-level fine-grained center calibration network for unsupervised person re-identification

被引：0

作者：

Che, Haojie ^{[1
]}

Zhao, Jiacheng ^{[1
]}

Li, Yongxi ^{[2
,3
]}

机构：

[1] ShanghaiTech Univ, Sch Informat Sci & Technol, Middle Huaxia Rd, Shanghai 201210, Peoples R China

[2] Chinese Acad Sci, State Key Lab Multimodal Artificial Intelligence S, Inst Automat, 95 Zhongguancun East Rd, Beijing 100190, Peoples R China

[3] Beihang Univ, Comp Sci & Engn, Colleage Rd, Beijing 100191, Peoples R China

来源：

MULTIMEDIA SYSTEMS | 2025年 / 31卷 / 02期

关键词：

Unsupervised person re-identification; Contrastive learning; Pseudo label;

D O I：

10.1007/s00530-025-01729-1

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Person re-identification (ReID) aims to match individuals across different camera views. Unlike traditional supervised methods, unsupervised ReID bypasses the need for costly manual annotations, making it highly desirable for real-world applications. In recent years, clustering-based pseudo-labeling has become a widely used approach in unsupervised person re-identification, achieving state-of-the-art performance on several benchmarks. However, two key limitations remain: (1) Biased Cluster Centers: Hard samples introduce bias into the cluster centers, diminishing the effectiveness of cluster center based contrastive learning. (2) Limitations of Local Features: Existing methods primarily rely on horizontal stripe pooling to extract local features, constraining their capacity to represent sample diversity. To address these limitations, we propose a novel Multi-Level Fine-Grained Center Calibration Network (MFCC) integrating a Fine-Grained Enhanced Feature Extractor and a Center-Guided Feature Calibration module. The Fine-Grained Enhanced Feature Extractor employs a multi-level attention strategy, incorporating low to high level clues, to dynamically identify discriminative regions and extract fine-grained local features. The Center-Guided Feature Calibration module uses a Gaussian Mixture Model (GMM) to identify and calibrate hard samples toward the center of easy samples, resulting in more compact clusters and refined cluster centers. Extensive experiments on two benchmark datasets, Market-1501 and MSMT17, demonstrate the effectiveness of our proposed MFCC framework.

引用

页数：13

共 30 条

[11] Li HF, 2024, AAAI CONF ARTIF INTE, P3091
[12] Transformer Based Multi-Grained Features for Unsupervised Person Re-Identification
Li, Jiachen
Wang, Menglin
Gong, Xiaojin
[J]. 2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WORKSHOPS (WACVW), 2023, : 42 - 50
[13] Li SY, 2023, AAAI CONF ARTIF INTE, P1405
[14] Distinctive image features from scale-invariant keypoints
Lowe, DG
[J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2004, 60 (02) : 91 - 110
[15] Luo H., 2021, arXiv
[16] Bag of Tricks and A Strong Baseline for Deep Person Re-identification
Luo, Hao
Gu, Youzhi
Liao, Xingyu
Lai, Shenqi
Jiang, Wei
[J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2019), 2019, : 1487 - 1495
[17] Sarker P. K., 2024, IEEE Transactions on Intelligent Vehicles
[18] Beyond Part Models: Person Retrieval with Refined Part Pooling (and A Strong Convolutional Baseline)
Sun, Yifan
Zheng, Liang
Yang, Yi
Tian, Qi
Wang, Shengjin
[J]. COMPUTER VISION - ECCV 2018, PT IV, 2018, 11208 : 501 - 518
[19] Touvron H, 2021, PR MACH LEARN RES, V139, P7358
[20] Vaswani A, 2017, ADV NEUR IN, V30

← 1 2 3 →