Weakly Supervised Text-based Person Re-Identification

Cited by: 16

Authors:

Zhao, Shizhen [1]

Gao, Changxin [1]

Shao, Yuanjie [1]

Zheng, Wei-Shi [2]

Sang, Nong [1]

Affiliations:

[1] Huazhong Univ Sci & Technol, Sch Artificial Intelligence & Automat, Key Lab Image Proc & Intelligent Control, Wuhan, Peoples R China

[2] Sun Yat Sen Univ, Guangzhou, Peoples R China

Source:

2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021) | 2021

Funding:

National Natural Science Foundation of China;

Keywords:

DOI:

10.1109/ICCV48922.2021.01120

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

The conventional text-based person re-identification methods heavily rely on identity annotations. However, this labeling process is costly and time-consuming. In this paper, we consider a more practical setting called weakly supervised text-based person re-identification, where only the text-image pairs are available without the requirement of annotating identities during the training phase. To this end, we propose a Cross-Modal Mutual Training (CMMT) framework. Specifically, to alleviate the intra-class variations, a clustering method is utilized to generate pseudo labels for both visual and textual instances. To further refine the clustering results, CMMT provides a Mutual Pseudo Label Refinement module, which leverages the clustering results in one modality to refine that in the other modality constrained by the text-image pairwise relationship. Meanwhile, CMMT introduces a Text-IoU Guided Cross-Modal Projection Matching loss to resolve the cross-modal matching ambiguity problem. A Text-IoU Guided Hard Sample Mining method is also proposed for learning discriminative textual-visual joint embeddings. We conduct extensive experiments to demonstrate the effectiveness of the proposed CMMT, and the results show that CMMT performs favorably against existing text-based person re-identification methods. Our code will be available at https://github.com/X-BrainLab/WS_Text-ReID.
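The abstract names a Text-IoU score but does not define it. As a rough illustration only, the sketch below assumes Text-IoU can be approximated by the intersection-over-union (Jaccard overlap) of the word sets of two captions; the function name `text_iou` and the whitespace tokenization are hypothetical choices, not the authors' implementation.

```python
def text_iou(caption_a: str, caption_b: str) -> float:
    """Jaccard overlap between the lowercase word sets of two captions.

    Assumption: this stands in for the paper's Text-IoU, whose exact
    definition is not given in the abstract.
    """
    words_a = set(caption_a.lower().split())
    words_b = set(caption_b.lower().split())
    union = words_a | words_b
    if not union:
        # Two empty captions share nothing measurable.
        return 0.0
    return len(words_a & words_b) / len(union)


# Two captions that share the words "a", "in", and "red".
score = text_iou("a man in a red jacket", "a woman in a red coat")
```

Under this assumption, a high overlap between the captions of two different images would suggest they may describe the same person, which is the kind of signal a matching loss or hard sample mining step could use as a soft weight.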

Pages: 11375-11384

Number of pages: 10
