Weakly Supervised Text-based Person Re-Identification

Cited by: 16
Authors
Zhao, Shizhen [1]
Gao, Changxin [1]
Shao, Yuanjie [1]
Zheng, Wei-Shi [2]
Sang, Nong [1]
Affiliations
[1] Huazhong Univ Sci & Technol, Sch Artificial Intelligence & Automat, Key Lab Image Proc & Intelligent Control, Wuhan, Peoples R China
[2] Sun Yat Sen Univ, Guangzhou, Peoples R China
Source
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021) | 2021
Funding
National Natural Science Foundation of China
DOI
10.1109/ICCV48922.2021.01120
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Conventional text-based person re-identification methods rely heavily on identity annotations. However, this labeling process is costly and time-consuming. In this paper, we consider a more practical setting called weakly supervised text-based person re-identification, where only text-image pairs are available and no identity annotations are required during the training phase. To this end, we propose a Cross-Modal Mutual Training (CMMT) framework. Specifically, to alleviate intra-class variations, a clustering method is used to generate pseudo labels for both visual and textual instances. To further refine the clustering results, CMMT provides a Mutual Pseudo Label Refinement module, which uses the clustering results in one modality to refine those in the other modality, constrained by the text-image pairwise relationship. Meanwhile, CMMT introduces a Text-IoU Guided Cross-Modal Projection Matching loss to resolve the cross-modal matching ambiguity problem. A Text-IoU Guided Hard Sample Mining method is also proposed for learning discriminative textual-visual joint embeddings. We conduct extensive experiments to demonstrate the effectiveness of the proposed CMMT, and the results show that CMMT performs favorably against existing text-based person re-identification methods. Our code will be available at https://github.com/X-BrainLab/WS_Text-ReID.
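The abstract names a Text-IoU quantity but does not define it. As a rough illustration only, the sketch below assumes Text-IoU is the intersection-over-union (Jaccard overlap) of the word sets of two descriptions, and that it is used to build soft matching targets between a description and the other descriptions in a batch. The function names `text_iou` and `soft_matching_targets`, the whitespace tokenization, and the row normalization are all hypothetical choices made for this example; they are not taken from the paper or its code.

```python
from typing import List


def text_iou(desc_a: str, desc_b: str) -> float:
    """Jaccard overlap of the lowercase word sets of two descriptions.

    Assumption: words are split on whitespace. The paper may compare
    different units (for example phrases) instead of single words.
    """
    words_a = set(desc_a.lower().split())
    words_b = set(desc_b.lower().split())
    union = words_a | words_b
    if not union:
        return 0.0
    return len(words_a & words_b) / len(union)


def soft_matching_targets(descriptions: List[str]) -> List[List[float]]:
    """Build a row-normalized soft target matrix for a batch.

    Entry (i, j) is 1 for the paired description (i == j) and the
    Text-IoU with description j otherwise, then each row is scaled
    to sum to 1. This is a hypothetical use of Text-IoU, not the
    loss defined in the paper.
    """
    targets = []
    for i, desc_i in enumerate(descriptions):
        row = [
            1.0 if i == j else text_iou(desc_i, desc_j)
            for j, desc_j in enumerate(descriptions)
        ]
        total = sum(row)  # at least 1.0 because of the diagonal entry
        targets.append([value / total for value in row])
    return targets


if __name__ == "__main__":
    batch = [
        "a man in a red coat",
        "a man in a blue coat",
        "woman carrying a white bag",
    ]
    for row in soft_matching_targets(batch):
        print([round(value, 3) for value in row])
```

Under these assumptions, two descriptions that share most of their words receive a non-zero target instead of being treated as a hard negative, which is one way to read the "matching ambiguity" the abstract refers to.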
Pages: 11375-11384
Page count: 10
Related papers
50 in total
  • [1] Image-Centered Pseudo Label Generation for Weakly Supervised Text-Based Person Re-Identification
    Nie, Weizhi
    Wu, Chengji
    Sun, Hao
    Xie, Wei
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT XII, 2025, 15042 : 477 - 491
  • [2] Weakly Supervised Person Re-Identification
    Meng, Jingke
    Wu, Sheng
    Zheng, Wei-Shi
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 760 - 769
  • [3] Parallel Data Augmentation for Text-based Person Re-identification
    Cai, Han-Qing
    Li, Xin
    Ji, Yi
    Li, Ying
    Liu, Chun-Ping
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [4] Weakly Supervised Pedestrian Segmentation for Person Re-Identification
    Jin, Ziqi
    Xie, Jinheng
    Wu, Bizhu
    Shen, Linlin
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (03) : 1349 - 1362
  • [5] MINING FALSE POSITIVE EXAMPLES FOR TEXT-BASED PERSON RE-IDENTIFICATION
    Xu, Wenhao
    Shao, Zhiyin
    Ding, Changxing
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 1680 - 1684
  • [6] BiLMa: Bidirectional Local-Matching for Text-based Person Re-identification
    Fujii, Takuro
    Tarashima, Shuhei
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW, 2023, : 2778 - 2782
  • [7] Resource-efficient Text-based Person Re-identification on Embedded Devices
    Agyeman, Rockson
    Rinner, Bernhard
    2024 20TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING IN SMART SYSTEMS AND THE INTERNET OF THINGS, DCOSS-IOT 2024, 2024, : 84 - 92
  • [8] FedSH: Towards Privacy-Preserving Text-Based Person Re-Identification
    Ma, Wentao
    Wu, Xinyi
    Zhao, Shan
    Zhou, Tongqing
    Guo, Dan
    Gu, Lichuan
    Cai, Zhiping
    Wang, Meng
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 5065 - 5077
  • [9] Decentralized Text-Based Person Re-Identification in Multi-Camera Networks
    Agyeman, Rockson
    Rinner, Bernhard
    IEEE ACCESS, 2024, 12 : 172125 - 172148
  • [10] From attributes to natural language: A survey and foresight on text-based person re-identification
    Jiang, Fanzhi
    Yang, Su
    Jones, Mark W.
    Zhang, Liumei
    INFORMATION FUSION, 2025, 118