Local-enhanced representation for text-based person search

Cited by: 1
Authors
Zhang, Guoqing [1, 2]
Chen, Yuhao [1]
Zheng, Yuhui [1]
Martin, Gaven [3]
Wang, Ruili [2, 4]
Affiliations
[1] Nanjing Univ Informat Sci & Technol, Sch Comp Sci, Nanjing, Peoples R China
[2] Massey Univ, Sch Math & Computat Sci, Auckland, New Zealand
[3] Massey Univ, Inst Adv Study, Auckland, New Zealand
[4] Univ Nottingham Ningbo China, Sch Comp Sci, Ningbo, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Person re-identification; Cross-modal retrieval; Local representation
DOI
10.1016/j.patcog.2024.111247
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Text-based person search is a critical task in intelligent security that aims to locate a person of interest from text descriptions. The primary challenge is to bridge the significant gap between the text and image domains while extracting the discriminative features needed to identify individuals accurately. Existing methods have made effective attempts by conducting cross-modal matching at the fine-grained representation level. However, these approaches frequently overlook two crucial factors: (i) the presence of noise in the local features during information fusion, and (ii) the lack of intra-modal matching when measuring feature similarity. To address these issues, we propose a novel local-enhanced representation framework. Specifically, to suppress noise in local features, we design a relation-based cross-modal local-enhanced fusion module, which filters out weakly related information through relation assessment. In addition, we explore an intra-cross modal projection strategy to overcome the limitations of existing cross-modal projection methods. This strategy jointly applies intra-modal and cross-modal matching constraints to the feature distribution. Finally, experiments on three mainstream datasets verify that the proposed method outperforms existing state-of-the-art methods.
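The abstract does not give the loss formulation, so the following is only a minimal sketch of what jointly applying intra-modal and cross-modal projection matching could look like. It assumes a projection-matching loss in the style of earlier cross-modal projection work: each feature is projected onto the normalized features of the batch, a softmax turns the projections into match probabilities, and a KL divergence compares them with label-based targets. The function names, the `intra_weight` parameter, and the choice to simply sum the four terms are hypothetical and are not taken from the paper.

```python
import math


def projection_matching_loss(a, b, labels, eps=1e-8):
    """Mean KL(p || q) over rows of `a`, projected onto unit-normalized rows of `b`.

    `a` and `b` are equal-length lists of feature vectors; `labels[i]` is the
    identity of the i-th item in both lists (assumed batch layout).
    """
    # Normalize rows of b so that a_i . b_j is the scalar projection of a_i onto b_j.
    b_unit = []
    for row in b:
        norm = math.sqrt(sum(v * v for v in row)) + eps
        b_unit.append([v / norm for v in row])

    total = 0.0
    for i, a_i in enumerate(a):
        logits = [sum(x * y for x, y in zip(a_i, b_j)) for b_j in b_unit]
        top = max(logits)  # subtract the max for a numerically stable softmax
        exps = [math.exp(l - top) for l in logits]
        z = sum(exps)
        p = [e / z for e in exps]
        # Target distribution: uniform over batch items sharing the same identity.
        match = [1.0 if labels[i] == lab else 0.0 for lab in labels]
        m = sum(match)
        q = [v / m for v in match]
        total += sum(pj * (math.log(pj + eps) - math.log(qj + eps))
                     for pj, qj in zip(p, q))
    return total / len(a)


def intra_cross_projection_loss(img, txt, labels, intra_weight=1.0):
    """Hypothetical combination: cross-modal terms plus weighted intra-modal terms."""
    cross = (projection_matching_loss(img, txt, labels)
             + projection_matching_loss(txt, img, labels))
    intra = (projection_matching_loss(img, img, labels)
             + projection_matching_loss(txt, txt, labels))
    return cross + intra_weight * intra


if __name__ == "__main__":
    # Toy batch: four identities with orthogonal features in both modalities.
    img = [[10.0 if i == j else 0.0 for j in range(4)] for i in range(4)]
    txt = [row[:] for row in img]
    labels = [0, 1, 2, 3]
    aligned = intra_cross_projection_loss(img, txt, labels)
    shuffled = intra_cross_projection_loss(img, txt[1:] + txt[:1], labels)
    print(aligned < shuffled)
```

In this toy batch the loss is lower when each text feature lines up with the image feature of the same identity than when the text features are shuffled, which is the behavior a matching constraint of this kind is meant to encourage.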
Pages: 12