Local-enhanced representation for text-based person search

Cited by: 1
Authors
Zhang, Guoqing [1, 2]
Chen, Yuhao [1]
Zheng, Yuhui [1]
Martin, Gaven [3]
Wang, Ruili [2, 4]
Affiliations
[1] Nanjing Univ Informat Sci & Technol, Sch Comp Sci, Nanjing, Peoples R China
[2] Massey Univ, Sch Math & Computat Sci, Auckland, New Zealand
[3] Massey Univ, Inst Adv Study, Auckland, New Zealand
[4] Univ Nottingham Ningbo China, Sch Comp Sci, Ningbo, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Person re-identification; Cross-modal retrieval; Local representation;
DOI
10.1016/j.patcog.2024.111247
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Text-based person search is a critical task in intelligent security that aims to locate a person of interest from a text description. The primary challenge in this task is to bridge the significant gap between the text and image domains while simultaneously extracting the discriminative features that are crucial for accurately identifying individuals. Existing methods have made some effective attempts by conducting cross-modal matching at the fine-grained representation level. However, these approaches frequently overlook two crucial factors: (i) the presence of noise in the local features during information fusion, and (ii) the lack of intra-modal matching when measuring feature similarity. To address these issues, we propose a novel local-enhanced representation framework. Specifically, to suppress noise in local features, we design a relation-based cross-modal local-enhanced fusion module, which filters out weakly related information through relation assessment. In addition, we explore an intra-cross modal projection strategy to overcome the limitations of existing cross-modal projection methods. This strategy jointly applies intra-modal and cross-modal matching constraints to the feature distribution. Finally, experiments on three mainstream datasets verify the superior performance of the proposed method compared with existing state-of-the-art methods.
Pages: 12