Local-enhanced representation for text-based person search

被引:1
作者
Zhang, Guoqing [1 ,2 ]
Chen, Yuhao [1 ]
Zheng, Yuhui [1 ]
Martin, Gaven [3 ]
Wang, Ruili [2 ,4 ]
机构
[1] Nanjing Univ Informat Sci & Technol, Sch Comp Sci, Nanjing, Peoples R China
[2] Massey Univ, Sch Math & Computat Sci, Auckland, New Zealand
[3] Massey Univ, Inst Adv Study, Auckland, New Zealand
[4] Univ Nottingham Ningbo China, Sch Comp Sci, Ningbo, Peoples R China
基金
中国国家自然科学基金;
关键词
Person re-identification; Cross-modal retrieval; Local representation;
D O I
10.1016/j.patcog.2024.111247
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Text-based person search is a critical task in intelligent security, designed to locate a person of interest by text descriptions. The primary challenge in this task is to effectively bridge the significant gap between the text and image domains while simultaneously extracting the discriminative features that are crucial for the accurate identification of individuals. Existing methods have made some effective attempts by conducting cross-modal matching at the fine-grained representation level. However, these approaches frequently overlook two crucial factors: (i) the presence of noise in the local features during information fusion, and (ii) the lack of intra-modal matching when measuring feature similarity. To address the above issues, we propose a novel local- enhanced representation framework in this paper. Specifically, to restrain noises in local features, we design a Relation-based cross-modal local-enhanced fusion module, which can filter out weak related information by relation assessment. In addition, we explore an intra-cross modal projection strategy to overcome the limitations of existing cross-modal projection methods. This strategy jointly applies the intra-modal and cross- modal matching constrains in feature distribution. Finally, experiments on three mainstream datasets verify the performance superiority of our proposed method compared to existing state-of-the-art methods.
引用
收藏
页数:12
相关论文
共 50 条
  • [11] Fine-grained semantic oriented embedding set alignment for text-based person search
    Zhao, Jiaqi
    Fu, Ao
    Zhou, Yong
    Du, Wen-liang
    Yao, Rui
    IMAGE AND VISION COMPUTING, 2024, 152
  • [12] MACA: Memory-aided Coarse-to-fine Alignment for Text-based Person Search
    Su, Liangxu
    Quan, Rong
    Qi, Zhiyuan
    Qin, Jie
    PROCEEDINGS OF THE 47TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2024, 2024, : 2497 - 2501
  • [13] SUM: Serialized Updating and Matching for text-based person retrieval
    Wang, Zijie
    Zhu, Aichun
    Xue, Jingyi
    Jiang, Daihong
    Liu, Chao
    Li, Yifeng
    Hu, Fangqiang
    KNOWLEDGE-BASED SYSTEMS, 2022, 248
  • [14] DSSL: Deep Surroundings-person Separation Learning for Text-based Person Retrieval
    Zhu, Aichun
    Wang, Zijie
    Li, Yifeng
    Wan, Xili
    Jin, Jing
    Wang, Tian
    Hu, Fangqiang
    Hua, Gang
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 209 - 217
  • [15] Text-Based Person re-ID by Saliency Mask and Dynamic Label Smoothing
    Pang, Yonghua
    Zhang, Canlong
    Li, Zhixin
    Hu, Liaojie
    NEURAL INFORMATION PROCESSING, ICONIP 2023, PT V, 2024, 14451 : 443 - 454
  • [16] Parallel Data Augmentation for Text-based Person Re-identification
    Cai, Han-Qing
    Li, Xin
    Ji, Yi
    Li, Ying
    Liu, Chun-Ping
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [17] Modal Complementarity Based on Multimodal Large Language Model for Text-Based Person Retrieval
    Bao, Tong
    Xu, Tong
    Xu, Derong
    Zheng, Zhi
    WEB AND BIG DATA, APWEB-WAIM 2024, PT I, 2024, 14961 : 264 - 279
  • [18] MINING FALSE POSITIVE EXAMPLES FOR TEXT-BASED PERSON RE-IDENTIFICATION
    Xu, Wenhao
    Shao, Zhiyin
    Ding, Changxing
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 1680 - 1684
  • [19] EESSO: Exploiting Extreme and Smooth Signals via Omni-frequency learning for Text-based Person Retrieval
    Xue, Jingyi
    Wang, Zijie
    Dong, Guan-Nan
    Zhu, Aichun
    IMAGE AND VISION COMPUTING, 2024, 142
  • [20] FedSH: Towards Privacy-Preserving Text-Based Person Re-Identification
    Ma, Wentao
    Wu, Xinyi
    Zhao, Shan
    Zhou, Tongqing
    Guo, Dan
    Gu, Lichuan
    Cai, Zhiping
    Wang, Meng
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 5065 - 5077