Local-enhanced representation for text-based person search

被引:3
作者
Zhang, Guoqing [1 ,2 ]
Chen, Yuhao [1 ]
Zheng, Yuhui [1 ]
Martin, Gaven [3 ]
Wang, Ruili [2 ,4 ]
机构
[1] Nanjing Univ Informat Sci & Technol, Sch Comp Sci, Nanjing, Peoples R China
[2] Massey Univ, Sch Math & Computat Sci, Auckland, New Zealand
[3] Massey Univ, Inst Adv Study, Auckland, New Zealand
[4] Univ Nottingham Ningbo China, Sch Comp Sci, Ningbo, Peoples R China
基金
中国国家自然科学基金;
关键词
Person re-identification; Cross-modal retrieval; Local representation;
D O I
10.1016/j.patcog.2024.111247
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Text-based person search is a critical task in intelligent security, designed to locate a person of interest by text descriptions. The primary challenge in this task is to effectively bridge the significant gap between the text and image domains while simultaneously extracting the discriminative features that are crucial for the accurate identification of individuals. Existing methods have made some effective attempts by conducting cross-modal matching at the fine-grained representation level. However, these approaches frequently overlook two crucial factors: (i) the presence of noise in the local features during information fusion, and (ii) the lack of intra-modal matching when measuring feature similarity. To address the above issues, we propose a novel local- enhanced representation framework in this paper. Specifically, to restrain noises in local features, we design a Relation-based cross-modal local-enhanced fusion module, which can filter out weak related information by relation assessment. In addition, we explore an intra-cross modal projection strategy to overcome the limitations of existing cross-modal projection methods. This strategy jointly applies the intra-modal and cross- modal matching constrains in feature distribution. Finally, experiments on three mainstream datasets verify the performance superiority of our proposed method compared to existing state-of-the-art methods.
引用
收藏
页数:12
相关论文
共 50 条
[11]   MACA: Memory-aided Coarse-to-fine Alignment for Text-based Person Search [J].
Su, Liangxu ;
Quan, Rong ;
Qi, Zhiyuan ;
Qin, Jie .
PROCEEDINGS OF THE 47TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2024, 2024, :2497-2501
[12]   Fine-grained semantic oriented embedding set alignment for text-based person search [J].
Zhao, Jiaqi ;
Fu, Ao ;
Zhou, Yong ;
Du, Wen-liang ;
Yao, Rui .
IMAGE AND VISION COMPUTING, 2024, 152
[13]   SUM: Serialized Updating and Matching for text-based person retrieval [J].
Wang, Zijie ;
Zhu, Aichun ;
Xue, Jingyi ;
Jiang, Daihong ;
Liu, Chao ;
Li, Yifeng ;
Hu, Fangqiang .
KNOWLEDGE-BASED SYSTEMS, 2022, 248
[14]   DSSL: Deep Surroundings-person Separation Learning for Text-based Person Retrieval [J].
Zhu, Aichun ;
Wang, Zijie ;
Li, Yifeng ;
Wan, Xili ;
Jin, Jing ;
Wang, Tian ;
Hu, Fangqiang ;
Hua, Gang .
PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, :209-217
[15]   Text-Based Person re-ID by Saliency Mask and Dynamic Label Smoothing [J].
Pang, Yonghua ;
Zhang, Canlong ;
Li, Zhixin ;
Hu, Liaojie .
NEURAL INFORMATION PROCESSING, ICONIP 2023, PT V, 2024, 14451 :443-454
[16]   Parallel Data Augmentation for Text-based Person Re-identification [J].
Cai, Han-Qing ;
Li, Xin ;
Ji, Yi ;
Li, Ying ;
Liu, Chun-Ping .
2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
[17]   Modal Complementarity Based on Multimodal Large Language Model for Text-Based Person Retrieval [J].
Bao, Tong ;
Xu, Tong ;
Xu, Derong ;
Zheng, Zhi .
WEB AND BIG DATA, APWEB-WAIM 2024, PT I, 2024, 14961 :264-279
[18]   MINING FALSE POSITIVE EXAMPLES FOR TEXT-BASED PERSON RE-IDENTIFICATION [J].
Xu, Wenhao ;
Shao, Zhiyin ;
Ding, Changxing .
2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, :1680-1684
[19]   EESSO: Exploiting Extreme and Smooth Signals via Omni-frequency learning for Text-based Person Retrieval [J].
Xue, Jingyi ;
Wang, Zijie ;
Dong, Guan-Nan ;
Zhu, Aichun .
IMAGE AND VISION COMPUTING, 2024, 142
[20]   FedSH: Towards Privacy-Preserving Text-Based Person Re-Identification [J].
Ma, Wentao ;
Wu, Xinyi ;
Zhao, Shan ;
Zhou, Tongqing ;
Guo, Dan ;
Gu, Lichuan ;
Cai, Zhiping ;
Wang, Meng .
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 :5065-5077