An Overview of Text-Based Person Search: Recent Advances and Future Directions

被引:1
作者
Niu, Kai [1 ,2 ]
Liu, Yanyi [1 ]
Long, Yuzhou [1 ]
Huang, Yan [3 ]
Wang, Liang [3 ]
Zhang, Yanning [1 ]
机构
[1] Northwestern Polytech Univ, Sch Comp Sci, Natl Engn Lab Integrated AeroSpace Ground Ocean B, Xian 710072, Peoples R China
[2] Northwestern Polytech Univ Shenzhen, Inst Res & Dev, Shenzhen 518063, Peoples R China
[3] Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100190, Peoples R China
基金
中国国家自然科学基金;
关键词
Text-based person search; cross-modal retrieval; video surveillance; feature extraction; semantic alignments; NEURAL-NETWORK; ATTENTION NETWORK; IMAGE; TRANSFORMER;
D O I
10.1109/TCSVT.2024.3376373
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Due to the practical significance in smart video surveillance systems, Text-Based Person Search (TBPS) has been one of the research hotspots recently, which refers to searching for the interested pedestrian images given natural language sentences. To help researchers quickly grasp the developments of this important task, we comprehensively summarize the recent research advances of TBPS from two perspectives, i.e., Feature Extraction (FE) and Semantic Alignments (SA). Specifically, the FE mainly consists of pre-processing approaches and end-to-end frameworks, and the SA could be briefly divided into cross-modal attention mechanism, non-attention alignments, training objectives, and generative approaches. Afterwards, we elaborate four widely-used benchmarks and also the evaluation criterion for TBPS. And comparisons and analyses among the state-of-the-art (SOTA) solutions are provided based on these large-scale benchmarks. At last, we point out some future research directions that need to be further addressed, which will greatly facilitate the practical applications of TBPS.
引用
收藏
页码:7803 / 7819
页数:17
相关论文
共 50 条
[41]   From attributes to natural language: A survey and foresight on text-based person re-identification [J].
Jiang, Fanzhi ;
Yang, Su ;
Jones, Mark W. ;
Zhang, Liumei .
INFORMATION FUSION, 2025, 118
[42]   A review of consumer affinity research: recent advances and future directions [J].
Mar Serrano-Arcos, M. ;
Sanchez-Fernandez, Raquel ;
Carlos Perez-Mesa, Juan ;
Riefler, Petra .
INTERNATIONAL MARKETING REVIEW, 2022, 39 (05) :1252-1282
[43]   Diverse Co-Saliency Feature Learning for Text-Based Person Retrieval [J].
You, Shuai ;
Chen, Cuiqun ;
Feng, Yujian ;
Liu, Hai ;
Ji, Yimu ;
Ye, Mang .
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2025, 20 :5465-5477
[44]   Generative models for protein sequence modeling: recent advances and future directions [J].
Mardikoraem, Mehrsa ;
Wang, Zirui ;
Pascual, Nathaniel ;
Woldring, Daniel .
BRIEFINGS IN BIOINFORMATICS, 2023, 24 (06)
[45]   FedSH: Towards Privacy-Preserving Text-Based Person Re-Identification [J].
Ma, Wentao ;
Wu, Xinyi ;
Zhao, Shan ;
Zhou, Tongqing ;
Guo, Dan ;
Gu, Lichuan ;
Cai, Zhiping ;
Wang, Meng .
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 :5065-5077
[46]   Decentralized Text-Based Person Re-Identification in Multi-Camera Networks [J].
Agyeman, Rockson ;
Rinner, Bernhard .
IEEE ACCESS, 2024, 12 :172125-172148
[47]   Fine-grained Semantics-aware Representation Learning for Text-based Person Retrieval [J].
Wang, Di ;
Yan, Feng ;
Wang, Yifeng ;
Zhao, Lin ;
Liang, Xiao ;
Zhong, Haodi ;
Zhang, Ronghua .
PROCEEDINGS OF THE 4TH ANNUAL ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2024, 2024, :92-100
[48]   Deep Learning for Forecasting-Based Applications in Cyber-Physical Microgrids: Recent Advances and Future Directions [J].
Habibi, Mohammad Reza ;
Golestan, Saeed ;
Guerrero, Josep M. M. ;
Vasquez, Juan C. C. .
ELECTRONICS, 2023, 12 (07)
[49]   Hybrid POF-VLC Systems: Recent Advances, Challenges, Opportunities, and Future Directions [J].
Abdallah, Rola ;
Atef, Mohamed ;
Saeed, Nasir .
IEEE OPEN JOURNAL OF THE COMPUTER SOCIETY, 2025, 6 :317-335
[50]   Artificial intelligence in the prediction of protein-ligand interactions: recent advances and future directions [J].
Dhakal, Ashwin ;
McKay, Cole ;
Tanner, John J. ;
Cheng, Jianlin .
BRIEFINGS IN BIOINFORMATICS, 2022, 23 (01)