Learning Scene-Pedestrian Graph for End-to-End Person Search

被引:4
作者
Song, Zifan [1 ]
Zhao, Cairong [1 ]
Hu, Guosheng [2 ]
Miao, Duoqian [1 ]
机构
[1] Tongji Univ, Dept Comp Sci & Technol, Shanghai 200092, Peoples R China
[2] Oosto, Belfast BT3 9DT, North Ireland
关键词
Pedestrians; Feature extraction; Head; Task analysis; Informatics; Image edge detection; Cameras; Deep learning; graph neural networks; identification of persons; machine vision;
D O I
10.1109/TII.2023.3298473
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Person search aims to find specific persons from visual scenes, including two subtasks, pedestrian detection, and person reidentification. The dominant fashion in this area is end-to-end networks that focus on analyzing the foreground (i.e., pedestrian) while ignoring the background (i.e., scene) information. However, the scene information often offers useful clues for person search. For example, pedestrians normally appear on the road rather than the top of a tree, and pedestrians appearing at the same location are likely to have similar occlusions. The interplay between the pedestrians and scenes can potentially improve the performance. In this article, a novel scene-pedestrian graph (SPG) is proposed, which can explicitly model the interplay between the pedestrians and scenes. To polish the quality of pedestrian bounding boxes, we pioneer a strategy of using the high-quality pedestrian bounding box to guide the low-quality one in the same scene. In addition, we design a contextual and temporal graph matching algorithm to effectively utilize the contextual and temporal information present in the constructed SPG to improve the performance of pedestrian matching. Benefiting from the robustness on complex scenes, our model achieves promising performance over the state-of-the-art methods on two popular person search benchmarks, CUHK-SYSU and PRW.
引用
收藏
页码:2979 / 2990
页数:12
相关论文
共 42 条
[1]  
Bruna J., 2014, INT C LEARN REPR ICL
[2]   RCAA: Relational Context-Aware Agents for Person Search [J].
Chang, Xiaojun ;
Huang, Po-Yao ;
Shen, Yi-Dong ;
Liang, Xiaodan ;
Yang, Yi ;
Hauptmann, Alexander G. .
COMPUTER VISION - ECCV 2018, PT IX, 2018, 11213 :86-102
[3]   Cross-Modal Retrieval with Heterogeneous Graph Embedding [J].
Chen, Dapeng ;
Wang, Min ;
Chen, Haobin ;
Wu, Lin ;
Qin, Jing ;
Peng, Wei .
PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, :3291-3300
[4]  
Chen D, 2020, AAAI CONF ARTIF INTE, V34, P10518
[5]   Person Search via a Mask-Guided Two-Stream CNN Model [J].
Chen, Di ;
Zhang, Shanshan ;
Ouyang, Wanli ;
Yang, Jian ;
Tai, Ying .
COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 :764-781
[6]   Norm-Aware Embedding for Efficient Person Search [J].
Chen, Di ;
Zhang, Shanshan ;
Yang, Jian ;
Schiele, Bernt .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :12612-12621
[7]  
Chung F.R.K., 1997, Spectral graph theory, DOI DOI 10.1090/CBMS/092
[8]   Bi-directional Interaction Network for Person Search [J].
Dong, Wenkai ;
Zhang, Zhaoxiang ;
Song, Chunfeng ;
Tan, Tieniu .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :2836-2845
[9]   Instance Guided Proposal Network for Person Search [J].
Dong, Wenkai ;
Zhang, Zhaoxiang ;
Song, Chunfeng ;
Tan, Tieniu .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :2582-2591
[10]   Structure-aware person search with self-attention and online instance aggregation matching [J].
Gao, Cunyuan ;
Yao, Rui ;
Zhao, Jiaqi ;
Zhou, Yong ;
Hu, Fuyuan ;
Li, Leida .
NEUROCOMPUTING, 2019, 369 :29-38