Learning Multi-Attention Context Graph for Group-Based Re-Identification

被引:39
作者
Yan, Yichao [1 ]
Qin, Jie [1 ]
Ni, Bingbing [2 ]
Chen, Jiaxin [1 ]
Liu, Li [1 ]
Zhu, Fan [1 ]
Zheng, Wei-Shi [3 ,4 ]
Yang, Xiaokang [2 ,5 ]
Shao, Ling [1 ,6 ]
机构
[1] Incept Inst Artificial Intelligence, Abu Dhabi 112040, U Arab Emirates
[2] Shanghai Jiao Tong Univ, Shanghai 200240, Peoples R China
[3] Sun Yat Sen Univ, Sch Data & Comp Sci, Guangzhou 510275, Peoples R China
[4] Peng Cheng Lab, Shenzhen 518005, Peoples R China
[5] Shanghai Jiao Tong Univ, AI Inst, MoE Key Lab Artificial Intelligence, Shanghai 200240, Peoples R China
[6] Mohamed Bin Zayed Univ Artificial Intelligence, Abu Dhabi, U Arab Emirates
关键词
Task analysis; Deep learning; Measurement; Visualization; Context modeling; Layout; Group re-identification; person re-identification; context learning; graph neural networks; PERSON REIDENTIFICATION; NEURAL-NETWORK;
D O I
10.1109/TPAMI.2020.3032542
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Learning to re-identify or retrieve a group of people across non-overlapped camera systems has important applications in video surveillance. However, most existing methods focus on (single) person re-identification (re-id), ignoring the fact that people often walk in groups in real scenarios. In this work, we take a step further and consider employing context information for identifying groups of people, i.e., group re-id. On the one hand, group re-id is more challenging than single person re-id, since it requires both a robust modeling of local individual person appearance (with different illumination conditions, pose/viewpoint variations, and occlusions), as well as full awareness of global group structures (with group layout and group member variations). On the other hand, we believe that person re-id can be greatly enhanced by incorporating additional visual context from neighboring group members, a task which we formulate as group-aware (single) person re-id. In this paper, we propose a novel unified framework based on graph neural networks to simultaneously address the above two group-based re-id tasks, i.e., group re-id and group-aware person re-id. Specifically, we construct a context graph with group members as its nodes to exploit dependencies among different people. A multi-level attention mechanism is developed to formulate both intra-group and inter-group context, with an additional self-attention module for robust graph-level representations by attentively aggregating node-level features. The proposed model can be directly generalized to tackle group-aware person re-id using node-level representations. Meanwhile, to facilitate the deployment of deep learning models on these tasks, we build a new group re-id dataset which contains more than 3:8K images with 1:5K annotated groups, an order of magnitude larger than existing group re-id datasets. Extensive experiments on the novel dataset as well as three existing datasets clearly demonstrate the effectiveness of the proposed framework for both group-based re-id tasks.
引用
收藏
页码:7001 / 7018
页数:18
相关论文
共 107 条
[1]  
Abu-El-Haija S, 2018, Arxiv, DOI arXiv:1710.09599
[2]  
Ahmed E, 2015, PROC CVPR IEEE, P3908, DOI 10.1109/CVPR.2015.7299016
[3]   Human Re-identification in Crowd Videos Using Personal, Social and Environmental Constraints [J].
Assari, Shayan Modiri ;
Idrees, Haroon ;
Shah, Mubarak .
COMPUTER VISION - ECCV 2016, PT II, 2016, 9906 :119-136
[4]  
Ba J., 2015, PROC 3 INT C LEARN R
[5]  
Bahdanau D, 2016, Arxiv, DOI [arXiv:1409.0473, 10.48550/arXiv.1409.0473]
[6]  
Bai S, 2020, Arxiv, DOI arXiv:1901.10650
[7]   Hypergraph convolution and hypergraph attention [J].
Bai, Song ;
Zhang, Feihu ;
Torr, Philip H. S. .
PATTERN RECOGNITION, 2021, 110
[8]   Re-ranking via Metric Fusion for Object Retrieval and Person Re-identification [J].
Bai, Song ;
Tang, Peng ;
Torr, Philip H. S. ;
Latecki, Longin Jan .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :740-749
[9]   Scalable Person Re-identification on Supervised Smoothed Manifold [J].
Bai, Song ;
Bai, Xiang ;
Tian, Qi .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :3356-3365
[10]   Part-based spatio-temporal model for multi-person re-identification [J].
Bedagkar-Gala, A. ;
Shah, Shishir K. .
PATTERN RECOGNITION LETTERS, 2012, 33 (14) :1908-1915