Learning Multi-Attention Context Graph for Group-Based Re-Identification

被引：39

作者：

Yan, Yichao ^{[1
]}

Qin, Jie ^{[1
]}

Ni, Bingbing ^{[2
]}

Chen, Jiaxin ^{[1
]}

Liu, Li ^{[1
]}

Zhu, Fan ^{[1
]}

Zheng, Wei-Shi ^{[3
,4
]}

Yang, Xiaokang ^{[2
,5
]}

Shao, Ling ^{[1
,6
]}

机构：

[1] Incept Inst Artificial Intelligence, Abu Dhabi 112040, U Arab Emirates

[2] Shanghai Jiao Tong Univ, Shanghai 200240, Peoples R China

[3] Sun Yat Sen Univ, Sch Data & Comp Sci, Guangzhou 510275, Peoples R China

[4] Peng Cheng Lab, Shenzhen 518005, Peoples R China

[5] Shanghai Jiao Tong Univ, AI Inst, MoE Key Lab Artificial Intelligence, Shanghai 200240, Peoples R China

[6] Mohamed Bin Zayed Univ Artificial Intelligence, Abu Dhabi, U Arab Emirates

来源：

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE | 2023年 / 45卷 / 06期

关键词：

Task analysis; Deep learning; Measurement; Visualization; Context modeling; Layout; Group re-identification; person re-identification; context learning; graph neural networks; PERSON REIDENTIFICATION; NEURAL-NETWORK;

D O I：

10.1109/TPAMI.2020.3032542

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Learning to re-identify or retrieve a group of people across non-overlapped camera systems has important applications in video surveillance. However, most existing methods focus on (single) person re-identification (re-id), ignoring the fact that people often walk in groups in real scenarios. In this work, we take a step further and consider employing context information for identifying groups of people, i.e., group re-id. On the one hand, group re-id is more challenging than single person re-id, since it requires both a robust modeling of local individual person appearance (with different illumination conditions, pose/viewpoint variations, and occlusions), as well as full awareness of global group structures (with group layout and group member variations). On the other hand, we believe that person re-id can be greatly enhanced by incorporating additional visual context from neighboring group members, a task which we formulate as group-aware (single) person re-id. In this paper, we propose a novel unified framework based on graph neural networks to simultaneously address the above two group-based re-id tasks, i.e., group re-id and group-aware person re-id. Specifically, we construct a context graph with group members as its nodes to exploit dependencies among different people. A multi-level attention mechanism is developed to formulate both intra-group and inter-group context, with an additional self-attention module for robust graph-level representations by attentively aggregating node-level features. The proposed model can be directly generalized to tackle group-aware person re-id using node-level representations. Meanwhile, to facilitate the deployment of deep learning models on these tasks, we build a new group re-id dataset which contains more than 3:8K images with 1:5K annotated groups, an order of magnitude larger than existing group re-id datasets. Extensive experiments on the novel dataset as well as three existing datasets clearly demonstrate the effectiveness of the proposed framework for both group-based re-id tasks.

引用

页码：7001 / 7018

页数：18

共 107 条

[1]

Abu-El-Haija S, 2018, Arxiv, DOI arXiv:1710.09599

[2]

Ahmed E, 2015, PROC CVPR IEEE, P3908, DOI 10.1109/CVPR.2015.7299016

[3] Human Re-identification in Crowd Videos Using Personal, Social and Environmental Constraints [J].

Assari, Shayan Modiri ;

Idrees, Haroon ;

Shah, Mubarak .

COMPUTER VISION - ECCV 2016, PT II, 2016, 9906 :119-136

[4]

Ba J., 2015, PROC 3 INT C LEARN R

[5]

Bahdanau D, 2016, Arxiv, DOI [arXiv:1409.0473, 10.48550/arXiv.1409.0473]

[6]

Bai S, 2020, Arxiv, DOI arXiv:1901.10650

[7] Hypergraph convolution and hypergraph attention [J].

Bai, Song ;

Zhang, Feihu ;

Torr, Philip H. S. .

PATTERN RECOGNITION, 2021, 110

[8] Re-ranking via Metric Fusion for Object Retrieval and Person Re-identification [J].

Bai, Song ;

Tang, Peng ;

Torr, Philip H. S. ;

Latecki, Longin Jan .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :740-749

[9] Scalable Person Re-identification on Supervised Smoothed Manifold [J].

Bai, Song ;

Bai, Xiang ;

Tian, Qi .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :3356-3365

[10] Part-based spatio-temporal model for multi-person re-identification [J].

Bedagkar-Gala, A. ;

Shah, Shishir K. .

PATTERN RECOGNITION LETTERS, 2012, 33 (14) :1908-1915

← 1 2 3 4 5 6 7 8 9 10 →