AAGCN: Adjacency-aware Graph Convolutional Network for person re-identification

被引:27
作者
Pan, Honghu [1 ]
Bai, Yang [1 ]
He, Zhenyu [1 ,2 ]
Zhang, Chunkai [1 ]
机构
[1] Harbin Inst Technol, Sch Comp Sci & Technol, Shenzhen 518055, Peoples R China
[2] Peng Cheng Lab, Shenzhen 518055, Peoples R China
关键词
Person re-identification; Graph Convolutional Network; Mahalanobis distance; NEURAL-NETWORK; PERFORMANCE;
D O I
10.1016/j.knosys.2021.107300
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Person re-identification (ReID) is an important topic of computer vision. Existing works in this field focus primarily on learning a feature extractor that maps the pedestrian images into a feature space, in which feature vectors corresponding to the same identity are close to each other. In this paper, we propose the adjacency-aware Graph Convolutional Network (AAGCN) to smooth the intra-class features and thus reduce the intra-class variance. Specifically, our AAGCN takes the features learned by a backbone as the input nodes; it first establishes the connections or adjacency relations for the intra-class features, then the adjacent nodes (i.e., the intra-class features) would be smoothed thanks to the property of low-pass filtering of Graph Convolutional Network (GCN). In this paper, we propose two methods, i.e., the Mahalanobis Neighborhood Adjacency (MNA) and Non-Linear Mapping (NLM), to learn the adjacency relations for the intra-class features. The MNA defines the adjacency weight between two nodes as the negative exponent of the Mahalanobis distance between their corresponding features, therefore it aims to learn a small Mahalanobis distance between the intra-class features and a large Mahalanobis distance between the inter-class ones. The NLM enables the non-linear mapping from the features of the nodes to their corresponding adjacency weights. The experimental results on both visible ReID and visual-infrared ReID verify the effectiveness of our method, for instance, our model achieves 95.7% rank-1 and 93.1% mAP on Market1501, as well as 58.6% rank-1 and 60.0% mAP on SYSU. (c) 2021 Published by Elsevier B.V.
引用
收藏
页数:11
相关论文
共 72 条
[11]  
Goldberger J., 2005, Advances in Neural Information Processing Systems (NeurIPS)
[12]   Graph embedding techniques, applications, and performance: A survey [J].
Goyal, Palash ;
Ferrara, Emilio .
KNOWLEDGE-BASED SYSTEMS, 2018, 151 :78-94
[13]  
Hamilton WL, 2017, ADV NEUR IN, V30
[14]  
Hao Y, 2019, AAAI CONF ARTIF INTE, P8385
[15]   Smart Mining for Deep Metric Learning [J].
Harwood, Ben ;
Kumar, Vijay B. G. ;
Carneiro, Gustavo ;
Reid, Ian ;
Drummond, Tom .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :2840-2848
[16]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778
[17]  
Hermans A., 2017, In defense of the triplet loss for person re-identification
[18]  
King DB, 2015, ACS SYM SER, V1214, P1, DOI 10.1021/bk-2015-1214.ch001
[19]  
Kipf T, 2016, ARXIV
[20]   Embedding Expansion: Augmentation in Embedding Space for Deep Metric Learning [J].
Ko, Byungsoo ;
Gu, Geonmo .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :7253-7262