AAGCN: Adjacency-aware Graph Convolutional Network for person re-identification

被引：27

作者：

Pan, Honghu ^{[1
]}

Bai, Yang ^{[1
]}

He, Zhenyu ^{[1
,2
]}

Zhang, Chunkai ^{[1
]}

机构：

[1] Harbin Inst Technol, Sch Comp Sci & Technol, Shenzhen 518055, Peoples R China

[2] Peng Cheng Lab, Shenzhen 518055, Peoples R China

来源：

KNOWLEDGE-BASED SYSTEMS | 2022年 / 236卷

关键词：

Person re-identification; Graph Convolutional Network; Mahalanobis distance; NEURAL-NETWORK; PERFORMANCE;

D O I：

10.1016/j.knosys.2021.107300

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Person re-identification (ReID) is an important topic of computer vision. Existing works in this field focus primarily on learning a feature extractor that maps the pedestrian images into a feature space, in which feature vectors corresponding to the same identity are close to each other. In this paper, we propose the adjacency-aware Graph Convolutional Network (AAGCN) to smooth the intra-class features and thus reduce the intra-class variance. Specifically, our AAGCN takes the features learned by a backbone as the input nodes; it first establishes the connections or adjacency relations for the intra-class features, then the adjacent nodes (i.e., the intra-class features) would be smoothed thanks to the property of low-pass filtering of Graph Convolutional Network (GCN). In this paper, we propose two methods, i.e., the Mahalanobis Neighborhood Adjacency (MNA) and Non-Linear Mapping (NLM), to learn the adjacency relations for the intra-class features. The MNA defines the adjacency weight between two nodes as the negative exponent of the Mahalanobis distance between their corresponding features, therefore it aims to learn a small Mahalanobis distance between the intra-class features and a large Mahalanobis distance between the inter-class ones. The NLM enables the non-linear mapping from the features of the nodes to their corresponding adjacency weights. The experimental results on both visible ReID and visual-infrared ReID verify the effectiveness of our method, for instance, our model achieves 95.7% rank-1 and 93.1% mAP on Market1501, as well as 58.6% rank-1 and 60.0% mAP on SYSU. (c) 2021 Published by Elsevier B.V.

引用

页数：11

共 72 条

[11]

Goldberger J., 2005, Advances in Neural Information Processing Systems (NeurIPS)

[12] Graph embedding techniques, applications, and performance: A survey [J].

Goyal, Palash ;

Ferrara, Emilio .

KNOWLEDGE-BASED SYSTEMS, 2018, 151 :78-94

[13]

Hamilton WL, 2017, ADV NEUR IN, V30

[14]

Hao Y, 2019, AAAI CONF ARTIF INTE, P8385

[15] Smart Mining for Deep Metric Learning [J].

Harwood, Ben ;

Kumar, Vijay B. G. ;

Carneiro, Gustavo ;

Reid, Ian ;

Drummond, Tom .

2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :2840-2848

[16] Deep Residual Learning for Image Recognition [J].

He, Kaiming ;

Zhang, Xiangyu ;

Ren, Shaoqing ;

Sun, Jian .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778

[17]

Hermans A., 2017, In defense of the triplet loss for person re-identification

[18]

King DB, 2015, ACS SYM SER, V1214, P1, DOI 10.1021/bk-2015-1214.ch001

[19]

Kipf T, 2016, ARXIV

[20] Embedding Expansion: Augmentation in Embedding Space for Deep Metric Learning [J].

Ko, Byungsoo ;

Gu, Geonmo .

2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :7253-7262

← 1 2 3 4 5 6 7 8 →