Joint graph regularized dictionary learning and sparse ranking for multi-modal multi-shot person re-identification

被引：6

作者：

Zheng, Aihua ^{[1
]}

Li, Hongchao ^{[1
]}

Jiang, Bo ^{[1
,2
]}

Zheng, Wei-Shi ^{[3
]}

Luo, Bin ^{[1
]}

机构：

[1] Anhui Univ, Sch Comp Sci & Technol, Key Lab Intelligent Comp & Signal Proc, Minist Educ, Hefei, Peoples R China

[2] Anhui Univ, Inst Phys Sci & Informat Technol, Hefei, Peoples R China

[3] Sun Yat Sen Univ, Sch Data & Comp Sci, Guangzhou, Peoples R China

来源：

PATTERN RECOGNITION | 2020年 / 104卷 / 104期

基金：

中国国家自然科学基金;

关键词：

Person re-identification; Sparse ranking; Joint graph regularization; RECOGNITION;

D O I：

10.1016/j.patcog.2020.107352

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The promising achievement of sparse ranking in image-based recognition gives rise to a number of development on person re-identification (Re-ID) which aims to reconstruct the probe as a linear combination of few atoms/images from an over-complete dictionary/gallery. However, most of the existing sparse ranking based Re-ID methods lack considering the geometric relationships between probe, gallery, and cross-modal images of the same person in multi-shot Re-ID. In this paper, we propose a novel joint graph regularized dictionary learning and sparse ranking method for multi-modal multi-shot person Re-ID. First, we explore the probe-based geometrical structure by enforcing the smoothness between the codings/coefficients, which refers to the multi-shot images from the same person in probe. Second, we explore the gallery-based geometrical structure among gallery images, which encourages the multi-shot images from the same person in the gallery making similar contributions while reconstructing a certain probe image. Third, we explore the cross-modal geometrical structure by enforcing the smoothness between the cross-modal images and thus extend our model for the multi-modal case. Finally, we design an APG based optimization to solve the problem. Comprehensive experiments on benchmark datasets demonstrate the superior performance of the proposed model. The code is available at https://github.com/ttaalle/Lhc. (C) 2020 Elsevier Ltd. All rights reserved.

引用

页数：13

共 59 条

[1] K-SVD: An algorithm for designing overcomplete dictionaries for sparse representation [J].

Aharon, Michal ;

Elad, Michael ;

Bruckstein, Alfred .

IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2006, 54 (11) :4311-4322

[2]

[Anonymous], 2017, CVPR

[3]

Bak Slawomir, 2010, Proceedings 7th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS 2010), P435, DOI 10.1109/AVSS.2010.34

[4]

Bak S., 2011, Proceedings of the 2011 8th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS 2011), P179, DOI 10.1109/AVSS.2011.6027316

[5]

Bak S, 2012, LECT NOTES COMPUT SC, V7574, P806, DOI 10.1007/978-3-642-33712-3_58

[6]

Baltieri D., 2011, P 2011 JOINT ACM WOR, P59, DOI DOI 10.1145/2072572.2072590

[7]

Barbosa IB, 2012, LECT NOTES COMPUT SC, V7583, P433, DOI 10.1007/978-3-642-33863-2_43

[8]

Bazzani Loris, 2010, Proceedings of the 2010 20th International Conference on Pattern Recognition (ICPR 2010), P1413, DOI 10.1109/ICPR.2010.349

[9] Multiple-shot person re-identification by chromatic and epitomic analyses [J].

Bazzani, Loris ;

Cristani, Marco ;

Perina, Alessandro ;

Murino, Vittorio .

PATTERN RECOGNITION LETTERS, 2012, 33 (07) :898-903

[10] An Asymmetric Distance Model for Cross-View Feature Mapping in Person Reidentification [J].

Chen, Ying-Cong ;

Zheng, Wei-Shi ;

Lai, Jian-Huang ;

Yuen, Pong C. .

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2017, 27 (08) :1661-1675

← 1 2 3 4 5 6 →