Parameter-Efficient Person Re-Identification in the 3D Space

被引：18

作者：

Zheng, Zhedong ^{[1
]}

Wang, Xiaohan ^{[2
]}

Zheng, Nenggan ^{[2
]}

Yang, Yi ^{[2
]}

机构：

[1] Natl Univ Singapore, Sch Comp, Sea NExT Joint Lab, Singapore 118404, Singapore

[2] Zhejiang Univ, Sch Comp Sci, Hangzhou 310027, Peoples R China

来源：

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS | 2024年 / 35卷 / 06期

关键词：

Three-dimensional displays; Solid modeling; Geometry; Semantics; Data models; Image color analysis; Training; 3D human representation; graph convolutional networks; image retrieval; person re-identification (re-id); point cloud; NEURAL-NETWORK; POSE; ATTRIBUTE;

D O I：

10.1109/TNNLS.2022.3214834

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

People live in a 3D world. However, existing works on person re-identification (re-id) mostly consider the semantic representation learning in a 2D space, intrinsically limiting the understanding of people. In this work, we address this limitation by exploring the prior knowledge of the 3D body structure. Specifically, we project 2D images to a 3D space and introduce a novel parameter-efficient omni-scale graph network (OG-Net) to learn the pedestrian representation directly from 3D point clouds. OG-Net effectively exploits the local information provided by sparse 3D points and takes advantage of the structure and appearance information in a coherent manner. With the help of 3D geometry information, we can learn a new type of deep re-id feature free from noisy variants, such as scale and viewpoint. To our knowledge, we are among the first attempts to conduct person re-id in the 3D space. We demonstrate through extensive experiments that the proposed method: (1) eases the matching difficulty in the traditional 2D space; 2) exploits the complementary information of 2D appearance and 3D structure; 3) achieves competitive results with limited parameters on four large-scale person re-id datasets; and 4) has good scalability to unseen datasets. Our code, models, and generated 3D human data are publicly available at https://github.com/layumi/person-reid-3d.

引用

页码：7534 / 7547

页数：14

共 124 条

[1] Point Convolutional Neural Networks by Extension Operators
Atzmon, Matan
Maron, Haggai
Lipman, Yaron
[J]. ACM TRANSACTIONS ON GRAPHICS, 2018, 37 (04):
[2] Bai S, 2017, AAAI CONF ARTIF INTE, P1281
[3] Looking beyond appearances: Synthetic training data for deep CNNs in re identification
Barbosa, Igor Barros
Cristani, Marco
Caputo, Barbara
Rognhaugen, Aleksander
Theoharis, Theoharis
[J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 2018, 167 : 50 - 62
[4] Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
[5] Image-Image Domain Adaptation with Preserved Self-Similarity and Domain-Dissimilarity for Person Re-identification
Deng, Weijian
Zheng, Liang
Ye, Qixiang
Kang, Guoliang
Yang, Yi
Jiao, Jianbin
[J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 994 - 1003
[6] Adaptive Exploration for Unsupervised Person Re-identification
Ding, Yuhang
Fan, Hehe
Xu, Mingliang
Yang, Yi
[J]. ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2020, 16 (01)
[7] Eom Chanho, 2019, ADV NEUR IN, V32
[8] Object Detection with Discriminatively Trained Part-Based Models
Felzenszwalb, Pedro F.
Girshick, Ross B.
McAllester, David
Ramanan, Deva
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2010, 32 (09) : 1627 - 1645
[9] Gao S, 2020, PROC CVPR IEEE, P11741, DOI 10.1109/CVPR42600.2020.01176
[10] Ge YX, 2018, ADV NEUR IN, V31

← 1 2 3 4 5 6 7 8 9 10 →