Parameter-Efficient Person Re-Identification in the 3D Space

被引:18
作者
Zheng, Zhedong [1 ]
Wang, Xiaohan [2 ]
Zheng, Nenggan [2 ]
Yang, Yi [2 ]
机构
[1] Natl Univ Singapore, Sch Comp, Sea NExT Joint Lab, Singapore 118404, Singapore
[2] Zhejiang Univ, Sch Comp Sci, Hangzhou 310027, Peoples R China
关键词
Three-dimensional displays; Solid modeling; Geometry; Semantics; Data models; Image color analysis; Training; 3D human representation; graph convolutional networks; image retrieval; person re-identification (re-id); point cloud; NEURAL-NETWORK; POSE; ATTRIBUTE;
D O I
10.1109/TNNLS.2022.3214834
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
People live in a 3D world. However, existing works on person re-identification (re-id) mostly consider the semantic representation learning in a 2D space, intrinsically limiting the understanding of people. In this work, we address this limitation by exploring the prior knowledge of the 3D body structure. Specifically, we project 2D images to a 3D space and introduce a novel parameter-efficient omni-scale graph network (OG-Net) to learn the pedestrian representation directly from 3D point clouds. OG-Net effectively exploits the local information provided by sparse 3D points and takes advantage of the structure and appearance information in a coherent manner. With the help of 3D geometry information, we can learn a new type of deep re-id feature free from noisy variants, such as scale and viewpoint. To our knowledge, we are among the first attempts to conduct person re-id in the 3D space. We demonstrate through extensive experiments that the proposed method: (1) eases the matching difficulty in the traditional 2D space; 2) exploits the complementary information of 2D appearance and 3D structure; 3) achieves competitive results with limited parameters on four large-scale person re-id datasets; and 4) has good scalability to unseen datasets. Our code, models, and generated 3D human data are publicly available at https://github.com/layumi/person-reid-3d.
引用
收藏
页码:7534 / 7547
页数:14
相关论文
共 124 条
  • [1] Point Convolutional Neural Networks by Extension Operators
    Atzmon, Matan
    Maron, Haggai
    Lipman, Yaron
    [J]. ACM TRANSACTIONS ON GRAPHICS, 2018, 37 (04):
  • [2] Bai S, 2017, AAAI CONF ARTIF INTE, P1281
  • [3] Looking beyond appearances: Synthetic training data for deep CNNs in re identification
    Barbosa, Igor Barros
    Cristani, Marco
    Caputo, Barbara
    Rognhaugen, Aleksander
    Theoharis, Theoharis
    [J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 2018, 167 : 50 - 62
  • [4] Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
  • [5] Image-Image Domain Adaptation with Preserved Self-Similarity and Domain-Dissimilarity for Person Re-identification
    Deng, Weijian
    Zheng, Liang
    Ye, Qixiang
    Kang, Guoliang
    Yang, Yi
    Jiao, Jianbin
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 994 - 1003
  • [6] Adaptive Exploration for Unsupervised Person Re-identification
    Ding, Yuhang
    Fan, Hehe
    Xu, Mingliang
    Yang, Yi
    [J]. ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2020, 16 (01)
  • [7] Eom Chanho, 2019, ADV NEUR IN, V32
  • [8] Object Detection with Discriminatively Trained Part-Based Models
    Felzenszwalb, Pedro F.
    Girshick, Ross B.
    McAllester, David
    Ramanan, Deva
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2010, 32 (09) : 1627 - 1645
  • [9] Gao S, 2020, PROC CVPR IEEE, P11741, DOI 10.1109/CVPR42600.2020.01176
  • [10] Ge YX, 2018, ADV NEUR IN, V31