A review on video person re-identification based on deep learning

被引:3
作者
Ma, Haifei [1 ,3 ]
Zhang, Canlong [1 ,2 ]
Zhang, Yifeng [1 ]
Li, Zhixin [1 ,2 ]
Wang, Zhiwen [4 ]
Wei, Chunrong [1 ]
机构
[1] Guangxi Normal Univ, Key Lab Educ Blockchain & Intelligent Technol, Minist Educ, Guilin, Peoples R China
[2] Guangxi Normal Univ, Guangxi Key Lab Multisource Informat Min & Secur, Guilin, Peoples R China
[3] Guangdong Univ Sci & Technol, Dongguan, Peoples R China
[4] Guangxi Univ Sci & Technol, Sch Elect Engn, Liuzhou, Peoples R China
基金
美国国家科学基金会;
关键词
Video-based person ReID; Temporal learning; Literature survey and perspectives; Attention mechanism; Convolutional neural network; UNSUPERVISED DOMAIN ADAPTATION; NEURAL-NETWORK;
D O I
10.1016/j.neucom.2024.128479
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Person Re-Identification (ReID) is an essential technology for matching a person across non-overlapping cameras. It has attracted increasing attention in recent years due to its wide range of applications in various real-world scenarios such as security surveillance and criminal investigation. Different from other person ReID tasks, video-based ReID uses a video clip as the retrieval input, which can provide more promising ReID performance because that the video has rich information on appearance, motion cues and pose variations on temporal pipeline. Over the last few years, many deep learning-based video person ReID have been proposed to address various challenges, such as illumination variation, complex background, occlusion, etc. To provide a more comprehensive and readable review on existing video-based person ReID methods, we propose a novel taxonomy method that observes existing methods from four perspectives: data, algorithms, computing power, and applications. Specifically, we first introduce some popular datasets and evaluation criterion used for video-based person ReID. Next, from limited data and little annotation view, we introduce data augmentation and unsupervised learning ReID. From algorithm view, we focus on reviewing supervised methods including spatial feature learning, temporal feature learning and spatio-temporal feature learning, and further discuss and conduct a systematic comparison among these approaches. From complex open-world application view, we mainly summarized domain adaption and multimodal ReID. From insufficient GPU computing power view, we mainly discuss modality-agnostic unified large-scale ReID and their lightweighting. Finally, we provide a discussion of open problems and potential research directions for the community.
引用
收藏
页数:17
相关论文
共 162 条
  • [1] Spatio-Temporal Representation Factorization for Video-based Person Re-Identification
    Aich, Abhishek
    Zheng, Meng
    Karanam, Srikrishna
    Chen, Terrence
    Roy-Chowdhury, Amit K.
    Wu, Ziyan
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 152 - 162
  • [2] Alsehaim A., 2022, VID-trans-reid: Enhanced video transformers for person re-identification
  • [3] Bahdanau D, 2016, Arxiv, DOI arXiv:1409.0473
  • [4] Salient-to-Broad Transition for Video Person Re-identification
    Bai, Shutao
    Ma, Bingpeng
    Chang, Hong
    Huang, Rui
    Chen, Xilin
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 7329 - 7338
  • [5] Hierarchical Connectivity-Centered Clustering for Unsupervised Domain Adaptation on Person Re-Identification
    Bai, Yan
    Wang, Ce
    Lou, Yihang
    Liu, Jun
    Duan, Ling-Yu
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 6715 - 6729
  • [6] The relation between the ROC curve and the CMC
    Bolle, RM
    Connell, JH
    Pankanti, S
    Ratha, NK
    Senior, AW
    [J]. FOURTH IEEE WORKSHOP ON AUTOMATIC IDENTIFICATION ADVANCED TECHNOLOGIES, PROCEEDINGS, 2005, : 15 - 20
  • [7] Video Person Re-Identification Using Attribute-Enhanced Features
    Chai, Tianrui
    Chen, Zhiyuan
    Li, Annan
    Chen, Jiaxin
    Mei, Xinyu
    Wang, Yunhong
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (11) : 7951 - 7966
  • [8] Towards Modality-Agnostic Person Re-identification with Descriptive Query
    Chen, Cuiqun
    Ye, Mang
    Jiang, Ding
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 15128 - 15137
  • [9] Saliency and Granularity: Discovering Temporal Coherence for Video-Based Person Re-Identification
    Chen, Cuiqun
    Ye, Mang
    Qi, Meibin
    Wu, Jingjing
    Liu, Yimin
    Jiang, Jianguo
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (09) : 6100 - 6112
  • [10] Structure-Aware Positional Transformer for Visible-Infrared Person Re-Identification
    Chen, Cuiqun
    Ye, Mang
    Qi, Meibin
    Wu, Jingjing
    Jiang, Jianguo
    Lin, Chia-Wen
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 2352 - 2364