Person Retrieval in Surveillance Videos Via Deep Attribute Mining and Reasoning

被引:38
|
作者
Shi, Yuxuan [1 ]
Wei, Zhen [2 ]
Ling, Hefei [1 ]
Wang, Ziyang [1 ]
Shen, Jialie [3 ]
Li, Ping [1 ]
机构
[1] Huazhong Univ Sci & Technol, Sch Comp Sci & Technol, 1037 Luoyu Rd, Wuhan 430074, Peoples R China
[2] Ecole Polytech Fed Lausanne, Sch Comp & Commun Sci, CH-1015 Lausanne, Switzerland
[3] Queens Univ Belfast, Belfast BT7 1NN, Antrim, North Ireland
关键词
Cognition; Feature extraction; Hair; Semantics; Training; Robustness; Convolution; Person retrieval; person re-identification; human attribute; graph convolutional network; NEURAL-NETWORK; REIDENTIFICATION; IDENTIFICATION;
D O I
10.1109/TMM.2020.3042068
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Person retrieval largely relies on the appearance features of pedestrians. This task is rather more difficult in surveillance videos due to the limitations of extracting robust appearance features brought by the cross-view and cross-camera data with lower image resolution, motion blur, occlusion and other kinds of image degradation. To build up a more reliable person retrieval system, recent works introduced appearance attribute models to describe and distinguish different persons with high-level semantic concepts. Despite the progress of previous works, the value of utilizing appearance attributes is still under-explored. On one hand, existing methods lack for concise and precise attribute representations that are specific for each attribute category and, in the meantime, are able to filter noisy information in irrelevant spatial locations and useless patterns. On the other hand, correlation and reasoning between different attributes are neglected, which could generate more useful information and add more robustness to the retrieval system. In this paper, we propose an Attribute Mining and Reasoning (AMR) framework which is capable to handle the issues in question. The AMR makes better use of appearance attributes with two main components. First, the AMR disentangles the representations of different attributes by localizing their spatial positions and identifying their effective patterns in a weakly supervised manner. To achieve more reliable localization, we propose the Attribute Localization Ensemble (ALE) module that is consisted of multiple localization heads and a voting mechanism. Second, we introduce the Attribute Reasoning (AR) module to correlate different attributes together with the global appearance features and discover their latent relations to generate more comprehensive descriptions of pedestrians. Extensive experiments on DukeMTMC-ReID and Market-1501 datasets demonstrate the effectiveness of the proposed AMR framework as well as its superiority over the existing state-of-the-art methods. The AMR model also shows great generalization ability on the unseen CUHK03 dataset when it is only trained on Market-1501 dataset.
引用
收藏
页码:4376 / 4387
页数:12
相关论文
共 50 条
  • [31] Exploiting Unlabeled Videos for Video-Text Retrieval via Pseudo-Supervised Learning
    Lu, Yu
    Quan, Ruijie
    Zhu, Linchao
    Yang, Yi
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 6748 - 6760
  • [32] Microgroup Mining on TSina via Network Structure and User Attribute
    Xiong, Xiaobing
    Niu, Xiang
    Zhou, Gang
    Xu, Ke
    Huang, Yongzhong
    ADVANCED DATA MINING AND APPLICATIONS, PT II, 2011, 7121 : 138 - 151
  • [33] POSE GUIDED DEEP MODEL FOR PEDESTRIAN ATTRIBUTE RECOGNITION IN SURVEILLANCE SCENARIOS
    Li, Dangwei
    Chen, Xiaotang
    Zhang, Zhang
    Huang, Kaiqi
    2018 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2018,
  • [34] Suspicious Person Retrieval from UAV-sensors based on part level deep features
    Bouhlel, Fatma
    Mliki, Hazar
    Hammami, Mohamed
    KNOWLEDGE-BASED AND INTELLIGENT INFORMATION & ENGINEERING SYSTEMS (KSE 2021), 2021, 192 : 318 - 327
  • [35] Person Search via Deep Integrated Networks
    Chen, Ju-Chin
    Wu, Cheng-Feng
    Chen, Chun-Huei
    Lin, Cheng-Rong
    APPLIED SCIENCES-BASEL, 2020, 10 (01):
  • [36] Deep adversarial data augmentation with attribute guided for person re-identification
    Qiong Wu
    Pingyang Dai
    Peixian Chen
    Yuyu Huang
    Signal, Image and Video Processing, 2021, 15 : 655 - 662
  • [37] Occlusion-Sensitive Person Re-Identification via Attribute-Based Shift Attention
    Jin, Hanyang
    Lai, Shenqi
    Qian, Xueming
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (04) : 2170 - 2185
  • [38] Deep adversarial data augmentation with attribute guided for person re-identification
    Wu, Qiong
    Dai, Pingyang
    Chen, Peixian
    Huang, Yuyu
    SIGNAL IMAGE AND VIDEO PROCESSING, 2021, 15 (04) : 655 - 662
  • [39] DSSL: Deep Surroundings-person Separation Learning for Text-based Person Retrieval
    Zhu, Aichun
    Wang, Zijie
    Li, Yifeng
    Wan, Xili
    Jin, Jing
    Wang, Tian
    Hu, Fangqiang
    Hua, Gang
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 209 - 217
  • [40] Learning Deep Binary Descriptors via Bitwise Interaction Mining
    Wang, Ziwei
    Xiao, Han
    Duan, Yueqi
    Zhou, Jie
    Lu, Jiwen
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (02) : 1919 - 1933