Person Retrieval in Surveillance Videos Via Deep Attribute Mining and Reasoning

被引:38
|
作者
Shi, Yuxuan [1 ]
Wei, Zhen [2 ]
Ling, Hefei [1 ]
Wang, Ziyang [1 ]
Shen, Jialie [3 ]
Li, Ping [1 ]
机构
[1] Huazhong Univ Sci & Technol, Sch Comp Sci & Technol, 1037 Luoyu Rd, Wuhan 430074, Peoples R China
[2] Ecole Polytech Fed Lausanne, Sch Comp & Commun Sci, CH-1015 Lausanne, Switzerland
[3] Queens Univ Belfast, Belfast BT7 1NN, Antrim, North Ireland
关键词
Cognition; Feature extraction; Hair; Semantics; Training; Robustness; Convolution; Person retrieval; person re-identification; human attribute; graph convolutional network; NEURAL-NETWORK; REIDENTIFICATION; IDENTIFICATION;
D O I
10.1109/TMM.2020.3042068
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Person retrieval largely relies on the appearance features of pedestrians. This task is rather more difficult in surveillance videos due to the limitations of extracting robust appearance features brought by the cross-view and cross-camera data with lower image resolution, motion blur, occlusion and other kinds of image degradation. To build up a more reliable person retrieval system, recent works introduced appearance attribute models to describe and distinguish different persons with high-level semantic concepts. Despite the progress of previous works, the value of utilizing appearance attributes is still under-explored. On one hand, existing methods lack for concise and precise attribute representations that are specific for each attribute category and, in the meantime, are able to filter noisy information in irrelevant spatial locations and useless patterns. On the other hand, correlation and reasoning between different attributes are neglected, which could generate more useful information and add more robustness to the retrieval system. In this paper, we propose an Attribute Mining and Reasoning (AMR) framework which is capable to handle the issues in question. The AMR makes better use of appearance attributes with two main components. First, the AMR disentangles the representations of different attributes by localizing their spatial positions and identifying their effective patterns in a weakly supervised manner. To achieve more reliable localization, we propose the Attribute Localization Ensemble (ALE) module that is consisted of multiple localization heads and a voting mechanism. Second, we introduce the Attribute Reasoning (AR) module to correlate different attributes together with the global appearance features and discover their latent relations to generate more comprehensive descriptions of pedestrians. Extensive experiments on DukeMTMC-ReID and Market-1501 datasets demonstrate the effectiveness of the proposed AMR framework as well as its superiority over the existing state-of-the-art methods. The AMR model also shows great generalization ability on the unseen CUHK03 dataset when it is only trained on Market-1501 dataset.
引用
收藏
页码:4376 / 4387
页数:12
相关论文
共 50 条
  • [21] Person re-identification combining deep features and attribute detection
    Watson, Gregory
    Bhalerao, Abhir
    MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (9-10) : 6463 - 6481
  • [22] Specific Person Retrieval via Incomplete Text Description
    Ye, Mang
    Liang, Chao
    Wang, Zheng
    Leng, Qingming
    Chen, Jun
    Liu, Jun
    ICMR'15: PROCEEDINGS OF THE 2015 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, 2015, : 547 - 550
  • [23] Person re-identification combining deep features and attribute detection
    Gregory Watson
    Abhir Bhalerao
    Multimedia Tools and Applications, 2020, 79 : 6463 - 6481
  • [24] An Efficient Person Search Method Using Spatio-Temporal Features for Surveillance Videos
    Feng, Deying
    Yang, Jie
    Wei, Yanxia
    Xiao, Hairong
    Zhang, Laigang
    APPLIED SCIENCES-BASEL, 2022, 12 (15):
  • [25] Person Re-Identification Research via Deep Learning
    Lu Jian
    Chen Xu
    Luo Maoxin
    Wang Hangying
    LASER & OPTOELECTRONICS PROGRESS, 2020, 57 (16)
  • [26] BCRA: bidirectional cross-modal implicit relation reasoning and aligning for text-to-image person retrieval
    Li, Zhaoqi
    Xie, Yongping
    MULTIMEDIA SYSTEMS, 2024, 30 (04)
  • [27] A Text-Based Dual-Branch Person Re-Identification Algorithm Based on the Deep Attribute Information Mining Network
    Han, Ke
    Zhang, Xiyan
    Xu, Wenlong
    Jin, Long
    SYMMETRY-BASEL, 2025, 17 (01):
  • [28] A MASK BASED DEEP RANKING NEURAL NETWORK FOR PERSON RETRIEVAL
    Qi, Lei
    Huo, Jing
    Wang, Lei
    Shi, Yinghuan
    Gao, Yang
    2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2019, : 496 - 501
  • [29] Deep Fusion Feature Representation Learning With Hard Mining Center-Triplet Loss for Person Re-Identification
    Zhao, Cairong
    Lv, Xinbi
    Zhang, Zhang
    Zuo, Wangmeng
    Wu, Jun
    Miao, Duoqian
    IEEE TRANSACTIONS ON MULTIMEDIA, 2020, 22 (12) : 3180 - 3195
  • [30] Violence Detection From Industrial Surveillance Videos Using Deep Learning
    Khan, Hamza
    Yuan, Xiaohong
    Qingge, Letu
    Roy, Kaushik
    IEEE ACCESS, 2025, 13 : 15363 - 15375