Hierarchical Reasoning Network for Pedestrian Attribute Recognition

Cited by: 13
Authors
An, Haoran [1]
Hu, Hai-Miao [1]
Guo, Yuanfang [1]
Zhou, Qianli [2]
Li, Bo [1]
Affiliations
[1] Beihang Univ, Sch Comp Sci & Engn, Beijing 100191, Peoples R China
[2] Peoples Publ Secur Univ China, Beijing 100038, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Feature extraction; Cognition; Semantics; Task analysis; Machine learning; Correlation; Image color analysis; Pedestrian attribute recognition; video surveillance; abstraction levels; hierarchical; reason; CLASSIFICATION; RETRIEVAL; ALIGNMENT;
DOI
10.1109/TMM.2020.2975417
Chinese Library Classification number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
Pedestrian attribute recognition, which can benefit other tasks such as person re-identification and pedestrian retrieval, is very important in video surveillance related tasks. In this paper, we observe that the existing methods tackle this problem from the perspective of multi-label classification without considering the hierarchical relationships among the attributes. In human cognition, the attributes can be categorized according to their semantic/abstraction levels. The high-level attributes can be predicted by reasoning from the low-level and medium-level attributes, while the recognition of the low-level and medium-level attributes can be guided by the high-level attributes. Based on this attribute categorization, we propose a novel Hierarchical Reasoning Network (HR-Net), which can hierarchically predict the attributes at different abstraction levels in different stages of the network. We also propose an attribute reasoning structure to exploit the relationships among the attributes at different semantic levels. Experimental results demonstrate that the proposed network gives superior performances compared to the state-of-the-art techniques.
Pages: 268-280
Number of pages: 13