Hierarchical Reasoning Network for Pedestrian Attribute Recognition

Cited by: 15

Authors:

An, Haoran ^{[1]}

Hu, Hai-Miao ^{[1]}

Guo, Yuanfang ^{[1]}

Zhou, Qianli ^{[2]}

Li, Bo ^{[1]}

Affiliations:

[1] Beihang Univ, Sch Comp Sci & Engn, Beijing 100191, Peoples R China

[2] Peoples Publ Secur Univ China, Beijing 100038, Peoples R China

Source:

IEEE TRANSACTIONS ON MULTIMEDIA | 2021 / Vol. 23

Funding:

National Natural Science Foundation of China;

Keywords:

Feature extraction; Cognition; Semantics; Task analysis; Machine learning; Correlation; Image color analysis; Pedestrian attribute recognition; video surveillance; abstraction levels; hierarchical; reason; CLASSIFICATION; RETRIEVAL; ALIGNMENT;

DOI:

10.1109/TMM.2020.2975417

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812;

Abstract:

Pedestrian attribute recognition, which can benefit other tasks such as person re-identification and pedestrian retrieval, is very important in video-surveillance-related tasks. In this paper, we observe that the existing methods tackle this problem from the perspective of multi-label classification without considering the hierarchical relationships among the attributes. In human cognition, the attributes can be categorized according to their semantic/abstraction levels. The high-level attributes can be predicted by reasoning from the low-level and medium-level attributes, while the recognition of the low-level and medium-level attributes can be guided by the high-level attributes. Based on this attribute categorization, we propose a novel Hierarchical Reasoning Network (HR-Net), which hierarchically predicts the attributes at different abstraction levels in different stages of the network. We also propose an attribute reasoning structure to exploit the relationships among the attributes at different semantic levels. Experimental results demonstrate that the proposed network achieves superior performance compared to state-of-the-art techniques.

Citation

Pages: 268-280

Page count: 13
