Inter-Attribute awareness for pedestrian attribute recognition

Cited by: 12

Authors:

Wu, Junyi [1]

Huang, Yan [2]

Gao, Zhipeng [1]

Hong, Yating [1]

Zhao, Jianqiang [1]

Du, Xinsheng [1]

Affiliations:

[1] Xiamen Meiya P Informat Co Ltd, AI Res Ctr, Xiamen, Peoples R China

[2] Chinese Acad Sci CASIA, Inst Automat, Ctr Res Intelligent Percept & Comp CRIPAC, Natl Lab Pattern Recognit NLPR, Beijing, Peoples R China

Source:

PATTERN RECOGNITION | 2022 / Vol. 131

Keywords:

Pedestrian attribute recognition; Inter-Attribute awareness; Vector-Neuron capsules;

DOI:

10.1016/j.patcog.2022.108865

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

The task of pedestrian attribute recognition (PAR) is to distinguish a series of person semantic attributes. Generally, existing methods adopt multi-label classification algorithms to tackle the PAR task by utilizing multiple attribute labels. Despite remarkable progress, this kind of method normally ignores relations between different attributes. In order to be aware of relations between attributes, we propose an inter attribute aware network via vector-neuron capsule for PAR (IAA-Caps). Our IAA-Caps method replaces traditional one-dimensional scalar neurons with two-dimensional vector-neuron capsules by embedding them in IAA-Caps. Specifically, during IAA-Caps training, one dimension in capsules is used to recognize different attributes, and the other dimension is used to strengthen the relations of different attributes. Through considering inter-attribute relations, compared with previous methods that use a heavyweight backbone (e.g., ResNet50 or BN-Inception), a more lightweight backbone (i.e., OSNet) can be adopted in our proposed IAA-Caps to achieve better performance. Experiments are conducted on several PAR benchmark datasets, including PETA, PA-100K, RAPv1, and RAPv2, demonstrating the effectiveness of the proposed IAA-Caps. In addition, experiments also show that the proposed method can improve the performance of PAR on different backbones, showing its generalization ability. © 2022 Elsevier Ltd. All rights reserved.

Pages: 14
