Pedestrian Attribute Recognition via Hierarchical Multi-task Learning and Relationship Attention

被引:11
作者
Gao, Lian [1 ]
Huang, Di [1 ]
Guo, Yuanfang [2 ]
Wang, Yunhong [1 ]
机构
[1] Beihang Univ, Sch Comp Sci & Engn, Beijing Adv Innovat Ctr Big Data & Brain Comp, Beijing, Peoples R China
[2] Beihang Univ, Sch Comp Sci & Engn, Beijing, Peoples R China
来源
PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19) | 2019年
关键词
pedestrian attribute recognition; deep learning; multi-task learning and visual attention;
D O I
10.1145/3343031.3351003
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Pedestrian Attribute Recognition (PAR) is an important task in surveillance video analysis. In this paper, we propose a novel end-to-end hierarchical deep learning approach to PAR. The proposed network introduces semantic segmentation into PAR and formulates it as a multi-task learning problem, which brings in pixel-level supervision in feature learning for attribute localization. According to the spatial properties of local and global attributes, we present a two stage learning mechanism to decouple coarse attribute localization and fine attribute recognition into successive phases within a single model, which strengthens feature learning. Besides, we design an attribute relationship attention module to efficiently capture and emphasize the latent relations among different attributes, further enhancing the discriminative power of the feature. Extensive experiments are conducted and very competitive results are reached on the RAP and PETA databases, indicating the effectiveness and superiority of the proposed approach.
引用
收藏
页码:1340 / 1348
页数:9
相关论文
共 29 条
[1]  
[Anonymous], P EUR C COMP VIS ECC
[2]  
[Anonymous], 2008, P 16 ACM INT C MULT, DOI [DOI 10.1145/1459359.1459470.11.P, DOI 10.1145/1459359.1459470]
[3]  
[Anonymous], 2015, ARXIV PREPRINT ARXIV
[4]  
[Anonymous], ARXIV E PRINTS
[5]  
[Anonymous], 2016, ARXIV161105603
[6]  
[Anonymous], P IEEE INT C COMP VI
[7]  
[Anonymous], 2016, ARXIV160307054
[8]  
[Anonymous], 2014, Neural Information Processing Systems
[9]  
Bilen Hakan, 2016, Advances in neural information processing systems, P235
[10]   Attention to Scale: Scale-aware Semantic Image Segmentation [J].
Chen, Liang-Chieh ;
Yang, Yi ;
Wang, Jiang ;
Xu, Wei ;
Yuille, Alan L. .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :3640-3649