Human attribute recognition by refining attention heat map

被引:40
作者
Guo, Hao [1 ]
Fan, Xiaochuan [2 ]
Wang, Song [1 ]
机构
[1] Univ South Carolina, Dept Comp Sci & Engn, Columbia, SC 29208 USA
[2] HERE North Amer LLC, 425 W Randolph St, Chicago, IL 60606 USA
关键词
Human attribute recognition; Attention heat map; Exponential loss; Refine; VISUAL SALIENCY;
D O I
10.1016/j.patrec.2017.05.012
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Most existing methods of human attribute recognition are part-based, where features are extracted at human body parts corresponding to each human attribute and the part-based features are then fed to classifiers individually or together for recognizing human attributes. The performance of these methods is highly dependent on the accuracy of body-part detection, which is a well known challenging problem in computer vision. Different from these part-based methods, we propose to recognize human attributes by using CAM (Class Activation Map) network and further improve the recognition by refining the attention heat map, which is an intermediate result in CAM and reflects relevant image regions for each attribute. The proposed method does not require the detection of body parts and the prior correspondence between body parts and attributes. In particular, we define a new exponential loss function to measure the appropriateness of the attention heat map. The attribute classifiers are further trained in terms of both the original classification loss function and this new exponential loss function. The proposed method is developed on an end-to-end CNN network with CAM, by adding a new component for refining attention heat map. We conduct experiments on Berkeley Attributes of Human People Dataset and WIDER Attribute Dataset. The proposed methods achieve comparable performance of attribute recognition to the current state-of-the-art methods. (C) 2017 Elsevier B.V. All rights reserved.
引用
收藏
页码:38 / 45
页数:8
相关论文
共 38 条
[1]  
Achanta R, 2009, PROC CVPR IEEE, P1597, DOI 10.1109/CVPRW.2009.5206596
[2]  
[Anonymous], 2013, P INT C LEARN REPR
[3]  
[Anonymous], 2016, P IEEE C COMP VIS PA
[4]  
[Anonymous], IEEE T PATTERN ANAL
[5]  
[Anonymous], 2015, PROC CVPR IEEE
[6]  
[Anonymous], 2007, PROC IEEE C COMPUT V, DOI 10.1109/CVPR.2007.383267
[7]  
Bourdev L, 2011, IEEE I CONF COMP VIS, P1543, DOI 10.1109/ICCV.2011.6126413
[8]  
Chang KY, 2011, IEEE I CONF COMP VIS, P914, DOI 10.1109/ICCV.2011.6126333
[9]   Global Contrast Based Salient Region Detection [J].
Cheng, Ming-Ming ;
Mitra, Niloy J. ;
Huang, Xiaolei ;
Torr, Philip H. S. ;
Hu, Shi-Min .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2015, 37 (03) :569-582
[10]  
Duan K, 2012, PROC CVPR IEEE, P3474, DOI 10.1109/CVPR.2012.6248089