UPAR: Unified Pedestrian Attribute Recognition and Person Retrieval

Citations: 22
Authors
Specker, Andreas [1, 2, 3]
Cormier, Mickael [1, 2, 3]
Beyerer, Juergen [1, 2, 3]
Affiliations
[1] Karlsruhe Inst Technol, Karlsruhe, Germany
[2] Fraunhofer IOSB, Karlsruhe, Germany
[3] Fraunhofer Ctr Machine Learning, St Augustin, Germany
Source
2023 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) | 2023
DOI
10.1109/WACV56688.2023.00104
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Recognizing soft-biometric pedestrian attributes is essential in video surveillance and fashion retrieval. Recent works show promising results on single datasets. Nevertheless, the generalization ability of these methods under different attribute distributions, viewpoints, varying illumination, and low resolutions remains poorly understood due to strong biases and varying attributes in current datasets. To close this gap and support a systematic investigation, we present UPAR, the Unified Person Attribute Recognition Dataset. It is based on four well-known person attribute recognition datasets: PA100K, PETA, RAPv2, and Market1501. We unify those datasets by providing 3.3M additional annotations to harmonize 40 important binary attributes over 12 attribute categories across the datasets. We thus enable research on generalizable pedestrian attribute recognition as well as attribute-based person retrieval for the first time. Due to the vast variance of the image distribution, pedestrian pose, scale, and occlusion, existing approaches are greatly challenged both in terms of accuracy and efficiency. Furthermore, we develop a strong baseline for PAR and attribute-based person retrieval based on a thorough analysis of regularization methods. Our models achieve state-of-the-art performance in cross-domain and specialization settings on PA100K, PETA, RAPv2, Market1501-Attributes, and UPAR. We believe UPAR and our strong baseline will contribute to the artificial intelligence community and promote research on large-scale, generalizable attribute recognition systems. The dataset is available here: https://github.com/speckean/upar_dataset
Pages: 981-990
Number of pages: 10