ATTRIBUTE-AWARE NETWORK FOR PEDESTRIAN ATTRIBUTE RECOGNITION

被引:0
作者
Wu, Zesen
Ye, Mang
Chen, Shuoyi
Du, Bo
机构
来源
2024 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO WORKSHOPS, ICMEW 2024 | 2024年
关键词
Deep Learning; Pedestrian Attribute Recognition;
D O I
10.1109/ICMEW63481.2024.10645357
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Pedestrian attribute recognition (PAR) aims to identify the attributes like gender, hat, and upper clothes color of a captured pedestrian, which is a challenging but practical research problem in security applications. One key consideration in PAR is that the uniform extracted features of the backbone cannot effectively fulfill the requirements for classifying each attribute. In the scope of the MMVRAC challenge, this paper introduces the Attribute-aware Multi-layer Projector (AMLP) to enhance the performance in the UAVHuman pedestrian attribute recognition benchmark. The AMLP transforms the uniform pedestrian feature into attribute-aware representation, thereby enhancing the subsequent prediction accuracy. Furthermore, we validate the validity of the AMLP and develop a strong pipeline for PAR based on sufficient experiments, which validates the effectiveness of various commonly employed modules used to improve PAR, such as data augmentation, label smoothing or dropout.
引用
收藏
页数:6
相关论文
共 44 条
[1]  
Baldi P., 2013, Advances in Neural Information Processing Systems, V26, P2814
[2]   C2T-Net: Channel-Aware Cross-Fused Transformer-Style Networks for Pedestrian Attribute Recognition [J].
Bui, Doanh C. ;
Thinh V. Le ;
Ngo, Ba Hung .
2024 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WORKSHOPS, WACVW 2024, 2024, :351-358
[3]  
Bui DC, 2024, Arxiv, DOI [arXiv:2403.06119, 10.48550/ARXIV.2403.06119, DOI 10.48550/ARXIV.2403.06119]
[4]   A novel self-boosting dual-branch model for pedestrian attribute recognition [J].
Cao, Yilu ;
Fang, Yuchun ;
Zhang, Yaofang ;
Hou, Xiaoyu ;
Zhang, Kunlin ;
Huang, Wei .
SIGNAL PROCESSING-IMAGE COMMUNICATION, 2023, 115
[5]   Rotation Invariant Transformer for Recognizing Object in UAVs [J].
Chen, Shuoyi ;
Ye, Mang ;
Du, Bo .
PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, :2565-2574
[6]   Class-Balanced Loss Based on Effective Number of Samples [J].
Cui, Yin ;
Jia, Menglin ;
Lin, Tsung-Yi ;
Song, Yang ;
Belongie, Serge .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :9260-9269
[7]  
Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
[8]  
Dosovitskiy Alexey., 2021, PROC INT C LEARN REP, P2021
[9]  
Fan Xinwen, 2023, IEEE Transactions on Circuits and Systems for Video Technology
[10]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778