PEDESTRIAN ATTRIBUTE RECOGNITION BASED ON MTCNN WITH ONLINE BATCH WEIGHTED LOSS

被引:0
作者
He, Xingting [1 ,2 ]
Shi, Qiuyue [1 ,2 ]
Su, Fei [1 ,2 ]
Zhao, Zhicheng [1 ,2 ]
Zhuang, Bojin [3 ]
机构
[1] Beijing Univ Posts & Telecommun, Sch Informat & Commun Engn, Beijing, Peoples R China
[2] Beijing Univ Posts & Telecommun, Beijing Key Lab Network Syst & Network Culture, Beijing, Peoples R China
[3] Ping An Technol Shenzhen Co Ltd, Shenzhen, Peoples R China
来源
2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP) | 2019年
关键词
Pedestrian Attribute Recognition; Grouping; CNN; Multi-task Learning; Online Batch Weighted Loss;
D O I
10.1109/icip.2019.8803227
中图分类号
TB8 [摄影技术];
学科分类号
0804 ;
摘要
Due to the large number and huge diversity of attributes, pedestrian attribute recognition in video surveillance scenarios is a challenging task in the field of computer vision. Different from most previous works which only focus on extremely imbalanced attribute distribution problem, a new grouping way of attributes based multi-task convolutional neural network (MTCNN) is put forward, which exploits the spatial correlations among attributes and guarantees some independence of each attribute as well. Meanwhile, we propose a novel online batch weighted loss to narrow the performance differences among attributes and boost the model to gain a higher average recognition accuracy. The whole network can be trained end to end, and experimental results on PETA and RAP datasets show that our method achieves significant performance, comparing with those state-of-the-art methods.
引用
收藏
页码:2461 / 2465
页数:5
相关论文
共 22 条
  • [1] [Anonymous], IMAGE RANKING RETRIE
  • [2] [Anonymous], ATTRIBUTE RECOGNITIO
  • [3] [Anonymous], PROC CVPR IEEE
  • [4] [Anonymous], ADV NEURAL INFORM PR
  • [5] [Anonymous], 2016, ARXIV160307054
  • [6] Multitask learning
    Caruana, R
    [J]. MACHINE LEARNING, 1997, 28 (01) : 41 - 75
  • [7] Pedestrian Attribute Recognition At Far Distance
    Deng, Yubin
    Luo, Ping
    Loy, Chen Change
    Tang, Xiaoou
    [J]. PROCEEDINGS OF THE 2014 ACM CONFERENCE ON MULTIMEDIA (MM'14), 2014, : 789 - 792
  • [8] Feris R., 2014, P INT C MULT RETR, P153
  • [9] Gao MF, 2017, 2017 13TH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (ICNC-FSKD), P1077, DOI 10.1109/FSKD.2017.8392913
  • [10] Deep Residual Learning for Image Recognition
    He, Kaiming
    Zhang, Xiangyu
    Ren, Shaoqing
    Sun, Jian
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 770 - 778