A Framework for Pedestrian Attribute Recognition Using Deep Learning

被引:1
作者
Sakib, Saadman [1 ]
Deb, Kaushik [1 ]
Dhar, Pranab Kumar [1 ]
Kwon, Oh-Jin [2 ]
机构
[1] Chittagong Univ Engn & Technol, Dept Comp Sci & Engn, Chattogram 4349, Bangladesh
[2] Sejong Univ, Dept Elect Engn, 209 Neungdong Ro, Seoul 05006, South Korea
来源
APPLIED SCIENCES-BASEL | 2022年 / 12卷 / 02期
关键词
pedestrian attribute recognition; mask R-CNN; transfer learning; ResNet; 152; v2; oversampling;
D O I
10.3390/app12020622
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
The pedestrian attribute recognition task is becoming more popular daily because of its significant role in surveillance scenarios. As the technological advances are significantly more than before, deep learning came to the surface of computer vision. Previous works applied deep learning in different ways to recognize pedestrian attributes. The results are satisfactory, but still, there is some scope for improvement. The transfer learning technique is becoming more popular for its extraordinary performance in reducing computation cost and scarcity of data in any task. This paper proposes a framework that can work in surveillance scenarios to recognize pedestrian attributes. The mask R-CNN object detector extracts the pedestrians. Additionally, we applied transfer learning techniques on different CNN architectures, i.e., Inception ResNet v2, Xception, ResNet 101 v2, ResNet 152 v2. The main contribution of this paper is fine-tuning the ResNet 152 v2 architecture, which is performed by freezing layers, last 4, 8, 12, 14, 20, none, and all. Moreover, data balancing techniques are applied, i.e., oversampling, to resolve the class imbalance problem of the dataset and analysis of the usefulness of this technique is discussed in this paper. Our proposed framework outperforms state-of-the-art methods, and it provides 93.41% mA and 89.24% mA on the RAP v2 and PARSE100K datasets, respectively.
引用
收藏
页数:25
相关论文
共 47 条
[1]   Real-Time Human Detection for Aerial Captured Video Sequences via Deep Models [J].
AlDahoul, Nouar ;
Sabri, Aznul Qalid Md ;
Mansoor, Ali Mohammed .
COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2018, 2018
[2]  
[Anonymous], ARXIV160307054
[3]  
[Anonymous], 2016, Comput. Vis. Pattern Recogn.
[4]   Human detection techniques for real time surveillance: a comprehensive survey [J].
Ansari, Mohd. Aquib ;
Singh, Dushyant Kumar .
MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (06) :8759-8808
[5]  
Astawa I., 2017, TELKOMNIKA, V15, P1894
[6]  
Bharati Puja, 2020, Computational Intelligence in Pattern Recognition. Proceedings of CIPR 2019. Advances in Intelligent Systems and Computing (AISC 999), P657, DOI 10.1007/978-981-13-9042-5_56
[7]   Pedestrian Attribute Recognition with Part-based CNN and Combined Feature Representations [J].
Chen, Yiqiang ;
Duffner, Stefan ;
Stoian, Andrei ;
Dufour, Jean-Yves ;
Baskurt, Atilla .
PROCEEDINGS OF THE 13TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS (VISIGRAPP 2018), VOL 5: VISAPP, 2018, :114-122
[8]   Pedestrian Attribute Recognition At Far Distance [J].
Deng, Yubin ;
Luo, Ping ;
Loy, Chen Change ;
Tang, Xiaoou .
PROCEEDINGS OF THE 2014 ACM CONFERENCE ON MULTIMEDIA (MM'14), 2014, :789-792
[9]  
Drummond C., 2003, WORKSH LEARN IMB DAT, V11, P1
[10]  
Fang WH, 2018, CHINA COMMUN, V15, P208