FaceHunter: A multi-task convolutional neural network based face detector

被引:10
作者
Wang, Dong [1 ]
Yang, Jing [1 ]
Deng, Jiankang [1 ]
Liu, Qingshan [1 ]
机构
[1] Nanjing Univ Informat Sci & Technol, Sch Informat & Control, B DAT Lab, 219 Ningliu Rd, Nanjing, Jiangsu, Peoples R China
关键词
Face detection; Convolutional neural network; Multi-task; Adaptive pooling layer; Region proposal network; MODEL;
D O I
10.1016/j.image.2016.04.004
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, we propose a new multi-task Convolutional Neural Network (CNN) based face detector, which is named FaceHunter for simplicity. The main idea is to make the face detector achieve a high detection accuracy and obtain much reliable face boxes. Reliable face boxes output will be much helpful for further face image analysis. To reach this goal, we design a deep CNN network with a multi-task loss, i.e., one is for discriminating face and non-face, and another is for face box regression. An adaptive pooling layer is added before full connection to make the network adaptive to variable candidate proposals, and the truncated SVD is applied to compress the parameters of the fully connected layers. To further speed up the detector, the convolutional feature map is directly used to generate the candidate proposals by using Region Proposal Network (RPN). The proposed FaceHunter is evaluated on the AFW dataset, FDDB dataset and Pascal Faces respectively, and extensive experiments demonstrate its powerful performance against several state-of-the-art detectors. (C) 2016 Elsevier B.V. All rights reserved.
引用
收藏
页码:476 / 481
页数:6
相关论文
共 28 条
[1]  
[Anonymous], ARXIV150408083
[2]  
[Anonymous], 2015, P ADV NEUR INF PROC
[3]  
[Anonymous], 1985, PARALLEL DISTRIBUTED
[4]  
[Anonymous], ARXIV13054537
[5]  
[Anonymous], PROC CVPR IEEE
[6]  
Dong Wang, 2015, Image and Graphics. 8th International Conference, ICIG 2015. Proceedings: LNCS 9218, P373, DOI 10.1007/978-3-319-21963-9_34
[7]   Object Detection with Discriminatively Trained Part-Based Models [J].
Felzenszwalb, Pedro F. ;
Girshick, Ross B. ;
McAllester, David ;
Ramanan, Deva .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2010, 32 (09) :1627-1645
[9]  
Grauman K, 2005, IEEE I CONF COMP VIS, P1458
[10]  
Jain V, 2011, PROC CVPR IEEE, P577, DOI 10.1109/CVPR.2011.5995317