Face Detection With Different Scales Based on Faster R-CNN

被引:120
作者
Wu, Wenqi [1 ,2 ]
Yin, Yingjie [1 ,2 ]
Wang, Xingang [1 ,2 ]
Xu, De [1 ,2 ]
机构
[1] Chinese Acad Sci, Inst Automat, Res Ctr Precis Sensing & Control, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing 100049, Peoples R China
基金
国家高技术研究发展计划(863计划); 中国国家自然科学基金;
关键词
Deep convolutional neural network (DCNN); deep learning; face detection; Faster R-CNN; RECOGNITION;
D O I
10.1109/TCYB.2018.2859482
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In recent years, the application of deep learning based on deep convolutional neural networks has gained great success in face detection. However, one of the remaining open challenges is the detection of small-scaled faces. The depth of the convolutional network can cause the projected feature map for small faces to be quickly shrunk, and most detection approaches with scale invariant can hardly handle less than 15 x 15 pixel faces. To solve this problem, we propose a different scales face detector (DSFD) based on Faster R-CNN. The new network can improve the precision of face detection while performing as real-time a Faster R-CNN. First, an efficient multitask region proposal network (RPN), combined with boosting face detection, is developed to obtain the human face ROI. Setting the ROI as a constraint, an anchor is inhomogeneously produced on the top feature map by the multitask RPN. A human face proposal is extracted through the anchor combined with facial landmarks. Then, a parallel-type Fast R-CNN network is proposed based on the proposal scale. According to the different percentages they cover on the images, the proposals are assigned to three corresponding Fast R-CNN networks. The three networks are separated through the proposal scales and differ from each other in the weight of feature map concatenation. A variety of strategies is introduced in our face detection network, including multitask learning, feature pyramid, and feature concatenation. Compared to state-of-the-art face detection methods such as UnitBox, HyperFace, FastCNN, the proposed DSFD method achieves promising performance on popular benchmarks including FDDB, AFW, PASCAL faces, and WIDER FACE.
引用
收藏
页码:4017 / 4028
页数:12
相关论文
共 63 条
[11]  
[Anonymous], ARXIV160802236V1
[12]  
[Anonymous], PROC CVPR IEEE
[13]  
[Anonymous], ADV NEURAL INFORM PR, DOI DOI 10.1109/TPAMI.2016.2577031
[14]  
[Anonymous], 2016, PROC CVPR IEEE, DOI DOI 10.1109/CVPR.2016.596
[15]   Inside-Outside Net: Detecting Objects in Context with Skip Pooling and Recurrent Neural Networks [J].
Bell, Sean ;
Zitnick, C. Lawrence ;
Bala, Kavita ;
Girshick, Ross .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :2874-2883
[16]   Robust object detection via soft cascade [J].
Bourdev, L ;
Brandt, J .
2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 2, PROCEEDINGS, 2005, :236-243
[17]  
Chen D, 2014, LECT NOTES COMPUT SC, V8694, P109, DOI 10.1007/978-3-319-10599-4_8
[18]   Fast Edge Detection Using Structured Forests [J].
Dollar, Piotr ;
Zitnick, C. Lawrence .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2015, 37 (08) :1558-1570
[19]   Supervised Transformer Network for Efficient Face Detection [J].
Chen, Dong ;
Hua, Gang ;
Wen, Fang ;
Sun, Jian .
COMPUTER VISION - ECCV 2016, PT V, 2016, 9909 :122-138
[20]   Multi-view Face Detection Using Deep Convolutional Neural Networks [J].
Farfade, Sachin Sudhakar ;
Saberian, Mohammad ;
Li, Li-Jia .
ICMR'15: PROCEEDINGS OF THE 2015 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, 2015, :643-650