Single-Shot Scale-Aware Network for Real-Time Face Detection

被引:1
作者
Shifeng Zhang
Longyin Wen
Hailin Shi
Zhen Lei
Siwei Lyu
Stan Z. Li
机构
[1] Chinese Academy of Sciences,CBSR & NLPR, Institute of Automation
[2] University of Chinese Academy of Sciences,Computer Science Department
[3] JD Digits,undefined
[4] JD AI Research,undefined
[5] University at Albany,undefined
[6] SUNY,undefined
来源
International Journal of Computer Vision | 2019年 / 127卷
关键词
Face detection; Single-shot; Scale-aware; Class imbalance;
D O I
暂无
中图分类号
学科分类号
摘要
In this work, we describe a single-shot scale-aware convolutional neural network based face detector (SFDet). In comparison with the state-of-the-art anchor-based face detection methods, the main advantages of our method are summarized in four aspects. (1) We propose a scale-aware detection network using a wide scale range of layers associated with appropriate scales of anchors to handle faces with various scales, and describe a new equal density principle to ensure anchors with different scales to be evenly distributed on the image. (2) To improve the recall rates of faces with certain scales (e.g., the scales of the faces are quite different from the scales of designed anchors), we design a new anchor matching strategy with scale compensation. (3) We introduce an IoU-aware weighting scheme for each training sample in classification loss calculation to encode samples accurately in training process. (4) Considering the class imbalance issue, a max-out background strategy is used to reduce false positives. Several experiments are conducted on public challenging face detection datasets, i.e., WIDER FACE, AFW, PASCAL Face, FDDB, and MAFA, to demonstrate that the proposed method achieves the state-of-the-art results and runs at 82.1 FPS for the VGA-resolution images.
引用
收藏
页码:537 / 559
页数:22
相关论文
共 45 条
  • [1] Brubaker SC(2008)On the design of cascades of boosted ensembles for face detection International Journal of Computer Vision 77 65-86
  • [2] Wu J(2010)Object detection with discriminatively trained part-based models IEEE Transactions on Pattern Analysis and Machine Intelligence 32 1627-1645
  • [3] Sun J(1997)A decision-theoretic generalization of on-line learning and an application to boosting Journal of Computer and System Sciences 55 119-139
  • [4] Mullin MD(2016)A fast and accurate unconstrained face detector IEEE Transactions on Pattern Analysis and Machine Intelligence 38 211-223
  • [5] Rehg JM(2004)Distinctive image features from scale-invariant keypoints International Journal of Computer Vision 60 91-110
  • [6] Felzenszwalb PF(2017)Faster R-CNN: Towards real-time object detection with region proposal networks IEEE Transactions on Pattern Analysis and Machine Intelligence 39 1137-1149
  • [7] Girshick RB(2015)Imagenet large scale visual recognition challenge International Journal of Computer Vision 115 211-252
  • [8] McAllester DA(2013)Selective search for object recognition International Journal of Computer Vision 104 154-171
  • [9] Ramanan D(2004)Robust real-time face detection International Journal of Computer Vision 57 137-154
  • [10] Freund Y(2014)Face detection by structural models Image Vision Computing 32 790-799