YOLO-FaceV2: A scale and occlusion aware face detector

被引:111
作者
Yu, Ziping [1 ]
Huang, Hongbo [2 ]
Chen, Weijun [3 ]
Su, Yongxin [4 ]
Liu, Yahui [5 ]
Wang, Xiuying [2 ]
机构
[1] Beijing Informat Sci & Technol Univ, Sch Instrument Sci & Optoelect Engn, Beijing, Peoples R China
[2] Beijing Informat Sci & Technol Univ, Comp Sch, Beijing, Peoples R China
[3] Data Algorithm NIO, Shanghai, Peoples R China
[4] Beijing Informat Sci & Technol Univ, Sch Mech & Elect Engn, Beijing, Peoples R China
[5] Beijing Informat Sci & Technol Univ, Sch Informat Management, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
Face detection; YOLO; Scale-aware; Occlusion; Imbalance problem;
D O I
10.1016/j.patcog.2024.110714
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, face detection algorithms based on deep learning have made great progress. Nevertheless, the effective utilization of face detectors for small and occlusion faces remains challenging, primarily stemming from the limitations in pixel information and the presence of missing features. In this paper, we propose a novel real-time face detector, YOLO-FaceV2, built upon the YOLOv5 architecture. Our approach introduces a Receptive Field Enhancement (RFE) module designed to extract multi-scale pixel information and augment the receptive field for accurately detecting small faces. To address issues related to face occlusion, we introduce an attention mechanism termed the Separated and Enhancement Attention Module (SEAM), which effectively focuses on the regions affected by occlusion. Furthermore, we propose a Slide Weight Function (SWF) to mitigate the imbalance between easy and hard samples. The experiments demonstrate that our YOLO-FaceV2 achieves performance exceeding the state-of-the-art on the WiderFace validation dataset. Source code and pre-trained model are available at https://github.com/Krasjet-Yu/YOLO-FaceV2.
引用
收藏
页数:10
相关论文
共 33 条
[1]   Prime Sample Attention in Object Detection [J].
Cao, Yuhang ;
Chen, Kai ;
Loy, Chen Change ;
Lin, Dahua .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :11580-11588
[2]  
Chen M, 2020, PROCEEDINGS OF 2020 IEEE 9TH DATA DRIVEN CONTROL AND LEARNING SYSTEMS CONFERENCE (DDCLS'20), P772, DOI 10.1109/DDCLS49620.2020.9275060
[3]   YOLO-face: a real-time face detector [J].
Chen, Weijun ;
Huang, Hongbo ;
Peng, Shuai ;
Zhou, Changsheng ;
Zhang, Cuiping .
VISUAL COMPUTER, 2021, 37 (04) :805-813
[4]  
Chi C, 2019, AAAI CONF ARTIF INTE, P8231
[5]   RetinaFace: Single-shot Multi-level Face Localisation in the Wild [J].
Deng, Jiankang ;
Guo, Jia ;
Ververas, Evangelos ;
Kotsia, Irene ;
Zafeiriou, Stefanos .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :5202-5211
[6]   Coordinate Attention for Efficient Mobile Network Design [J].
Hou, Qibin ;
Zhou, Daquan ;
Feng, Jiashi .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :13708-13717
[7]  
Jocher G., Yolov5
[8]   Keep an eye on faces: Robust face detection with heatmap-Assisted spatial attention and scale-Aware layer attention [J].
Ju, Lei ;
Kittler, Josef ;
Rana, Muhammad Awais ;
Yang, Wankou ;
Feng, Zhenhua .
PATTERN RECOGNITION, 2023, 140
[9]  
Li HX, 2015, PROC CVPR IEEE, P5325, DOI 10.1109/CVPR.2015.7299170
[10]   DSFD: Dual Shot Face Detector [J].
Li, Jian ;
Wang, Yabiao ;
Wang, Changan ;
Tai, Ying ;
Qian, Jianjun ;
Yang, Jian ;
Wang, Chengjie ;
Li, Jilin ;
Huang, Feiyue .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :5055-5064