Real-time face alignment: evaluation methods, training strategies and implementation optimization

被引:13
作者
Alvarez Casado, Constantino [1 ]
Bordallo Lopez, Miguel [1 ,2 ]
机构
[1] Univ Oulu, Ctr Machine Vis & Signal Anal, Oulu, Finland
[2] VTT Tech Res Ctr Finland Ltd, Oulu, Finland
关键词
Face alignment; Real-time; Embedded devices; Cascaded regression; Optimization implementation; Training strategies; DISCRIMINATION POWER ANALYSIS; RECOGNITION; REGRESSION; NOISE;
D O I
10.1007/s11554-021-01107-w
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Face alignment is a crucial component in most face analysis systems. It focuses on identifying the location of several keypoints of the human faces in images or videos. Although several methods and models are available to developers in popular computer vision libraries, they still struggle with challenges such as insufficient illumination, extreme head poses, or occlusions, especially when they are constrained by the needs of real-time applications. Throughout this article, we propose a set of training strategies and implementations based on data augmentation, software optimization techniques that help in improving a large variety of models belonging to several real-time algorithms for face alignment. We propose an extended set of evaluation metrics that allow novel evaluations to mitigate the typical problems found in real-time tracking contexts. The experimental results show that the generated models using our proposed techniques are faster, smaller, more accurate, more robust in specific challenging conditions and smoother in tracking systems. In addition, the training strategy shows to be applicable across different types of devices and algorithms, making them versatile in both academic and industrial uses.
引用
收藏
页码:2239 / 2267
页数:29
相关论文
共 108 条
[1]  
Ahuja, 2015, INT J COMPUT APPL, V125, P13
[2]  
Alabort-i-Medina J., 2016, ARXIV160100199 CORR
[3]   The effects of adding noise during backpropagation training on a generalization performance [J].
An, GZ .
NEURAL COMPUTATION, 1996, 8 (03) :643-674
[4]  
AnduCjar C., 2018, ARXIV180408046 CORR
[5]  
[Anonymous], 2013, ZONE ISSUE, P1139
[6]  
[Anonymous], 2005, Proceedings-2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR 2005, DOI DOI 10.1109/CVPR.2005.268
[7]  
[Anonymous], 2011, P SCAL DAYS
[8]   Lucas-Kanade 20 years on: A unifying framework [J].
Baker, S ;
Matthews, I .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2004, 56 (03) :221-255
[9]   Localizing Parts of Faces Using a Consensus of Exemplars [J].
Belhumeur, Peter N. ;
Jacobs, David W. ;
Kriegman, David J. ;
Kumar, Neeraj .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (12) :2930-2940
[10]   TRAINING WITH NOISE IS EQUIVALENT TO TIKHONOV REGULARIZATION [J].
BISHOP, CM .
NEURAL COMPUTATION, 1995, 7 (01) :108-116