Composite recurrent network with internal denoising for facial alignment in still and video images in the wild

被引：9

作者：

Aspandi, Decky ^{[1
]}

Martinez, Oriol ^{[1
]}

Sukno, Federico ^{[1
]}

Binefa, Xavier ^{[1
]}

机构：

[1] Pompeu Fabra Univ, Dept Informat & Commun Technol, Barcelona, Spain

来源：

IMAGE AND VISION COMPUTING | 2021年 / 111卷

关键词：

Facial alignment; Facial tracking; Temporal modeling; Internal denoising; RECOGNITION; TRACKING; MODELS;

D O I：

10.1016/j.imavis.2021.104189

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Facial alignment is an essential task for many higher level facial analysis applications, such as animation, human activity recognition and human -computer interaction. Although the recent availability of big datasets and powerful deep-learning approaches have enabled major improvements on the state of the art accuracy, the performance of current approaches can severely deteriorate when dealing with images in highly unconstrained conditions, which limits the real-life applicability of such models. In this paper, we propose a composite recurrent tracker with internal denoising that jointly address both single image facial alignment and deformable facial tracking in the wild. Specifically, we incorporate multilayer LSTMs to model temporal dependencies with variable length and introduce an internal denoiser which selectively enhances the input images to improve the robustness of our overall model. We achieve this by combining 4 different sub-networks that specialize in each of the key tasks that are required, namely face detection, bounding-box tracking, facial region validation and facial alignment with internal denoising. These blocks are endowed with novel algorithms resulting in a facial tracker that is both accurate, robust to in-the-wild settings and resilient against drifting. We demonstrate this by testing our model on 300-W and Menpo datasets for single image facial alignment, and 300-VW dataset for deformable facial tracking. Comparison against 20 other state of the art methods demonstrates the excellent performance of the proposed approach. (c) 2021 The Author(s). Published by Elsevier B.V. This is an open access article under the CC BY license (http:// creativecommons.org/licenses/by/4.0/).

引用

页数：14

共 68 条

[1] AFIF4: Deep gender classification based on AdaBoost-based fusion of isolated facial features and foggy faces [J].

Afifi, Mahmoud ;

Abdelhamed, Abdelrahman .

JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2019, 62 :77-86

[2]

[Anonymous], 2018, INT J COMPUT VISION

[3] An Enhanced Adversarial Network with Combined Latent Features for Spatio-temporal Facial Affect Estimation in the Wild [J].

Aspandi, Decky ;

Sukno, Federico ;

Schuller, Bjoern ;

Binefa, Xavier .

VISAPP: PROCEEDINGS OF THE 16TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS - VOL. 4: VISAPP, 2021, :172-181

[4]

Aspandi D, 2019, IEEE INT CONF AUTOMA, P730

[5] Latent-Based Adversarial Neural Networks for Facial Affect Estimations [J].

Aspandi, Decky ;

Mallol-Ragolta, Adria ;

Schuller, Bjoern ;

Binefa, Xavier .

2020 15TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2020), 2020, :606-610

[6] Fully End-to-End Composite Recurrent Convolution Network for Deformable Facial Tracking In The Wild [J].

Aspandi, Decky ;

Martinez, Oriol ;

Sukno, Federico ;

Binefa, Xavier .

2019 14TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2019), 2019, :115-122

[7] Robust Facial Alignment with Internal Denoising Auto-Encoder [J].

Aspandi, Decky ;

Martinez, Oriol ;

Sukno, Federico ;

Binefa, Xavier .

2019 16TH CONFERENCE ON COMPUTER AND ROBOT VISION (CRV 2019), 2019, :143-150

[8] Incremental Face Alignment in the Wild [J].

Asthana, Akshay ;

Zafeiriou, Stefanos ;

Cheng, Shiyang ;

Pantic, Maja .

2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :1859-1866

[9] OpenFace 2.0: Facial Behavior Analysis Toolkit [J].

Baltrusaitis, Tadas ;

Zadeh, Amir ;

Lim, Yao Chong ;

Morency, Louis-Philippe .

PROCEEDINGS 2018 13TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE & GESTURE RECOGNITION (FG 2018), 2018, :59-66

[10] Eigenfaces vs. Fisherfaces: Recognition using class specific linear projection [J].

Belhumeur, PN ;

Hespanha, JP ;

Kriegman, DJ .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1997, 19 (07) :711-720

← 1 2 3 4 5 6 7 →