Joint learning for face alignment and face transfer with depth image

被引：1

作者：

Wang, Xiaoli ^{[1
]}

Zheng, Yinglin ^{[1
]}

Zeng, Ming ^{[1
]}

Cheng, Xuan ^{[1
]}

Lu, Wei ^{[2
]}

机构：

[1] Xiamen Univ, Sch Informat, Xiamen, Peoples R China

[2] Renmin Univ, Sch Informat, Beijing, Peoples R China

来源：

MULTIMEDIA TOOLS AND APPLICATIONS | 2020年 / 79卷 / 45-46期

基金：

中国国家自然科学基金;

关键词：

Face alignment; Cross-modal face transfer; Deep learning; Multi-task learning; Transfer learning;

D O I：

10.1007/s11042-020-08873-y

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Face alignment and cross-modal face transfer are two important tasks for automatic face analysis in computer vision. Over the years, they have been extensively studied. Recently, deep neural networks have attracted much research attention for both face alignment and face transfer. With the prevalence of the consumer depth sensor, depth-based face alignment and cross-modal (image and depth) are increasingly important. Different from existing RGB- image based tasks, the main challenge of depth-based tasks is the lack of annotated data. To address the challenge, we observe that these two tasks are closely related and their learning processes may benefit each other. This paper develops a joint multi-task learning algorithm for both depth-based face alignment and face transfer using the deep convolutional neural network (CNN). The proposed approach allows the CNN model to simultaneously share visual knowledge and information between two tasks. We use a dataset of 10,000 face depth images for validation. Our experiments show that the proposed approach outperforms state-of-the-art algorithms. The results also show that learning these two related tasks simultaneously improves the performance of each individual task.

引用

页码：33993 / 34010

页数：18

共 60 条

[1] [Anonymous], 2012, arXiv preprint arXiv:1203.6722.
[2] [Anonymous], 2018, ARXIV180210151
[3] [Anonymous], 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence
[4] [Anonymous], 2017, P ASM 36 INT C OC P ASM 36 INT C OC
[5] [Anonymous], 2017, ARXIV171006090
[6] Ben-David S., 2006, ADV NEURAL INFORM PR, P137
[7] A theory of learning from different domains
Ben-David, Shai
Blitzer, John
Crammer, Koby
Kulesza, Alex
Pereira, Fernando
Vaughan, Jennifer Wortman
[J]. MACHINE LEARNING, 2010, 79 (1-2) : 151 - 175
[8] Estimating the Success of Unsupervised Image to Image Translation
Benaim, Sagie
Galanti, Tomer
Wolf, Lior
[J]. COMPUTER VISION - ECCV 2018, PT V, 2018, 11209 : 222 - 238
[9] POSEidon: Face-from-Depth for Driver Pose Estimation
Borghi, Guido
Venturelli, Marco
Vezzani, Roberto
Cucchiara, Rita
[J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 5494 - 5503
[10] Super-FAN: Integrated facial landmark localization and super-resolution of real-world low resolution faces in arbitrary poses with GANs
Bulat, Adrian
Tzimiropoulos, Georgios
[J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 109 - 117

← 1 2 3 4 5 6 →