Joint learning for face alignment and face transfer with depth image

被引:1
作者
Wang, Xiaoli [1 ]
Zheng, Yinglin [1 ]
Zeng, Ming [1 ]
Cheng, Xuan [1 ]
Lu, Wei [2 ]
机构
[1] Xiamen Univ, Sch Informat, Xiamen, Peoples R China
[2] Renmin Univ, Sch Informat, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
Face alignment; Cross-modal face transfer; Deep learning; Multi-task learning; Transfer learning;
D O I
10.1007/s11042-020-08873-y
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Face alignment and cross-modal face transfer are two important tasks for automatic face analysis in computer vision. Over the years, they have been extensively studied. Recently, deep neural networks have attracted much research attention for both face alignment and face transfer. With the prevalence of the consumer depth sensor, depth-based face alignment and cross-modal (image and depth) are increasingly important. Different from existing RGB- image based tasks, the main challenge of depth-based tasks is the lack of annotated data. To address the challenge, we observe that these two tasks are closely related and their learning processes may benefit each other. This paper develops a joint multi-task learning algorithm for both depth-based face alignment and face transfer using the deep convolutional neural network (CNN). The proposed approach allows the CNN model to simultaneously share visual knowledge and information between two tasks. We use a dataset of 10,000 face depth images for validation. Our experiments show that the proposed approach outperforms state-of-the-art algorithms. The results also show that learning these two related tasks simultaneously improves the performance of each individual task.
引用
收藏
页码:33993 / 34010
页数:18
相关论文
共 60 条
  • [1] [Anonymous], 2012, arXiv preprint arXiv:1203.6722.
  • [2] [Anonymous], 2018, ARXIV180210151
  • [3] [Anonymous], 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence
  • [4] [Anonymous], 2017, P ASM 36 INT C OC P ASM 36 INT C OC
  • [5] [Anonymous], 2017, ARXIV171006090
  • [6] Ben-David S., 2006, ADV NEURAL INFORM PR, P137
  • [7] A theory of learning from different domains
    Ben-David, Shai
    Blitzer, John
    Crammer, Koby
    Kulesza, Alex
    Pereira, Fernando
    Vaughan, Jennifer Wortman
    [J]. MACHINE LEARNING, 2010, 79 (1-2) : 151 - 175
  • [8] Estimating the Success of Unsupervised Image to Image Translation
    Benaim, Sagie
    Galanti, Tomer
    Wolf, Lior
    [J]. COMPUTER VISION - ECCV 2018, PT V, 2018, 11209 : 222 - 238
  • [9] POSEidon: Face-from-Depth for Driver Pose Estimation
    Borghi, Guido
    Venturelli, Marco
    Vezzani, Roberto
    Cucchiara, Rita
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 5494 - 5503
  • [10] Super-FAN: Integrated facial landmark localization and super-resolution of real-world low resolution faces in arbitrary poses with GANs
    Bulat, Adrian
    Tzimiropoulos, Georgios
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 109 - 117