Synthesising 3D Facial Motion from "In-the-Wild" Speech

Cited by: 6
Authors
Tzirakis, Panagiotis [1]
Papaioannou, Athanasios [1]
Lattas, Alexandros [1]
Tarasiou, Michail [1]
Schuller, Bjoern [1, 2]
Zafeiriou, Stefanos [1]
Affiliations
[1] Imperial Coll London, Dept Comp, London, England
[2] Univ Augsburg, ZD B Chair Embedded Intelligence Hlth Care & Well, Augsburg, Germany
Source
2020 15TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2020) | 2020
Funding
UK Engineering and Physical Sciences Research Council (EPSRC);
DOI
10.1109/FG47880.2020.00100
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Synthesising 3D facial motion from speech is a crucial problem with applications such as computer games and movies. Recently proposed methods tackle this problem only under controlled recording conditions. In this paper, we introduce the first methodology for 3D facial motion synthesis from speech captured in arbitrary recording conditions ("in-the-wild") and independent of the speaker. For our purposes, we captured 4D sequences of people uttering the 500 words contained in Lip Reading in the Wild (LRW), a publicly available large-scale in-the-wild dataset, and built a set of 3D blendshapes appropriate for speech. We correlate the 3D shape parameters of the speech blendshapes to the LRW audio samples by means of a novel time-warping technique, named Deep Canonical Attentional Warping (DCAW), that can simultaneously learn hierarchical non-linear representations and a warping path in an end-to-end manner. We thoroughly evaluate our proposed methods and show that a deep learning model can synthesise 3D facial motion across different speakers and from continuous speech signals in uncontrolled conditions.
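The abstract refers to learning a "warping path" between audio and 3D shape parameters. As a rough illustration of what a warping path is, the sketch below implements classical dynamic time warping (DTW) with NumPy. It is not DCAW and is not taken from the paper: it has no learned representations and no attention mechanism, and the function name `dtw_path` and the toy inputs are hypothetical, chosen only for this example.

```python
import numpy as np


def dtw_path(x, y):
    """Classical dynamic time warping between two feature sequences.

    x has shape (n, d) and y has shape (m, d); 1-D inputs are treated
    as sequences of scalars. Returns (total_cost, path), where path is
    a list of (i, j) index pairs aligning x[i] with y[j].
    """
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    if x.ndim == 1:
        x = x[:, None]
    if y.ndim == 1:
        y = y[:, None]
    n, m = len(x), len(y)

    # Pairwise Euclidean distance between every frame of x and of y.
    dist = np.linalg.norm(x[:, None, :] - y[None, :, :], axis=2)

    # acc[i, j] is the minimal accumulated cost of aligning x[:i] with y[:j].
    acc = np.full((n + 1, m + 1), np.inf)
    acc[0, 0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            acc[i, j] = dist[i - 1, j - 1] + min(
                acc[i - 1, j], acc[i, j - 1], acc[i - 1, j - 1]
            )

    # Backtrack from the last cell to recover the warping path.
    path = []
    i, j = n, m
    while i > 0 and j > 0:
        path.append((i - 1, j - 1))
        steps = [
            (acc[i - 1, j - 1], i - 1, j - 1),  # advance both sequences
            (acc[i - 1, j], i - 1, j),          # advance x only
            (acc[i, j - 1], i, j - 1),          # advance y only
        ]
        _, i, j = min(steps)
    path.reverse()
    return float(acc[n, m]), path


# Toy example: y repeats one frame of x, so the path must stretch x there.
cost, path = dtw_path([0, 1, 2, 3], [0, 1, 1, 2, 3])
```

In this toy case the second frame of the first sequence is aligned with two consecutive frames of the second, which is the kind of temporal stretching a warping path encodes.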
Pages: 265-272
Number of pages: 8
Related papers
50 in total
  • [1] DeepFaceFlow: In-the-wild Dense 3D Facial Motion Estimation
    Koujan, Mohammad Rami
    Roussos, Anastasios
    Zafeiriou, Stefanos
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 6617 - 6626
  • [2] AvatarMe: Realistically Renderable 3D Facial Reconstruction "in-the-wild"
    Lattas, Alexandros
    Moschoglou, Stylianos
    Gecer, Baris
    Ploumpis, Stylianos
    Triantafyllou, Vasileios
    Ghosh, Abhijeet
    Zafeiriou, Stefanos
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 757 - 766
  • [3] TRAM: Global Trajectory and Motion of 3D Humans from in-the-Wild Videos
    Wang, Yufu
    Wang, Ziyun
    Liu, Lingjie
    Daniilidis, Kostas
    COMPUTER VISION - ECCV 2024, PT XI, 2025, 15069 : 467 - 487
  • [4] Multimodal 2D and 3D for In-the-wild Facial Expression Recognition
    Ly, Son Thai
    Nhu-Tai Do
    Lee, Guee-Sang
    Kim, Soo-Hyung
    Yang, Hyung-Jeong
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2019), 2019, : 2927 - 2934
  • [5] Realtime Dynamic 3D Facial Reconstruction for Monocular Video In-the-Wild
    Liu, Shuang
    Wang, Zhao
    Yang, Xiaosong
    Zhang, Jianjun
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2017), 2017, : 777 - 785
  • [6] A novel 2D and 3D multimodal approach for in-the-wild facial expression recognition
    Ly, Thai Son
    Do, Nhu-Tai
    Kim, Soo-Hyung
    Yang, Hyung-Jeong
    Lee, Guee-Sang
    IMAGE AND VISION COMPUTING, 2019, 92
  • [7] 3D Face Morphable Models "In-the-Wild"
    Booth, James
    Antonakos, Epameinondas
    Ploumpis, Stylianos
    Trigeorgis, George
    Panagakis, Yannis
    Zafeiriou, Stefanos
    30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 5464 - 5473
  • [8] A 3D FACE MODELING APPROACH FOR IN-THE-WILD FACIAL EXPRESSION RECOGNITION ON IMAGE DATASETS
    Ly, Son Thai
    Do, Nhu-Tai
    Lee, Guee-Sang
    Kim, Soo-Hyung
    Yang, Hyung-Jeong
    2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 3492 - 3496
  • [9] Learning to Restore 3D Face from In-the-Wild Degraded Images
    Zhang, Zhenyu
    Ge, Yanhao
    Tai, Ying
    Huang, Xiaoming
    Wang, Chengjie
    Tang, Hao
    Huang, Dongjin
    Xie, Zhifeng
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 4227 - 4237
  • [10] On Learning 3D Face Morphable Model from In-the-Wild Images
    Tran, Luan
    Liu, Xiaoming
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (01) : 157 - 171