Synthesising 3D Facial Motion from "In-the-Wild" Speech

被引：6

作者：

Tzirakis, Panagiotis ^{[1
]}

Papaioannou, Athanasios ^{[1
]}

Lattas, Alexandros ^{[1
]}

Tarasiou, Michail ^{[1
]}

Schuller, Bjoern ^{[1
,2
]}

Zafeiriou, Stefanos ^{[1
]}

机构：

[1] Imperial Coll London, Dept Comp, London, England

[2] Univ Augsburg, ZD B Chair Embedded Intelligence Hlth Care & Well, Augsburg, Germany

来源：

2020 15TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2020) | 2020年

基金：

英国工程与自然科学研究理事会;

关键词：

D O I：

10.1109/FG47880.2020.00100

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Synthesising 3D facial motion from speech is a crucial problem manifesting in a multitude of applications such as computer games and movies. Recently proposed methods tackle this problem in controlled conditions of speech. In this paper, we introduce the first methodology for 3D facial motion synthesis from speech captured in arbitrary recording conditions ("in-the-wild") and independent of the speaker. For our purposes, we captured 4D sequences of people uttering 500 words, contained in the Lip Reading in the Wild (LRW) words, a publicly available large-scale in-the-wild dataset, and built a set of 3D blendshapes appropriate for speech. We correlate the 3D shape parameters of the speech blendshapes to the LRW audio samples by means of a novel time-warping technique, named Deep Canonical Attentional Warping (DCAW), that can simultaneously learn hierarchical non-linear representations and a warping path in an end-to-end manner. We thoroughly evaluate our proposed methods, and show the ability of a deep learning model to synthesise 3D facial motion in handling different speakers and continuous speech signals in uncontrolled conditions(1).

引用

页码：265 / 272

页数：8

共 50 条

[1] DeepFaceFlow: In-the-wild Dense 3D Facial Motion Estimation
Koujan, Mohammad Rami
Roussos, Anastasios
Zafeiriou, Stefanos
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 6617 - 6626
[2] AvatarMe: Realistically Renderable 3D Facial Reconstruction "in-the-wild"
Lattas, Alexandros
Moschoglou, Stylianos
Gecer, Baris
Ploumpis, Stylianos
Triantafyllou, Vasileios
Ghosh, Abhijeet
Zafeiriou, Stefanos
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 757 - 766
[3] TRAM: Global Trajectory and Motion of 3D Humans from in-the-Wild Videos
Wang, Yufu
Wang, Ziyun
Liu, Lingjie
Daniilidis, Kostas
COMPUTER VISION - ECCV 2024, PT XI, 2025, 15069 : 467 - 487
[4] Multimodal 2D and 3D for In-the-wild Facial Expression Recognition
Ly, Son Thai
Nhu-Tai Do
Lee, Guee-Sang
Kim, Soo-Hyung
Yang, Hyung-Jeong
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2019), 2019, : 2927 - 2934
[5] Realtime Dynamic 3D Facial Reconstruction for Monocular Video In-the-Wild
Liu, Shuang
Wang, Zhao
Yang, Xiaosong
Zhang, Jianjun
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2017), 2017, : 777 - 785
[6] A novel 2D and 3D multimodal approach for in-the-wild facial expression recognition
Ly, Thai Son
Do, Nhu-Tai
Kim, Soo-Hyung
Yang, Hyung-Jeong
Lee, Guee-Sang
IMAGE AND VISION COMPUTING, 2019, 92
[7] 3D Face Morphable Models "In-the-Wild"
Booth, James
Antonakos, Epameinondas
Ploumpis, Stylianos
Trigeorgis, George
Panagakis, Yannis
Zafeiriou, Stefanos
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 5464 - 5473
[8] A 3D FACE MODELING APPROACH FOR IN-THE-WILD FACIAL EXPRESSION RECOGNITION ON IMAGE DATASETS
Ly, Son Thai
Do, Nhu-Tai
Lee, Guee-Sang
Kim, Soo-Hyung
Yang, Hyung-Jeong
2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 3492 - 3496
[9] Learning to Restore 3D Face from In-the-Wild Degraded Images
Zhang, Zhenyu
Ge, Yanhao
Tai, Ying
Huang, Xiaoming
Wang, Chengjie
Tang, Hao
Huang, Dongjin
Xie, Zhifeng
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 4227 - 4237
[10] On Learning 3D Face Morphable Mode from In-the-Wild Images
Tran, Luan
Liu, Xiaoming
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (01) : 157 - 171

← 1 2 3 4 5 →