Real-time speech-driven face animation with expressions using neural networks

Cited by: 70
Authors
Hong, PY [1]
Wen, Z [1]
Huang, TS [1]
Affiliation
[1] Univ Illinois, Beckman Inst Adv Sci & Technol, Urbana, IL 61801 USA
Source
IEEE TRANSACTIONS ON NEURAL NETWORKS | 2002, Vol. 13, No. 4
Funding
National Science Foundation (USA)
Keywords
facial deformation modeling; facial motion analysis and synthesis; neural networks; real-time speech-driven; talking face with expressions;
DOI
10.1109/TNN.2002.1021892
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
A real-time speech-driven synthetic talking face provides an effective multimodal communication interface in distributed collaboration environments. Nonverbal gestures such as facial expressions are important to human communication and should be considered by speech-driven face animation systems. In this paper, we present a framework that systematically addresses facial deformation modeling, automatic facial motion analysis, and real-time speech-driven face animation with expressions using neural networks. Based on this framework, we learn a quantitative visual representation of facial deformations, called motion units (MUs). Any facial deformation can be approximated by a linear combination of the MUs weighted by MU parameters (MUPs). We develop an MU-based facial motion tracking algorithm, which is used to collect an audio-visual training database. Then, we construct a real-time audio-to-MUP mapping by training a set of neural networks on the collected audio-visual training database. The quantitative evaluation of the mapping shows the effectiveness of the proposed approach. Using the proposed method, we develop the functionality of real-time speech-driven face animation with expressions for the iFACE system. Experimental results show that the synthetic expressive talking face of the iFACE system is comparable with a real face in terms of the effectiveness of their influences on bimodal human emotion perception.
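The linear model stated in the abstract (a facial deformation approximated by a combination of MUs weighted by MUPs) can be illustrated with a minimal Python sketch. The abstract gives no formula, so the details here are assumptions made for illustration only: the neutral `mean_shape` offset, the orthonormality of the MUs used for the projection step, and the names `synthesize_deformation` and `project_onto_mus` are all hypothetical and are not taken from the paper or from the iFACE system.

```python
from typing import List, Sequence

Vector = List[float]


def synthesize_deformation(mean_shape: Sequence[float],
                           motion_units: Sequence[Sequence[float]],
                           mups: Sequence[float]) -> Vector:
    """Return mean_shape + sum_i mups[i] * motion_units[i].

    Assumption: a neutral mean shape is added to the weighted sum of MUs.
    """
    if len(motion_units) != len(mups):
        raise ValueError("need exactly one MUP per MU")
    result = list(mean_shape)
    for weight, unit in zip(mups, motion_units):
        if len(unit) != len(result):
            raise ValueError("each MU must match the shape dimension")
        for k, component in enumerate(unit):
            result[k] += weight * component
    return result


def project_onto_mus(deformation: Sequence[float],
                     mean_shape: Sequence[float],
                     motion_units: Sequence[Sequence[float]]) -> Vector:
    """Recover MUPs as dot products with the centered deformation.

    Assumption: the MUs are orthonormal, so projection is a dot product.
    """
    centered = [d - m for d, m in zip(deformation, mean_shape)]
    return [sum(u * c for u, c in zip(unit, centered))
            for unit in motion_units]


# Toy example: a 4-dimensional "face" with two orthonormal MUs.
mean_shape = [0.0, 0.0, 0.0, 0.0]
motion_units = [[1.0, 0.0, 0.0, 0.0],
                [0.0, 1.0, 0.0, 0.0]]
mups = [2.0, -1.0]

deformed = synthesize_deformation(mean_shape, motion_units, mups)
recovered = project_onto_mus(deformed, mean_shape, motion_units)
```

In this reading, the audio-to-MUP mapping described in the abstract would supply the `mups` vector for each audio frame, and a synthesis step like the one above would turn it into a face shape.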
Pages: 916-927
Number of pages: 12
Related papers
50 in total
  • [1] Real-time speech-driven 3D face animation
    Hong, PY
    Wen, Z
    Huang, TS
    Shum, HY
    FIRST INTERNATIONAL SYMPOSIUM ON 3D DATA PROCESSING VISUALIZATION AND TRANSMISSION, 2002, : 713 - 716
  • [2] SYNTHESIZING REAL-TIME SPEECH-DRIVEN FACIAL ANIMATION
    Luo, Changwei
    Yu, Jun
    Wang, Zengfu
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014
  • [3] Real-time speech-driven animation of expressive talking faces
    Liu, Jia
    You, Mingyu
    Chen, Chun
    Song, Mingli
    INTERNATIONAL JOURNAL OF GENERAL SYSTEMS, 2011, 40 (04) : 439 - 455
  • [4] Towards Realistic Real Time Speech-Driven Facial Animation
    Cerekovic, Aleksandra
    Zoric, Goranka
    Smid, Karlo
    Pandzic, Igor S.
    INTELLIGENT VIRTUAL AGENTS, PROCEEDINGS, 2008, 5208 : 476 - 478
  • [5] Real-Time Speech Driven Gesture Animation
    Kasarci, Kenan
    Bozkurt, Elif
    Yemez, Yucel
    Erzin, Engin
    2016 24TH SIGNAL PROCESSING AND COMMUNICATION APPLICATION CONFERENCE (SIU), 2016, : 1917 - 1920
  • [6] Speech-driven facial animation using a hierarchical model
    Cosker, DP
    Marshall, AD
    Rosin, PL
    Hicks, YA
    IEE PROCEEDINGS-VISION IMAGE AND SIGNAL PROCESSING, 2004, 151 (04) : 314 - 321
  • [7] Speech-driven talking face using embedded confusable system for real time mobile multimedia
    Po-Yi Shih
    Anand Paul
    Jhing-Fa Wang
    Yi-Hung Chen
    Multimedia Tools and Applications, 2014, 73 : 417 - 437
  • [8] Speech-driven talking face using embedded confusable system for real time mobile multimedia
    Shih, Po-Yi
    Paul, Anand
    Wang, Jhing-Fa
    Chen, Yi-Hung
    MULTIMEDIA TOOLS AND APPLICATIONS, 2014, 73 (01) : 417 - 437
  • [9] Speech-driven animation with meaningful behaviors
    Sadoughi, Najmeh
    Busso, Carlos
    SPEECH COMMUNICATION, 2019, 110 : 90 - 100
  • [10] Expressive speech-driven facial animation
    Cao, Y
    Tien, WC
    Faloutsos, P
    Pighin, F
    ACM TRANSACTIONS ON GRAPHICS, 2005, 24 (04) : 1283 - 1302