A Statistical Quality Model for Data-Driven Speech Animation

被引:5
|
作者
Ma, Xiaohan [1 ]
Deng, Zhigang [1 ]
机构
[1] Univ Houston, Dept Comp Sci, Comp Graph Lab, Houston, TX 77204 USA
基金
美国国家科学基金会;
关键词
Facial animation; data-driven; visual speech animation; lip-sync; quality prediction; statistical models; CAPTURE; MOTION; PERCEPTION; FACES;
D O I
10.1109/TVCG.2012.67
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
In recent years, data-driven speech animation approaches have achieved significant successes in terms of animation quality. However, how to automatically evaluate the realism of novel synthesized speech animations has been an important yet unsolved research problem. In this paper, we propose a novel statistical model (called SAQP) to automatically predict the quality of on-the-fly synthesized speech animations by various data-driven techniques. Its essential idea is to construct a phoneme-based, Speech Animation Trajectory Fitting (SATF) metric to describe speech animation synthesis errors and then build a statistical regression model to learn the association between the obtained SATF metric and the objective speech animation synthesis quality. Through delicately designed user studies, we evaluate the effectiveness and robustness of the proposed SAQP model. To the best of our knowledge, this work is the first-of-its-kind, quantitative quality model for data-driven speech animation. We believe it is the important first step to remove a critical technical barrier for applying data-driven speech animation techniques to numerous online or interactive talking avatar applications.
引用
收藏
页码:1915 / 1927
页数:13
相关论文
共 50 条
  • [1] Data-driven animation of crowds
    Courty, Nicolas
    Corpetti, Thomas
    COMPUTER VISION/COMPUTER GRAPHICS COLLABORATION TECHNIQUES, 2007, 4418 : 377 - +
  • [2] Precomputing data-driven tree animation
    Zhang, Long
    Zhang, Yubo
    Jiang, Zhongding
    Li, Luying
    Chen, Wei
    Peng, Qunsheng
    COMPUTER ANIMATION AND VIRTUAL WORLDS, 2007, 18 (4-5) : 371 - 382
  • [3] Data-driven Autocompletion for Keyframe Animation
    Zhang, Xinyi
    van de Panne, Michiel
    ACM SIGGRAPH CONFERENCE ON MOTION, INTERACTION, AND GAMES (MIG 2018), 2018,
  • [4] Statistical methods in data-driven modeling of Spanish prosody for text to speech
    LopezGonzalo, E
    RodriguezGarcia, JM
    ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1377 - 1380
  • [5] Data-driven simulation in fluids animation: A survey
    Qian CHEN
    Yue WANG
    Hui WANG
    Xubo YANG
    虚拟现实与智能硬件(中英文), 2021, 3 (02) : 87 - 104
  • [6] Real Traffic Data-Driven Animation Simulation
    Yang, Xin
    Su, Wanchao
    Deng, Jian
    Pan, Zhigeng
    14TH ACM SIGGRAPH INTERNATIONAL CONFERENCE ON VIRTUAL REALITY CONTINUUM AND ITS APPLICATIONS IN INDUSTRY, VRCAI 2015, 2015, : 93 - 99
  • [7] Data-driven simulation in fluids animation: A survey
    Chen Q.
    Wang Y.
    Wang H.
    Yang X.
    Virtual Reality and Intelligent Hardware, 2021, 3 (02): : 87 - 104
  • [8] Flow Reconstruction for Data-Driven Traffic Animation
    Wilkie, David
    Sewall, Jason
    Lin, Ming
    ACM TRANSACTIONS ON GRAPHICS, 2013, 32 (04):
  • [9] Data-driven analysis of speech
    Hermansky, H
    TEXT, SPEECH AND DIALOGUE, 1999, 1692 : 10 - 18
  • [10] A data-driven non-intrusive measure of speech quality and intelligibility
    Sharma, Dushyant
    Wang, Yu
    Naylor, Patrick A.
    Brookes, Mike
    SPEECH COMMUNICATION, 2016, 80 : 84 - 94