A Statistical Quality Model for Data-Driven Speech Animation

被引：5

作者：

Ma, Xiaohan ^{[1
]}

Deng, Zhigang ^{[1
]}

机构：

[1] Univ Houston, Dept Comp Sci, Comp Graph Lab, Houston, TX 77204 USA

来源：

IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS | 2012年 / 18卷 / 11期

基金：

美国国家科学基金会;

关键词：

Facial animation; data-driven; visual speech animation; lip-sync; quality prediction; statistical models; CAPTURE; MOTION; PERCEPTION; FACES;

D O I：

10.1109/TVCG.2012.67

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

In recent years, data-driven speech animation approaches have achieved significant successes in terms of animation quality. However, how to automatically evaluate the realism of novel synthesized speech animations has been an important yet unsolved research problem. In this paper, we propose a novel statistical model (called SAQP) to automatically predict the quality of on-the-fly synthesized speech animations by various data-driven techniques. Its essential idea is to construct a phoneme-based, Speech Animation Trajectory Fitting (SATF) metric to describe speech animation synthesis errors and then build a statistical regression model to learn the association between the obtained SATF metric and the objective speech animation synthesis quality. Through delicately designed user studies, we evaluate the effectiveness and robustness of the proposed SAQP model. To the best of our knowledge, this work is the first-of-its-kind, quantitative quality model for data-driven speech animation. We believe it is the important first step to remove a critical technical barrier for applying data-driven speech animation techniques to numerous online or interactive talking avatar applications.

引用

页码：1915 / 1927

页数：13

共 50 条

[1] Data-driven animation of crowds
Courty, Nicolas
Corpetti, Thomas
COMPUTER VISION/COMPUTER GRAPHICS COLLABORATION TECHNIQUES, 2007, 4418 : 377 - +
[2] Precomputing data-driven tree animation
Zhang, Long
Zhang, Yubo
Jiang, Zhongding
Li, Luying
Chen, Wei
Peng, Qunsheng
COMPUTER ANIMATION AND VIRTUAL WORLDS, 2007, 18 (4-5) : 371 - 382
[3] Data-driven Autocompletion for Keyframe Animation
Zhang, Xinyi
van de Panne, Michiel
ACM SIGGRAPH CONFERENCE ON MOTION, INTERACTION, AND GAMES (MIG 2018), 2018,
[4] Statistical methods in data-driven modeling of Spanish prosody for text to speech
LopezGonzalo, E
RodriguezGarcia, JM
ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1377 - 1380
[5] Data-driven simulation in fluids animation: A survey
Qian CHEN
Yue WANG
Hui WANG
Xubo YANG
虚拟现实与智能硬件(中英文), 2021, 3 (02) : 87 - 104
[6] Real Traffic Data-Driven Animation Simulation
Yang, Xin
Su, Wanchao
Deng, Jian
Pan, Zhigeng
14TH ACM SIGGRAPH INTERNATIONAL CONFERENCE ON VIRTUAL REALITY CONTINUUM AND ITS APPLICATIONS IN INDUSTRY, VRCAI 2015, 2015, : 93 - 99
[7] Data-driven simulation in fluids animation: A survey
Chen Q.
Wang Y.
Wang H.
Yang X.
Virtual Reality and Intelligent Hardware, 2021, 3 (02): : 87 - 104
[8] Flow Reconstruction for Data-Driven Traffic Animation
Wilkie, David
Sewall, Jason
Lin, Ming
ACM TRANSACTIONS ON GRAPHICS, 2013, 32 (04):
[9] Data-driven analysis of speech
Hermansky, H
TEXT, SPEECH AND DIALOGUE, 1999, 1692 : 10 - 18
[10] A data-driven non-intrusive measure of speech quality and intelligibility
Sharma, Dushyant
Wang, Yu
Naylor, Patrick A.
Brookes, Mike
SPEECH COMMUNICATION, 2016, 80 : 84 - 94

← 1 2 3 4 5 →