3D realistic talking face co-driven by text and speech

被引:0
作者
Song, MG [1 ]
Chen, C [1 ]
Bu, JJ [1 ]
Liang, RH [1 ]
机构
[1] Zhejiang Univ, Coll Comp Sci, Hangzhou 310027, Peoples R China
来源
2003 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS, VOLS 1-5, CONFERENCE PROCEEDINGS | 2003年
关键词
visemes' transcription; speech segmentation; time vector extraction; static viseme; dynamic visemes generation;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
To create 3D realistic talking face has been a challenge for a long time. Previous works emphasize text or speech driven talking face respectively while the animation result is not very realistic or natural-looking. In the proposed approach, text and speech are considered to drive the 3D talking face coordinately. The text is translated into a sequence of visemes' transcription. And time vector of the sequence is extracted from the speech corresponding to the text after it is segmented into phonetic sequence. A muscle based viseme vector is defined for static viseme. And then, with the time vector and the static visemes's sequence, dynamic visemes are generated through time-relate dominance function. Finally, according to the frame rate to be rendered, intermediate frames are interpolated between key frames to make the animation result looks more natural and realistic than those obtained based on the text or speech-driven only.
引用
收藏
页码:2175 / 2180
页数:6
相关论文
共 15 条
  • [1] May I talk to you?: Facial Animation from Text
    Albrecht, I
    Haber, J
    Kähler, K
    Schröder, M
    Seidel, HP
    [J]. 10TH PACIFIC CONFERENCE ON COMPUTER GRAPHICS AND APPLICATIONS, PROCEEDINGS, 2002, : 77 - 86
  • [2] Albrecht I, 2002, WSCG'2002, VOLS I AND II, CONFERENCE PROCEEDINGS, P9
  • [3] [Anonymous], P ACM MULT
  • [4] Cassell J, 2001, COMP GRAPH, P477, DOI 10.1145/383259.383315
  • [5] Pump it up: computer animation of a biomechanically based model of muscle using the finite element method
    Chen, David T.
    Zeltzer, David
    [J]. Computer Graphics (ACM), 1992, 26 (02): : 89 - 98
  • [6] Choi KH, 2002, IEEE IMAGE PROC, P984
  • [7] Cohen M. M., 1993, Models and Techniques in Computer Animation, P139
  • [8] Making discourse visible: Coding and animating conversational facial displays
    DeCarlo, D
    Revilla, C
    Stone, M
    Venditti, JJ
    [J]. CA 2002: PROCEEDINGS OF THE COMPUTER ANIMATION 2002, 2002, : 11 - 16
  • [9] GOFF BL, 1996, P INT C SPOK LANG PR, P2163
  • [10] Real-time speech-driven 3D face animation
    Hong, PY
    Wen, Z
    Huang, TS
    Shum, HY
    [J]. FIRST INTERNATIONAL SYMPOSIUM ON 3D DATA PROCESSING VISUALIZATION AND TRANSMISSION, 2002, : 713 - 716