HeadStudio: Text to Animatable Head Avatars with 3D Gaussian Splatting

被引:0
|
作者
Zhou, Zhenglin [1 ,2 ]
Ma, Fan [2 ]
Fan, Hehe [2 ]
Yang, Zongxin [2 ]
Yang, Yi [1 ,2 ]
机构
[1] Zhejiang Univ, State Key Lab Brain Machine Intelligence, Hangzhou, Peoples R China
[2] Zhejiang Univ, ReLER, CCAI, Hangzhou, Peoples R China
来源
基金
国家重点研发计划; 中国国家自然科学基金; 中国博士后科学基金;
关键词
Head avatar animation; Text-guided generation; 3D Gaussian splatting;
D O I
10.1007/978-3-031-73411-3_9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Creating digital avatars from textual prompts has long been a desirable yet challenging task. Despite the promising results achieved with 2D diffusion priors, current methods struggle to create high-quality and consistent animated avatars efficiently. Previous animatable head models like FLAME have difficulty in accurately representing detailed texture and geometry. Additionally, high-quality 3D static representations face challenges in semantically driving with dynamic priors. In this paper, we introduce HeadStudio, a novel framework that utilizes 3D Gaussian splatting to generate realistic and animatable avatars from text prompts. Firstly, we associate 3D Gaussians with animatable head prior model, facilitating semantic animation on high-quality 3D representations. To ensure consistent animation, we further enhance the optimization from initialization, distillation, and regularization to jointly learn the shape, texture, and animation. Extensive experiments demonstrate the efficacy of HeadStudio in generating animatable avatars from textual prompts, exhibiting appealing appearances. The avatars are capable of rendering high-quality real-time (>= 40 fps) novel views at a resolution of 1024. Moreover, These avatars can be smoothly driven by real-world speech and video. We hope that HeadStudio can enhance digital avatar creation and gain popularity in the community. Code is at: https:// github.com/ZhenglinZhou/HeadStudio.
引用
收藏
页码:145 / 163
页数:19
相关论文
共 50 条
  • [1] HeadGaS: Real-Time Animatable Head Avatars via 3D Gaussian Splatting
    Dhamo, Helisa
    Nie, Yinyu
    Moreau, Arthur
    Song, Jifei
    Shaw, Richard
    Zhou, Yiren
    Perez-Pellitero, Eduardo
    COMPUTER VISION - ECCV 2024, PT II, 2025, 15060 : 459 - 476
  • [2] DreamHuman: Animatable 3D Avatars from Text
    Kolotouros, Nikos
    Alldieck, Thiemo
    Zanfir, Andrei
    Bazavan, Eduard Gabriel
    Fieraru, Mihai
    Sminchisescu, Cristian
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [3] Automatic construction of 3D animatable facial avatars
    Gao, Yujian
    Zhao, Qinping
    Hao, Aimin
    Sezgin, T. M.
    Dodgson, N. A.
    COMPUTER ANIMATION AND VIRTUAL WORLDS, 2010, 21 (3-4) : 343 - 354
  • [4] HeadSculpt: Crafting 3D Head Avatars with Text
    Han, Xiao
    Cao, Yukang
    Han, Kai
    Zhu, Xiatian
    Deng, Jiankang
    Song, Yi-Zhe
    Xiang, Tao
    Wong, Kwan-Yee K.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [5] DreamWaltz: Make a Scene with Complex 3D Animatable Avatars
    Huang, Yukun
    Wang, Jianan
    Zeng, Ailing
    Cao, He
    Qi, Xianbiao
    Shi, Yukai
    Zha, Zheng-Jun
    Zhang, Lei
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [6] Deblurring 3D Gaussian Splatting
    Lee, Byeonghyeon
    Lee, Howoong
    Sun, Xiangyu
    Alit, Usman
    Park, Eunbyung
    COMPUTER VISION - ECCV 2024, PT LVIII, 2025, 15116 : 127 - 143
  • [7] ATGT3D: Animatable Texture Generation and Tracking for 3D Avatars
    Chen, Fei
    Choi, Jaeho
    ELECTRONICS, 2024, 13 (22)
  • [8] 3D Gaussian Splatting with Deferred Reflection
    Ye, Keyang
    Hou, Qiming
    Zhou, Kun
    PROCEEDINGS OF SIGGRAPH 2024 CONFERENCE PAPERS, 2024,
  • [9] LangSplat: 3D Language Gaussian Splatting
    Qin, Minghan
    Li, Wanhua
    Zhou, Jiawei
    Wang, Haoqian
    Pfister, Hanspeter
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 20051 - 20060
  • [10] Recent advances in 3D Gaussian splatting
    Wu, Tong
    Yuan, Yu-Jie
    Zhang, Ling-Xiao
    Yang, Jie
    Cao, Yan-Pei
    Yan, Ling-Qi
    Gao, Lin
    COMPUTATIONAL VISUAL MEDIA, 2024, 10 (04) : 613 - 642