HeadStudio: Text to Animatable Head Avatars with 3D Gaussian Splatting

被引:0
|
作者
Zhou, Zhenglin [1 ,2 ]
Ma, Fan [2 ]
Fan, Hehe [2 ]
Yang, Zongxin [2 ]
Yang, Yi [1 ,2 ]
机构
[1] Zhejiang Univ, State Key Lab Brain Machine Intelligence, Hangzhou, Peoples R China
[2] Zhejiang Univ, ReLER, CCAI, Hangzhou, Peoples R China
来源
基金
国家重点研发计划; 中国国家自然科学基金; 中国博士后科学基金;
关键词
Head avatar animation; Text-guided generation; 3D Gaussian splatting;
D O I
10.1007/978-3-031-73411-3_9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Creating digital avatars from textual prompts has long been a desirable yet challenging task. Despite the promising results achieved with 2D diffusion priors, current methods struggle to create high-quality and consistent animated avatars efficiently. Previous animatable head models like FLAME have difficulty in accurately representing detailed texture and geometry. Additionally, high-quality 3D static representations face challenges in semantically driving with dynamic priors. In this paper, we introduce HeadStudio, a novel framework that utilizes 3D Gaussian splatting to generate realistic and animatable avatars from text prompts. Firstly, we associate 3D Gaussians with animatable head prior model, facilitating semantic animation on high-quality 3D representations. To ensure consistent animation, we further enhance the optimization from initialization, distillation, and regularization to jointly learn the shape, texture, and animation. Extensive experiments demonstrate the efficacy of HeadStudio in generating animatable avatars from textual prompts, exhibiting appealing appearances. The avatars are capable of rendering high-quality real-time (>= 40 fps) novel views at a resolution of 1024. Moreover, These avatars can be smoothly driven by real-world speech and video. We hope that HeadStudio can enhance digital avatar creation and gain popularity in the community. Code is at: https:// github.com/ZhenglinZhou/HeadStudio.
引用
收藏
页码:145 / 163
页数:19
相关论文
共 50 条
  • [41] FMGS: Foundation Model Embedded 3D Gaussian Splatting for Holistic 3D Scene Understanding
    Zuo, Xingxing
    Samangouei, Pouya
    Zhou, Yunwen
    Di, Yan
    Li, Mingyang
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2025, 133 (02) : 611 - 627
  • [42] FlashSplat: 2D to 3D Gaussian Splatting Segmentation Solved Optimally
    Shen, Qiuhong
    Yang, Xingyi
    Wang, Xinchao
    COMPUTER VISION-ECCV 2024, PT XXII, 2025, 15080 : 456 - 472
  • [43] DreamFace: Progressive Generation of Animatable 3D Faces under Text Guidance
    Zhang, Longwen
    Qiu, Qiwei
    Lin, Hongyang
    Zhang, Qixuan
    Shi, Cheng
    Yang, Wei
    Shi, Ye
    Yang, Sibei
    Xu, Lan
    Yu, Jingyi
    ACM TRANSACTIONS ON GRAPHICS, 2023, 42 (04):
  • [44] Per-Gaussian Embedding-Based Deformation for Deformable 3D Gaussian Splatting
    Bae, Jeongmin
    Kim, Seoha
    Yun, Youngsik
    Lee, Hahyun
    Bang, Gun
    Uh, Youngjung
    COMPUTER VISION - ECCV 2024, PT XV, 2025, 15073 : 321 - 335
  • [45] Gaussian Pancakes: Geometrically-Regularized 3D Gaussian Splatting for Realistic Endoscopic Reconstruction
    Bonilla, Sierra
    Zhang, Shuai
    Psychogyios, Dimitrios
    Stoyanov, Danail
    Vasconcelos, Francisco
    Bano, Sophia
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2024, PT VI, 2024, 15006 : 274 - 283
  • [46] 3D Gaussian Splatting for Real-Time Radiance Field Rendering
    Kerbl, Bernhard
    Kopanas, Georgios
    Leimkuehler, Thomas
    Drettakis, George
    ACM TRANSACTIONS ON GRAPHICS, 2023, 42 (04):
  • [47] Scene reconstruction techniques for autonomous driving: a review of 3D Gaussian splatting
    Zhu, Huixin
    Zhang, Zhili
    Zhao, Junyang
    Duan, Hui
    Ding, Yao
    Xiao, Xiongwu
    Yuan, Junsong
    ARTIFICIAL INTELLIGENCE REVIEW, 2024, 58 (01)
  • [48] Fast 3D Gaussian Splatting Rendering via Easily Integrable Improvements
    Diels, Laurens
    Vlaminck, Michiel
    Philips, Wilfried
    Luong, Hiep
    IEEE SIGNAL PROCESSING LETTERS, 2025, 32 : 381 - 385
  • [49] DehazeGS: 3D Gaussian Splatting for Multi-Image Haze Removal
    Ma, Chenjun
    Zhao, Jieyu
    Chen, Jian
    IEEE SIGNAL PROCESSING LETTERS, 2025, 32 : 736 - 740
  • [50] Large-Scale 3D Terrain Reconstruction Using 3D Gaussian Splatting for Visualization and Simulation
    Chen, Meida
    Lal, Devashish
    Yu, Zifan
    Xu, Jiuyi
    Feng, Andrew
    You, Suya
    Nurunnabi, Abdul
    Shi, Yangming
    MID-TERM SYMPOSIUM THE ROLE OF PHOTOGRAMMETRY FOR A SUSTAINABLE WORLD, VOL. 48-2, 2024, : 49 - 54