4D Facial Avatar Reconstruction From Monocular Video via Efficient and Controllable Neural Radiance Fields

Cited by: 1
Authors
Kwak, Jeong-Gi [1]
Ko, Hanseok [1]
Affiliations
[1] Korea Univ, Sch Elect Engn, Seoul 02841, South Korea
Keywords
Neural radiance field (NeRF); monocular facial avatar reconstruction; face reenactment
DOI
10.1109/ACCESS.2024.3355052
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
We present an efficient approach for monocular 4D facial avatar reconstruction using a dynamic neural radiance field (NeRF). Over the years, NeRFs have become popular for 3D scene representation, but they lack computational efficiency and controllability, which makes them impractical for real-world applications such as AR/VR, teleconferencing, and immersive experiences. The recent introduction of grid-based encoding by InstantNGP has made NeRF rendering much faster, but it is limited to static 3D scenes. To address these issues, we develop a novel dynamic NeRF that allows explicit control over pose and facial expression while retaining computational efficiency. By leveraging a low-dimensional basis from the 3D morphable model (3DMM) together with a carefully designed spatial encoding branch and ambient encoding branch, we condition a dynamic radiance field in an ambient space, improving controllability and visual quality. Our model is approximately 30x faster in training and 100x faster at inference than the baseline (NeRFace), making it practical for real-world applications. Qualitative and quantitative experiments demonstrate the effectiveness of our approach: the dynamic NeRF exhibits superior controllability, enhanced 3D consistency, and improved visual quality. Our efficient model opens new possibilities for real-time applications in AR/VR and teleconferencing.
Pages: 15675-15683
Number of pages: 9
Related papers
39 in total
  • [1] Dynamic Neural Radiance Fields for Monocular 4D Facial Avatar Reconstruction
    Gafni, Guy
    Thies, Justus
    Zollhoefer, Michael
    Niessner, Matthias
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021: 8645-8654
  • [2] Neural Radiance Fields (NeRF) for 3D Reconstruction of Monocular Endoscopic Video in Sinus Surgery
    Ruthberg, Jeremy S.
    Bly, Randall
    Gunderson, Nicole
    Chen, Pengcheng
    Alighezi, Mahdi
    Seibel, Eric J.
    Abuzeid, Waleed M.
    OTOLARYNGOLOGY-HEAD AND NECK SURGERY, 2025
  • [3] DyBluRF: Dynamic Neural Radiance Fields from Blurry Monocular Video
    Sun, Huiqiang
    Li, Xingyi
    Shen, Liao
    Ye, Xinyi
    Xian, Ke
    Cao, Zhiguo
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2024, 2024: 7517-7527
  • [4] Temporal residual neural radiance fields for monocular video dynamic human body reconstruction
    Du, Tianle
    Wang, Jie
    Xie, Xiaolong
    Li, Wei
    Su, Pengxiang
    Liu, Jie
    JOURNAL OF ELECTRONIC IMAGING, 2024, 33 (04)
  • [5] Controllable Free Viewpoint Video Reconstruction Based on Neural Radiance Fields and Motion Graphs
    Zhang, He
    Li, Fan
    Zhao, Jianhui
    Tan, Chao
    Shen, Dongming
    Liu, Yebin
    Yu, Tao
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2023, 29 (12): 4891-4905
  • [6] High-fidelity Facial Avatar Reconstruction from Monocular Video with Generative Priors
    Bai, Yunpeng
    Fan, Yanbo
    Wang, Xuan
    Zhang, Yong
    Sun, Jingxiang
    Yuan, Chun
    Shan, Ying
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023: 4541-4551
  • [7] Semantic-aware hyper-space deformable neural radiance fields for facial avatar reconstruction
    Jin, Kaixin
    Gu, Xiaoling
    Wang, Zimeng
    Kuang, Zhenzhong
    Wu, Zizhao
    Tan, Min
    Yu, Jun
    PATTERN RECOGNITION LETTERS, 2024, 185: 160-166
  • [8] DRSM: Efficient Neural 4D Decomposition for Dynamic Reconstruction in Stationary Monocular Cameras
    Xie, Weixing
    Dong, Xiao
    Yang, Yong
    Lin, Qiqin
    Chen, Jingze
    Yao, Junfeng
    Guo, Xiaohu
    2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024: 3760-3764
  • [9] Monocular thermal SLAM with neural radiance fields for 3D scene reconstruction
    Wu, Yuzhen
    Wang, Lingxue
    Zhang, Lian
    Chen, Mingkun
    Zhao, Wenqu
    Zheng, Dezhi
    Cai, Yi
    NEUROCOMPUTING, 2025, 617
  • [10] Neural Radiance Flow for 4D View Synthesis and Video Processing
    Du, Yilun
    Zhang, Yinan
    Yu, Hong-Xing
    Tenenbaum, Joshua B.
    Wu, Jiajun
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021: 14304-14314