4D Facial Avatar Reconstruction From Monocular Video via Efficient and Controllable Neural Radiance Fields

被引：1

作者：

Kwak, Jeong-Gi ^{[1
]}

Ko, Hanseok ^{[1
]}

机构：

[1] Korea Univ, Sch Elect Engn, Seoul 02841, South Korea

来源：

IEEE ACCESS | 2024年 / 12卷

关键词：

Neural radiance field (NeRF); monocular facial avatar reconstruction; face reenactment;

D O I：

10.1109/ACCESS.2024.3355052

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

We present an efficient approach for monocular 4D facial avatar reconstruction using a dynamic neural radiance field (NeRF). Over the years, NeRFs have been popular methods for 3D scene representation, but lack computational efficiency and controllabilty, thus it is impractical for real world application such as AR/VR, teleconferencing, and immersive experiences. Recent the introduction of grid-based encoding by InstantNGP has enabled the rendering process of NeRF much faster, but it is limited to static 3D scenes. To address the issues, we focus on developing a novel dynamic NeRF that allows explicit control over pose and facial expression, while keeping the computational efficiency. By leveraging a low-dimensional basis from the morphable model (3DMM) with elaborately designed spatial encoding branch and ambient encoding branch, we condition a dynamic radiance field in an ambient space, improving controllability and visual quality. Our model achieves rendering speeds approximately 30x faster at training and 100x faster at inference than the baseline (NeRFace), enabling practical approaches for real world applications. Through qualitative and quantitative experiments, we demonstrate the effectiveness of our approach. The dynamic NeRF exhibits superior controllability, enhanced 3D consistency, and improved visual quality. Our efficient model opens new possibilities for real-time applications, revolutionizing AR/VR and teleconferencing experiences.

引用

页码：15675 / 15683

页数：9

共 39 条

[1] Dynamic Neural Radiance Fields for Monocular 4D Facial Avatar Reconstruction
Gafni, Guy
Thies, Justus
Zollhoefer, Michael
Niessner, Matthias
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 8645 - 8654
[2] Neural Radiance Fields (NeRF) for 3D Reconstruction of Monocular Endoscopic Video in Sinus Surgery
Ruthberg, Jeremy S.
Bly, Randall
Gunderson, Nicole
Chen, Pengcheng
Alighezi, Mahdi
Seibel, Eric J.
Abuzeid, Waleed M.
OTOLARYNGOLOGY-HEAD AND NECK SURGERY, 2025,
[3] DyBluRF: Dynamic Neural Radiance Fields from Blurry Monocular Video
Sun, Huiqiang
Li, Xingyi
Shen, Liao
Ye, Xinyi
Xian, Ke
Cao, Zhiguo
2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2024, 2024, : 7517 - 7527
[4] Temporal residual neural radiance fields for monocular video dynamic human body reconstruction
Du, Tianle
Wang, Jie
Xie, Xiaolong
Li, Wei
Su, Pengxiang
Liu, Jie
JOURNAL OF ELECTRONIC IMAGING, 2024, 33 (04)
[5] Controllable Free Viewpoint Video Reconstruction Based on Neural Radiance Fields and Motion Graphs
Zhang, He
Li, Fan
Zhao, Jianhui
Tan, Chao
Shen, Dongming
Liu, Yebin
Yu, Tao
IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2023, 29 (12) : 4891 - 4905
[6] High-fidelity Facial Avatar Reconstruction from Monocular Video with Generative Priors
Bai, Yunpeng
Fan, Yanbo
Wang, Xuan
Zhang, Yong
Sun, Jingxiang
Yuan, Chun
Shan, Ying
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 4541 - 4551
[7] Semantic-aware hyper-space deformable neural radiance fields for facial avatar reconstruction
Jin, Kaixin
Gu, Xiaoling
Wang, Zimeng
Kuang, Zhenzhong
Wu, Zizhao
Tan, Min
Yu, Jun
PATTERN RECOGNITION LETTERS, 2024, 185 : 160 - 166
[8] DRSM: EFFICIENT NEURAL 4D DECOMPOSITION FOR DYNAMIC RECONSTRUCTION IN STATIONARY MONOCULAR CAMERAS
Xie, Weixing
Dong, Xiao
Yang, Yong
Lin, Qiqin
Chen, Jingze
Yao, Junfeng
Guo, Xiaohu
2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024, : 3760 - 3764
[9] Monocular thermal SLAM with neural radiance fields for 3D scene reconstruction
Wu, Yuzhen
Wang, Lingxue
Zhang, Lian
Chen, Mingkun
Zhao, Wenqu
Zheng, Dezhi
Cai, Yi
NEUROCOMPUTING, 2025, 617
[10] Neural Radiance Flow for 4D View Synthesis and Video Processing
Du, Yilun
Zhang, Yinan
Yu, Hong-Xing
Tenenbaum, Joshua B.
Wu, Jiajun
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 14304 - 14314

← 1 2 3 4 →