LiveHand: Real-time and Photorealistic Neural Hand Rendering

被引:3
作者
Mundra, Akshay [1 ,2 ]
Mallikarjun, B. R. [1 ]
Wang, Jiayi [1 ]
Habermann, Marc [1 ]
Theobalt, Christian [1 ,2 ]
Elgharib, Mohamed [1 ]
机构
[1] Max Planck Inst Informat, Saarbrucken, Germany
[2] Saarland Univ, Saarbrucken, Germany
来源
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023) | 2023年
关键词
D O I
10.1109/ICCV51070.2023.01653
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The human hand is the main medium through which we interact with our surroundings, making its digitization an important problem. While there are several works modeling the geometry of hands, little attention has been paid to capturing photo-realistic appearance. Moreover, for applications in extended reality and gaming, real-time rendering is critical. We present the first neural-implicit approach to photo-realistically render hands in real-time. This is a challenging problem as hands are textured and undergo strong articulations with pose-dependent effects. However, we show that this aim is achievable through our carefully designed method. This includes training on a low-resolution rendering of a neural radiance field, together with a 3D-consistent super-resolution module and mesh-guided sampling and space canonicalization. We demonstrate a novel application of perceptual loss on the image space, which is critical for learning details accurately. We also show a live demo where we photo-realistically render the human hand in real-time for the first time, while also modeling poseand view-dependent appearance effects. We ablate all our design choices and show that they optimize for rendering speed and quality. Video results and our code can be accessed from https://vcai.mpi-inf.mpg.de/projects/LiveHand/
引用
收藏
页码:17989 / 17999
页数:11
相关论文
共 45 条
[1]   imGHUM: Implicit Generative Models of 3D Human Shape and Articulated Pose [J].
Alldieck, Thiemo ;
Xu, Hongyi ;
Sminchisescu, Cristian .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :5441-5450
[2]  
[Anonymous], 2020, EUR C COMP VIS ECCV, DOI DOI 10.1109/DCABES50732.2020.00023
[3]  
Bagautdinov Timur, 2021, ACM T GRAPH, V40
[4]  
Bhatnagar Bharat Lal, 2020, EUR C COMP VIS ECCV
[5]   Authentic Volumetric Avatars from a Phone Scan [J].
Cao, Chen ;
Simon, Tomas ;
Kim, Jin Kyu ;
Schwartz, Gabe ;
Zollhoefer, Michael ;
Saito, Shun-Suke ;
Lombardi, Stephen ;
Wei, Shih-En ;
Belko, Danielle ;
Yu, Shoou-, I ;
Sheikh, Yaser ;
Saragih, Jason .
ACM TRANSACTIONS ON GRAPHICS, 2022, 41 (04)
[6]  
Chan Eric R., 2022, CVPR
[7]   LISA: Learning Implicit Shape and Appearance of Hands [J].
Corona, Enric ;
Hodan, Tomas ;
Vo, Minh ;
Moreno-Noguer, Francesc ;
Sweeney, Chris ;
Newcombe, Richard ;
Ma, Lingni .
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, :20501-20511
[8]   Reconstructing Personalized Semantic Facial NeRF Models From Monocular Video [J].
Gao, Xuan ;
Zhong, Chenglai ;
Xiang, Jun ;
Hong, Yang ;
Guo, Yudong ;
Zhang, Juyong .
ACM TRANSACTIONS ON GRAPHICS, 2022, 41 (06)
[9]   Neural Head Avatars from Monocular RGB Videos [J].
Grassal, Philip-William ;
Prinzler, Malte ;
Leistner, Titus ;
Rother, Carsten ;
Niessner, Matthias ;
Thies, Justus .
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, :18632-18643
[10]  
Habermann Marc, 2023, SCA 23