A Survey of Facial Capture for Virtual Reality

被引:9
作者
Wen, Lihang [1 ]
Zhou, Jianlong [1 ]
Huang, Weidong [2 ]
Chen, Fang [1 ]
机构
[1] Univ Technol Sydney, Fac Engn & IT, HCAI Lab, Ultimo, NSW 2007, Australia
[2] Univ Technol Sydney, TD Sch, Ultimo, NSW 2007, Australia
关键词
Headphones; Faces; Tracking; Solid modeling; Costs; Telepresence; Sensors; Facial capture; facial motion capture; facial performance capture; headset; virtual reality;
D O I
10.1109/ACCESS.2021.3138200
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The holograms in Star Wars have inspired many researchers to capture the whole human body in real-time and present it as an avatar into Virtual Reality (VR). Facial capture is important to achieve this because facial expressions are essential for social interaction. However, facial capture works well only when the face is not occluded. While a VR headset has been widely used as a display device for VR, it occludes half of the face and becomes an obstacle. This paper presents a systematic literature review on facial capture for VR. The survey aims to: review the current state-of-the-art facial capture technologies for VR; identify the types of technologies and context of the use, the methodologies, and theoretical groundings; identify research gaps in facial capture for VR; and identify solutions of facial capture for VR headsets. A realism index is defined to evaluate and compare the collected papers. The results show several technology trends in the facial capture for VR: tracking facial motion with markers, facial capture in headsets using cameras or sensors, facial performance capture, and hologram/volumetric capture. It is shown that the Modular Codec Avatar is the best facial capture method in a VR headset, whereas Metahuman has the best output effects. This paper also proposes that an open-design VR headset is an effective approach to lower the Total Cost of Ownership (TCO).
引用
收藏
页码:6042 / 6052
页数:11
相关论文
共 82 条
[1]   Inverse Rendering of Faces with a 3D Morphable Model [J].
Aldrian, Oswald ;
Smith, William A. P. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (05) :1080-1093
[2]  
[Anonymous], 2016, ARXIV161003151
[3]   High-Quality Single-Shot Capture of Facial Geometry [J].
Beeler, Thabo ;
Bickel, Bernd ;
Beardsley, Paul ;
Sumner, Bob ;
Gross, Markus .
ACM TRANSACTIONS ON GRAPHICS, 2010, 29 (04)
[4]   High-Quality Passive Facial Performance Capture using Anchor Frames [J].
Beeler, Thabo ;
Hahn, Fabian ;
Bradley, Derek ;
Bickel, Bernd ;
Beardsley, Paul ;
Gotsman, Craig ;
Sumner, Robert W. ;
Gross, Markus .
ACM TRANSACTIONS ON GRAPHICS, 2011, 30 (04)
[5]  
Bickel B, 2007, ACM T GRAPHIC, V26, DOI [10.1145/1239451.1239484, 10.1145/1276377.1276419]
[6]   High Resolution Passive Facial Performance Capture [J].
Bradley, Derek ;
Heidrich, Wolfgang ;
Popa, Tiberiu ;
Sheffer, Alla .
ACM TRANSACTIONS ON GRAPHICS, 2010, 29 (04)
[7]  
Bradley Derek., 2008, Computer Vision and Pattern Recognition, P1
[8]   Immersive Light Field Video with a Layered Mesh Representation [J].
Broxton, Michael ;
Flynn, John ;
Overbeck, Ryan ;
Erickson, Daniel ;
Hedman, Peter ;
Duvall, Matthew ;
Dourgarian, Jason ;
Busch, Jay ;
Whalen, Matt ;
Debevec, Paul .
ACM TRANSACTIONS ON GRAPHICS, 2020, 39 (04)
[9]   Real-Time High-Fidelity Facial Performance Capture [J].
Cao, Chen ;
Bradley, Derek ;
Zhou, Kun ;
Beeler, Thabo .
ACM TRANSACTIONS ON GRAPHICS, 2015, 34 (04)
[10]   Displaced Dynamic Expression Regression for Real-time Facial Tracking and Animation [J].
Cao, Chen ;
Hou, Qiming ;
Zhou, Kun .
ACM TRANSACTIONS ON GRAPHICS, 2014, 33 (04)