Subjective and Objective Quality Assessment of Rendered Human Avatar Videos in Virtual Reality

被引:0
作者
Chen, Yu-Chih [1 ]
Saha, Avinab [1 ]
Chapiro, Alexandre [2 ]
Hane, Christian [2 ]
Bazin, Jean-Charles [2 ]
Qiu, Bo [2 ]
Zanetti, Stefano [2 ]
Katsavounidis, Ioannis [2 ]
Bovik, Alan C. [1 ]
机构
[1] Univ Texas Austin, Dept Elect & Comp Engn, Lab Image & Video Engn LIVE, Austin, TX 94025 USA
[2] Meta Platforms Inc, Menlo Pk, CA 94025 USA
基金
美国国家科学基金会;
关键词
Avatars; Videos; Three-dimensional displays; Quality assessment; Monitoring; Databases; Predictive models; Visualization; Solid modeling; Image coding; Virtual reality; video quality assessment; 3D mesh; human avatar video; six degrees of freedom; VISUAL QUALITY; POINT CLOUDS; MESH; INFORMATION;
D O I
10.1109/TIP.2024.3468881
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We study the visual quality judgments of human subjects on digital human avatars (sometimes referred to as "holograms" in the parlance of virtual reality [VR] and augmented reality [AR] systems) that have been subjected to distortions. We also study the ability of video quality models to predict human judgments. As streaming human avatar videos in VR or AR become increasingly common, the need for more advanced human avatar video compression protocols will be required to address the tradeoffs between faithfully transmitting high-quality visual representations while adjusting to changeable bandwidth scenarios. During transmission over the internet, the perceived quality of compressed human avatar videos can be severely impaired by visual artifacts. To optimize trade-offs between perceptual quality and data volume in practical workflows, video quality assessment (VQA) models are essential tools. However, there are very few VQA algorithms developed specifically to analyze human body avatar videos, due, at least in part, to the dearth of appropriate and comprehensive datasets of adequate size. Towards filling this gap, we introduce the LIVE-Meta Rendered Human Avatar VQA Database, which contains 720 human avatar videos processed using 20 different combinations of encoding parameters, labeled by corresponding human perceptual quality judgments that were collected in six degrees of freedom VR headsets. To demonstrate the usefulness of this new and unique video resource, we use it to study and compare the performances of a variety of state-of-the-art Full Reference and No Reference video quality prediction models, including a new model called HoloQA. As a service to the research community, we publicly releases the metadata of the new database at https://live.ece.utexas.edu/research/LIVE-Meta-rendered-human-avatar/index.html.
引用
收藏
页码:5740 / 5754
页数:15
相关论文
共 80 条
[1]   No-reference mesh visual quality assessment via ensemble of convolutional neural networks and compact multi-linear pooling [J].
Abouelaziz, Ilyass ;
Chetouani, Aladine ;
El Hassouni, Mohammed ;
Latecki, Longin Jan ;
Cherifi, Hocine .
PATTERN RECOGNITION, 2020, 100
[2]  
Abouelaziz I, 2017, IEEE IMAGE PROC, P755, DOI 10.1109/ICIP.2017.8296382
[3]   A Curvature based method for blind mesh visual quality assessment using a general regression neural network [J].
Abouelaziz, Ilyass ;
El Hassouni, Mohammed ;
Cherifi, Hocine .
2016 12TH INTERNATIONAL CONFERENCE ON SIGNAL-IMAGE TECHNOLOGY & INTERNET-BASED SYSTEMS (SITIS), 2016, :793-797
[4]   No-Reference 3D Mesh Quality Assessment Based on Dihedral Angles Model and Support Vector Regression [J].
Abouelaziz, Ilyass ;
El Hassouni, Mohammed ;
Cherifi, Hocine .
IMAGE AND SIGNAL PROCESSING (ICISP 2016), 2016, 9680 :369-377
[5]  
Alexiou E., 2018, P IEEE INT C MULT EX, P1
[6]   PointXR: A toolbox for visualization and subjective evaluation of point clouds in virtual reality [J].
Alexiou, Evangelos ;
Yang, Nanyang ;
Ebrahimi, Touradj .
2020 TWELFTH INTERNATIONAL CONFERENCE ON QUALITY OF MULTIMEDIA EXPERIENCE (QOMEX), 2020,
[7]  
Alexiou E, 2018, INT WORK QUAL MULTIM, P132
[8]  
Alexiou E, 2017, IEEE INT WORKSH MULT
[9]   On the performance of metrics to predict quality in point cloud representations [J].
Alexiou, Evangelos ;
Ebrahimi, Touradj .
APPLICATIONS OF DIGITAL IMAGE PROCESSING XL, 2017, 10396
[10]  
[Anonymous], 2010, P ACM WORKSHOP 3D OB, DOI DOI 10.1145/1877808.1877819