Subjective and Objective Quality Assessment of Rendered Human Avatar Videos in Virtual Reality

被引:0
作者
Chen, Yu-Chih [1 ]
Saha, Avinab [1 ]
Chapiro, Alexandre [2 ]
Hane, Christian [2 ]
Bazin, Jean-Charles [2 ]
Qiu, Bo [2 ]
Zanetti, Stefano [2 ]
Katsavounidis, Ioannis [2 ]
Bovik, Alan C. [1 ]
机构
[1] Univ Texas Austin, Dept Elect & Comp Engn, Lab Image & Video Engn LIVE, Austin, TX 94025 USA
[2] Meta Platforms Inc, Menlo Pk, CA 94025 USA
基金
美国国家科学基金会;
关键词
Avatars; Videos; Three-dimensional displays; Quality assessment; Monitoring; Databases; Predictive models; Visualization; Solid modeling; Image coding; Virtual reality; video quality assessment; 3D mesh; human avatar video; six degrees of freedom; VISUAL QUALITY; POINT CLOUDS; MESH; INFORMATION;
D O I
10.1109/TIP.2024.3468881
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We study the visual quality judgments of human subjects on digital human avatars (sometimes referred to as "holograms" in the parlance of virtual reality [VR] and augmented reality [AR] systems) that have been subjected to distortions. We also study the ability of video quality models to predict human judgments. As streaming human avatar videos in VR or AR become increasingly common, the need for more advanced human avatar video compression protocols will be required to address the tradeoffs between faithfully transmitting high-quality visual representations while adjusting to changeable bandwidth scenarios. During transmission over the internet, the perceived quality of compressed human avatar videos can be severely impaired by visual artifacts. To optimize trade-offs between perceptual quality and data volume in practical workflows, video quality assessment (VQA) models are essential tools. However, there are very few VQA algorithms developed specifically to analyze human body avatar videos, due, at least in part, to the dearth of appropriate and comprehensive datasets of adequate size. Towards filling this gap, we introduce the LIVE-Meta Rendered Human Avatar VQA Database, which contains 720 human avatar videos processed using 20 different combinations of encoding parameters, labeled by corresponding human perceptual quality judgments that were collected in six degrees of freedom VR headsets. To demonstrate the usefulness of this new and unique video resource, we use it to study and compare the performances of a variety of state-of-the-art Full Reference and No Reference video quality prediction models, including a new model called HoloQA. As a service to the research community, we publicly releases the metadata of the new database at https://live.ece.utexas.edu/research/LIVE-Meta-rendered-human-avatar/index.html.
引用
收藏
页码:5740 / 5754
页数:15
相关论文
共 80 条
[11]  
[Anonymous], 2012, document 500-13,
[12]   Driving-Signal Aware Full-Body Avatars [J].
Bagautdinov, Timur ;
Wu, Chenglei ;
Simon, Tomas ;
Prada, Fabian ;
Shiratori, Takaaki ;
Wei, Shih-En ;
Xu, Weipeng ;
Sheikh, Yaser ;
Saragih, Jason .
ACM TRANSACTIONS ON GRAPHICS, 2021, 40 (04)
[13]   Progressive compression of arbitrary textured meshes [J].
Caillaud, F. ;
Vidal, V. ;
Dupont, F. ;
Lavoue, G. .
COMPUTER GRAPHICS FORUM, 2016, 35 (07) :475-484
[14]   Visual Quality of Compressed Mesh and Point Cloud Sequences [J].
Cao, Keming ;
Xu, Yi ;
Cosman, Pamela .
IEEE ACCESS, 2020, 8 :171203-171217
[15]   GAMIVAL: Video Quality Prediction on Mobile Cloud Gaming Content [J].
Chen, Yu-Chih ;
Saha, Avinab ;
Davis, Chase ;
Qiu, Bo ;
Wang, Xiaoming ;
Gowda, Rahul ;
Katsavounidis, Ioannis ;
Bovik, Alan C. .
IEEE SIGNAL PROCESSING LETTERS, 2023, 30 :324-328
[16]  
Cruz LAD, 2019, INT WORK QUAL MULTIM, DOI [10.1109/LARS-SBR-WRE48964.2019.00009, 10.1109/qomex.2019.8743258]
[17]   Perceptual Quality Assessment for 3D Triangle Mesh Based on Curvature [J].
Dong, Lu ;
Fang, Yuming ;
Lin, Weisi ;
Seah, Hock Soon .
IEEE TRANSACTIONS ON MULTIMEDIA, 2015, 17 (12) :2174-2184
[18]   ChipQA: No-Reference Video Quality Prediction via Space-Time Chips [J].
Ebenezer, Joshua Peter ;
Shang, Zaixi ;
Wu, Yongjun ;
Wei, Hai ;
Sethuraman, Sriram ;
Bovik, Alan C. .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 :8059-8074
[19]  
Ebrahimi Touradj, 2019, INT WORKSHOP QUALITY
[20]   A No-reference Quality Assessment Metric for Point Cloud Based on Captured Video Sequences [J].
Fan, Yu ;
Zhang, Zicheng ;
Sun, Wei ;
Min, Xiongkuo ;
Liu, Ning ;
Zhou, Quan ;
He, Jun ;
Wang, Qiyuan ;
Zhai, Guangtao .
2022 IEEE 24TH INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2022,