Predicting user visual attention in virtual reality with a deep learning model

被引：0

作者：

Xiangdong Li

Yifei Shan

Wenqian Chen

Yue Wu

Praben Hansen

Simon Perrault

机构：

[1] Zhejiang University,College of Computer Science and Technology

[2] Stockholm University,Department of Computer Science and Systems

[3] ISTD,undefined

[4] Singapore University of Technology and Design,undefined

来源：

Virtual Reality | 2021年 / 25卷

关键词：

Visual attention; Virtual reality; Deep learning model; Eye tracking;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Recent studies show that user’s visual attention during virtual reality museum navigation can be effectively estimated with deep learning models. However, these models rely on large-scale datasets that usually are of high structure complexity and context specific, which is challenging for nonspecialist researchers and designers. Therefore, we present the deep learning model, ALRF, to generalise on real-time user visual attention prediction in virtual reality context. The model combines two parallel deep learning streams to process the compact dataset of temporal–spatial salient features of user’s eye movements and virtual object coordinates. The prediction accuracy outperformed the state-of-the-art deep learning models by reaching record high 91.03%. Importantly, with quick parametric tuning, the model showed flexible applicability across different environments of the virtual reality museum and outdoor scenes. Implications for how the proposed model may be implemented as a generalising tool for adaptive virtual reality application design and evaluation are discussed.

引用

页码：1123 / 1136

页数：13

共 197 条

[1] Barbieri L(2017)User-centered design of a virtual reality exhibit for archaeological museums Int J Inter Des Manuf (IJIDeM) 12 561-571
[2] Bruno F(2013)State-of-the-art in visual attention modeling IEEE Trans Pattern Anal Mach Intell 35 185-207
[3] Muzzupappa M(2016)Transfer learning with deep networks for saliency prediction in natural video IEEE Int Conf Image Process 2 81-84
[4] Borji A(2020)Deep learning for content-based personalized viewport prediction of 360-degree VR videos IEEE Netw Lett 20 39-68
[5] Itti L(2003)Transferring R&D knowledge: the key factors affecting knowledge transfer success J Eng Tech Manag 7 197-387
[6] Chaabouni S(2017)Measuring game experience using visual distractors Ext Abstr Publ Annu Sympos Comput-Hum Interact Play 22 744-759
[7] Benois-Pineau J(2014)Deep learning: methods and applications Found Trends Signal Process 23 3910-3921
[8] Amar CB(2017)Fixation prediction for 360 video streaming in head-mounted virtual reality Proc Workshop Netw Oper Syst Supp Digit Audio Video 26 4684-4696
[9] Chen X(2019)Optimizing fixation prediction using recurrent neural networks for 360° video streaming in head-mounted virtual reality IEEE Trans Multimed 15 11092-11117
[10] Kasgari ATZ(2014)Video saliency incorporating spatiotemporal cues and uncertainty weighting IEEE Trans Image Process 423 534-48

← 1 2 3 4 5 6 7 8 9 10 →