PRECYSE: Predicting Cybersickness using Transformer for Multimodal Time-Series Sensor Data

被引:5
作者
Jeong, Dayoung [1 ]
Han, Kyungsik [1 ]
机构
[1] Hanyang Univ, Seoul, South Korea
来源
PROCEEDINGS OF THE ACM ON INTERACTIVE MOBILE WEARABLE AND UBIQUITOUS TECHNOLOGIES-IMWUT | 2024年 / 8卷 / 02期
基金
新加坡国家研究基金会;
关键词
Virtual reality; Cybersickness; Transformer; Multimodal time-series sensor data; MOTION SICKNESS; CONFLICT; SIGNALS;
D O I
10.1145/3659594
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Cybersickness, a factor that hinders user immersion in VR, has been the subject of ongoing attempts to predict it using AI. Previous studies have used CNN and LSTM for prediction models and used attention mechanisms and XAI for data analysis, yet none explored a transformer that can better reflect the spatial and temporal characteristics of the data, beneficial for enhancing prediction and feature importance analysis. In this paper, we propose cybersickness prediction models using multimodal time-series sensor data (i.e., eye movement, head movement, and physiological signals) based on a transformer algorithm, considering sensor data pre-processing and multimodal data fusion methods. We constructed the MSCVR dataset consisting of normalized sensor data, spectrogram formatted sensor data, and cybersickness levels collected from 45 participants through a user study. We proposed two methods for embedding multimodal time-series sensor data into the transformer: modality-specific spatial and temporal transformer encoders for normalized sensor data (MS-STTN) and modality-specific spatial-temporal transformer encoder for spectrogram (MS-STTS). MS-STTN yielded the highest performance in the ablation study and the comparison of the existing models. Furthermore, by analyzing the importance of data features, we determined their relevance to cybersickness over time, especially the salience of eye movement features. Our results and insights derived from multimodal time-series sensor data and the transformer model provide a comprehensive understanding of cybersickness and its association with sensor data. Our MSCVR dataset and code are publicly available: https://github.com/dayoung- jeong/PRECYSE.git.
引用
收藏
页数:24
相关论文
共 95 条
[31]  
Hettinger L J, 1990, Mil Psychol, V2, P171, DOI 10.1207/s15327876mp0203_4
[32]   Linear and quadratic time-frequency signal representations [J].
Hlawatsch, F. ;
Boudreaux-Bartels, G. F. .
IEEE SIGNAL PROCESSING MAGAZINE, 1992, 9 (02) :21-67
[33]  
Hu ZX, 2022, IEEE VEH TECHNOL MAG, V17, P57, DOI [10.1109/MVT.2021.3140047, 10.16790/j.cnki.1009-9239.im.2022.12.001]
[34]   Multimodal Transformer for Nursing Activity Recognition [J].
Ijaz, Momal ;
Diaz, Renato ;
Chen, Chen .
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022, 2022, :2064-2073
[35]   Towards Forecasting the Onset of Cybersickness by Fusing Physiological, Head-tracking and Eye-tracking with Multimodal Deep Fusion Network [J].
Islam, Rifatul ;
Desai, Kevin ;
Quarles, John .
2022 IEEE INTERNATIONAL SYMPOSIUM ON MIXED AND AUGMENTED REALITY (ISMAR 2022), 2022, :121-130
[36]   Cybersickness Prediction from Integrated HMD's Sensors: A Multimodal Deep Fusion Approach using Eye-tracking and Head-tracking Data [J].
Islam, Rifatul ;
Desai, Kevin ;
Quarles, John .
2021 IEEE INTERNATIONAL SYMPOSIUM ON MIXED AND AUGMENTED REALITY (ISMAR 2021), 2021, :31-40
[37]   CyberSense: A Closed-Loop Framework to Detect Cybersickness Severity and Adaptively apply Reduction Techniques [J].
Islam, Rifatul ;
Ang, Samuel ;
Quarles, John .
2021 IEEE CONFERENCE ON VIRTUAL REALITY AND 3D USER INTERFACES ABSTRACTS AND WORKSHOPS (VRW 2021), 2021, :148-155
[38]   Automatic Detection and Prediction of Cybersickness Severity using Deep Neural Networks from user's Physiological Signals [J].
Islam, Rifatul ;
Lee, Yonggun ;
Jaloli, Mehrad ;
Muhammad, Imtiaz ;
Zhu, Dakai ;
Rad, Paul ;
Huang, Yufei ;
Quarles, John .
2020 IEEE INTERNATIONAL SYMPOSIUM ON MIXED AND AUGMENTED REALITY (ISMAR 2020), 2020, :400-411
[39]   Applying Machine Learning Techniques to Transportation Mode Recognition Using Mobile Phone Sensor Data [J].
Jahangiri, Arash ;
Rakha, Hesham A. .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2015, 16 (05) :2406-2417
[40]  
Jeong D, 2019, 2019 26TH IEEE CONFERENCE ON VIRTUAL REALITY AND 3D USER INTERFACES (VR), P827, DOI [10.1109/VR.2019.8798334, 10.1109/vr.2019.8798334]