Alohomora: Motion-Based Hotword Detection in Head-Mounted Displays

Cited by: 3
Authors
Gu, Jiaxi [1]
Yu, Zhiwen [1]
Shen, Kele [2]
Affiliations
[1] Northwestern Polytech Univ, Sch Comp Sci, Xian 710129, Peoples R China
[2] Cloud IoT Business Unit, Alibaba Grp, Hangzhou 311121, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Motion detection; Resists; Speech recognition; Internet of Things; Microphones; Microsoft Windows; Facial muscles; Hotword detection; machine learning; motion sensor; virtual reality (VR); TIME; CLASSIFICATION; SIMILARITY; MODELS;
DOI
10.1109/JIOT.2019.2946593
Chinese Library Classification
TP [Automation and Computer Technology];
Discipline classification code
0812;
Abstract
With the development of multimedia and computer graphics technologies, virtual reality (VR) is attracting growing attention from both academia and industry. A head-mounted display (HMD) is the core equipment of VR. It covers the wearer's entire field of view and reacts to specific actions, mainly head movement. Unlike ordinary video watching or game playing, VR imposes a strict requirement of immersion, so interaction methods need to be carefully designed. Hotword-based interaction, a typical hands-free method, is well suited to VR scenarios. However, traditional hotword detection methods rely on a microphone for audio signal analysis. They not only incur significant recording overhead but are also susceptible to ambient noise. Instead of using audio signals, we propose a motion-based hotword detection method called Alohomora. Hotword detection is formulated as a multivariate time series (MTS) classification problem over sensor data from multiple types and dimensions of motion sensors. We use a word extraction method to extract and select patterns from the MTS of motion data. A classification model is then trained on those discriminative patterns, so that the hotword can be detected in time. Alohomora relies purely on the motion sensors in HMDs, without any extra components such as a microphone. Because head tracking is already required by VR applications, the overhead of Alohomora is nearly negligible. Extensive experiments show that the detection accuracy of Alohomora can exceed 90%.
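The pipeline outlined in the abstract (extract words from each motion channel, then classify on the resulting patterns) can be illustrated with a minimal sketch. The abstract does not specify the word extraction method or the classifier, so everything here is an assumption for illustration: a SAX-style discretisation with a three-letter alphabet stands in for word extraction, a 1-nearest-neighbour rule over word histograms stands in for the trained model, and the channel names `gyro_x` and `acc_y` and all sample values are hypothetical toy data.

```python
from collections import Counter
from statistics import mean, pstdev


def znorm(window):
    """Z-normalise one window; a flat window maps to all zeros."""
    mu, sigma = mean(window), pstdev(window)
    if sigma == 0:
        return [0.0] * len(window)
    return [(x - mu) / sigma for x in window]


def to_word(window, segments=3):
    """SAX-style word (assumed stand-in): average each segment, map to a/b/c."""
    z = znorm(window)
    size = len(z) // segments
    letters = []
    for i in range(segments):
        level = mean(z[i * size:(i + 1) * size])
        # -0.43 and 0.43 are the usual SAX breakpoints for a 3-letter alphabet.
        letters.append("a" if level < -0.43 else "c" if level > 0.43 else "b")
    return "".join(letters)


def bag_of_words(mts, window=6, step=3):
    """Count words over sliding windows, tagging each word with its channel."""
    bag = Counter()
    for channel, series in mts.items():
        for start in range(0, len(series) - window + 1, step):
            bag[f"{channel}:{to_word(series[start:start + window])}"] += 1
    return bag


def similarity(a, b):
    """Histogram intersection between two word bags."""
    return sum(min(a[k], b[k]) for k in a.keys() & b.keys())


def classify(sample_bag, labelled_bags):
    """Return the label of the most similar training bag (1-nearest neighbour)."""
    return max(labelled_bags, key=lambda item: similarity(sample_bag, item[1]))[0]


# Hypothetical toy recordings: an oscillating channel versus a steady drift.
hot = {"gyro_x": [0, 1, 0, -1] * 3, "acc_y": [0] * 12}
other = {"gyro_x": list(range(12)), "acc_y": [0] * 12}
train = [("hotword", bag_of_words(hot)), ("other", bag_of_words(other))]

# Same oscillation at twice the amplitude; z-normalisation removes the scale.
query = {"gyro_x": [0, 2, 0, -2] * 3, "acc_y": [0] * 12}
print(classify(bag_of_words(query), train))
```

Prefixing each word with its channel name keeps patterns from different sensors apart, which is one simple way to handle the multivariate case. A real system would also need pattern selection and a trained classifier, as the abstract describes.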
Pages: 611-620
Page count: 10