Alohomora: Motion-Based Hotword Detection in Head-Mounted Displays

被引:3
作者
Gu, Jiaxi [1 ]
Yu, Zhiwen [1 ]
Shen, Kele [2 ]
机构
[1] Northwestern Polytech Univ, Sch Comp Sci, Xian 710129, Peoples R China
[2] Cloud IoT Business Unit, Alibaba Grp, Hangzhou 311121, Peoples R China
基金
中国国家自然科学基金;
关键词
Motion detection; Resists; Speech recognition; Internet of Things; Microphones; Microsoft Windows; Facial muscles; Hotword detection; machine learning; motion sensor; virtual reality (VR); TIME; CLASSIFICATION; SIMILARITY; MODELS;
D O I
10.1109/JIOT.2019.2946593
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the development of multimedia and computer graphics technologies, virtual reality (VR) is attracting more and more attention from both the academic communities and industrial companies. A head-mounted display (HMD) is the core equipment of VR. It envelops the entire sight of the wearer and reacts to some specific actions, mainly the head movement. Different from common video watching or game playing, VR poses the strict requirement of immersion so interaction methods need to be carefully designed. The hotword-based interaction as a typical hands-free method is very suitable for VR scenarios. However, the traditional hotword detection methods use a microphone to permit audio signal analysis. They not only incur significant recording overheads but are also susceptible to the surrounding noises. Instead of using the audio signals, we propose a motion-based hotword detection method called Alohomora. A multivariate time series (MTS) classification is formulated for processing the sensor data from multiple dimensions and types of motion sensors. We use a word extraction method for extracting and selecting patterns from MTS of motion data. Then, a classification model is trained using those discriminative patterns and finally the hotword can be detected in time. Alohomora is purely based on the motion sensors in HMDs without using any extra components such as microphone. As head tracking is always necessary in VR applications themselves, the overhead of Alohomora is nearly negligible. Finally, through extensive experiments, the final detection accuracy of Alohomora can exceed 90.
引用
收藏
页码:611 / 620
页数:10
相关论文
共 28 条
[21]   Real-Time Onboard Object Detection for Augmented Reality: Enhancing Head-Mounted Display with YOLOv8 [J].
Lysakowski, Mikolaj ;
Zywanowski, Kamil ;
Banaszczyk, Adam ;
Nowicki, Michal R. ;
Skrzypczynski, Piotr ;
Tadeja, Slawomir K. .
2023 IEEE INTERNATIONAL CONFERENCE ON EDGE COMPUTING AND COMMUNICATIONS, EDGE, 2023, :364-371
[22]   MODEEP: a motion-based object detection and pose estimation method for airborne FLIR sequences [J].
Strehl, A ;
Aggarwal, JK .
MACHINE VISION AND APPLICATIONS, 2000, 11 (06) :267-276
[23]   Divergent Component of Motion-Based Gait Intention Detection Method Using Motion Information From Single Leg [J].
Oh, Hye-Won ;
Hong, Young-Dae .
JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2023, 107 (04)
[24]   SurgeonAssist-Net: Towards Context-Aware Head-Mounted Display-Based Augmented Reality for Surgical Guidance [J].
Doughty, Mitchell ;
Singh, Karan ;
Ghugre, Nilesh R. .
MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT IV, 2021, 12904 :667-677
[25]   Motion-based skin region of interest detection with a real-time connected component labeling algorithm [J].
Song, Wei ;
Wu, Dong ;
Xi, Yulong ;
Park, Yong Woon ;
Cho, Kyungeun .
MULTIMEDIA TOOLS AND APPLICATIONS, 2017, 76 (09) :11199-11214
[26]   Detection of step displacements in fMRI head motion data based on machine learning [J].
Plisko, Anastasia ;
Serafimovich, Paul ;
Davydov, Nikita ;
Nikonorov, Artem ;
Koush, Yury .
2020 VI INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND NANOTECHNOLOGY (IEEE ITNT-2020), 2020,
[27]   Graph-Based Motion Artifacts Detection Method from Head Computed Tomography Images [J].
Liu, Yiwen ;
Wen, Tao ;
Sun, Wei ;
Liu, Zhenyu ;
Song, Xiaoying ;
He, Xuan ;
Zhang, Shuo ;
Wu, Zhenning .
SENSORS, 2022, 22 (15)
[28]   Motion Artifact Detection Based on Regional-Temporal Graph Attention Network from Head Computed Tomography Images [J].
Liu, Yiwen ;
Wen, Tao ;
Wu, Zhenning .
ELECTRONICS, 2024, 13 (04)