Model-based Head Orientation Estimation for Smart Devices

被引:7
|
作者
Yang, Qiang [1 ]
Zheng, Yuanqing [1 ]
机构
[1] Hong Kong Polytech Univ, Hung Hom, Kowloon, 11 Yuk Choi Rd, Hong Kong, Peoples R China
来源
PROCEEDINGS OF THE ACM ON INTERACTIVE MOBILE WEARABLE AND UBIQUITOUS TECHNOLOGIES-IMWUT | 2021年 / 5卷 / 03期
关键词
acoustic sensing; head orientation; smart devices; SPEAKER POSITION; LOCALIZATION; CLASSIFICATION; TRACKING;
D O I
10.1145/3478089
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Voice interaction is friendly and convenient for users. Smart devices such as Amazon Echo allow users to interact with them by voice commands and become increasingly popular in our daily life. In recent years, research works focus on using the microphone array built in smart devices to localize the user's position, which adds additional context information to voice commands. In contrast, few works explore the user's head orientation, which also contains useful context information. For example, when a user says, "turn on the light", the head orientation could infer which light the user is referring to. Existing model-based works require a large number of microphone arrays to form an array network, while machine learning-based approaches need laborious data collection and training workload. The high deployment/usage cost of these methods is unfriendly to users. In this paper, we propose HOE, a model-based system that enables Head Orientation Estimation for smart devices with only two microphone arrays, which requires a lower training overhead than previous approaches. HOE first estimates the user's head orientation candidates by measuring the voice energy radiation pattern. Then, the voice frequency radiation pattern is leveraged to obtain the final result. Real-world experiments are conducted, and the results show that HOE can achieve a median estimation error of 23 degrees. To the best of our knowledge, HOE is the first model-based attempt to estimate the head orientation by only two microphone arrays without the arduous data training overhead.
引用
收藏
页数:24
相关论文
共 50 条
  • [1] Localization of a Single Source with Orientation-Aware Smart Devices
    Tunon, D.
    Taghavi, T.
    Chamberland, J. -F.
    Huff, G. H.
    2013 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP), 2013, : 117 - 120
  • [2] Online Model-based Gait Age and Gender Estimation
    Shehata, Allam
    Alsherfawi, Ammar
    Gaher, Levin
    Li, Xiang
    Makihara, Yasushi
    Yagi, Yasushi
    2023 IEEE INTERNATIONAL JOINT CONFERENCE ON BIOMETRICS, IJCB, 2023,
  • [3] Bayesian estimation of membership uncertainty in model-based clustering
    Chen, Liyuan
    Brown, Steven D.
    JOURNAL OF CHEMOMETRICS, 2014, 28 (05) : 358 - 369
  • [4] Two-tensor model-based bootstrapping on classified tensor morphologies: estimation of uncertainty in fiber orientation and probabilistic tractography
    Ratnarajah, Nagulan
    Simmons, Andrew
    Bertoni, Miguel
    Hojjatoleslami, Ali
    MAGNETIC RESONANCE IMAGING, 2013, 31 (02) : 296 - 312
  • [5] Model-based State Estimation of Two-Wheelers
    Wirth, Florian
    Wadephul, Julian
    Scheid, Alexander
    Fernandez-Lopez, Carlos
    Stiller, Christoph
    2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2022), 2022, : 5382 - 5388
  • [6] Spatial orientation: Model-based approach to multi-sensory mechanisms
    Kheradmand, Amir
    Otero-Millan, Jorge
    MATHEMATICAL MODELLING IN MOTOR NEUROSCIENCE: STATE OF THE ART AND TRANSLATION TO THE CLINIC. OCULAR MOTOR PLANT AND GAZE STABILIZATION MECHANISMS, 2019, 248 : 209 - 223
  • [7] Head orientation estimation using gait observation
    Nakazawa, Mitsuru
    Mitsugami, Ikuhisa
    Yamazoe, Hirotake
    Yagi, Yasushi
    IPSJ Transactions on Computer Vision and Applications, 2014, 6 : 63 - 67
  • [8] Tracking Human Poses with Head Orientation Estimation
    TIAN Jinglan
    WANG Zhengyuan
    LI Ling
    LIU Wanquan
    Instrumentation, 2017, 4 (03) : 40 - 46
  • [9] Model-based car tracking through the integration of search and estimation
    Sahli, H
    Mertens, M
    Cornelis, J
    ENHANCED AND SYNTHETIC VISION 1998, 1998, 3364 : 160 - 166
  • [10] Virtual acoustic environments for comprehensive evaluation of model-based hearing devices
    Grimm, Giso
    Luberadzka, Joanna
    Hohmann, Volker
    INTERNATIONAL JOURNAL OF AUDIOLOGY, 2018, 57 : S112 - S117