Voice Interaction Using Gaussian Mixture Models for Augmented Reality Applications

被引:0
|
作者
Hamidia, Mahfoud [1 ,2 ]
Zenati, Nadia [1 ]
Belghit, Hayet [1 ]
Guetiteni, Kamila [2 ]
Achour, Nouara [2 ]
机构
[1] CDTA, BP 17, Algiers 16303, Algeria
[2] USTHB, Fac Elect & Comp Sci, Algiers 16111, Algeria
关键词
Augmented Reality (AR); voice interaction; Automatique Speech Recognition (ASR); ARToolKit; Gaussian Mixture Models (GMM);
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper addresses the human computer interaction techniques for Augmented Reality (AR) applications. In fact, AR aims at inserting 2D or 3D virtual object generated by the computer in a real video filmed by a camera. On the other hand, the interaction in AR allows the user to take an action and control the virtual objects. In this work, Automatic Speech Recognition (ASR) system based on Gaussian Mixture Models (GMM) is investigated for voice interaction in AR. Experimental results show that good performance of the developed system. Also, the voice interaction provides an intuitive and a natural workspace for interacting with the augmented environment.
引用
收藏
页码:387 / +
页数:4
相关论文
共 50 条
  • [1] Voice Conversion Using Gaussian Mixture Models
    D'souza, Kevin
    Talele, K. T. V.
    2015 INTERNATIONAL CONFERENCE ON COMMUNICATION, INFORMATION & COMPUTING TECHNOLOGY (ICCICT), 2015,
  • [2] STORYTELLING VOICE CONVERSION: EVALUATION EXPERIMENT USING GAUSSIAN MIXTURE MODELS
    Pribil, Jiri
    Pribilova, Anna
    Durackova, Daniela
    JOURNAL OF ELECTRICAL ENGINEERING-ELEKTROTECHNICKY CASOPIS, 2015, 66 (04): : 194 - 202
  • [3] Voice conversion based on Gaussian processes by using kernels modeling the spectral density with Gaussian mixture models
    Bao, Jingyi
    Xu, Ning
    MODERN PHYSICS LETTERS B, 2018, 32 (34-36):
  • [4] Advanced Interaction Techniques for Augmented Reality Applications
    Billinghurst, Mark
    Kato, Hirokazu
    Myojin, Seiko
    VIRTUAL AND MIXED REALITY, PROCEEDINGS, 2009, 5622 : 13 - +
  • [5] Integrating Peripheral Interaction into Augmented Reality Applications
    Schipor, Ovidiu-Andrei
    Vatavu, Radu-Daniel
    Wu, Wenjun
    ADJUNCT PROCEEDINGS OF THE 2019 IEEE INTERNATIONAL SYMPOSIUM ON MIXED AND AUGMENTED REALITY (ISMAR-ADJUNCT 2019), 2019, : 358 - 359
  • [6] Audio-Visual Emotion Recognition using Gaussian Mixture Models for Face and Voice
    Metallinou, Angeliki
    Lee, Sungbok
    Narayanan, Shrikanth
    ISM: 2008 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA, 2008, : 250 - 257
  • [7] Voice Conversion Using Structrued Gaussian Mixture Model
    Zeng, Daojian
    Yu, Yibiao
    2010 IEEE 10TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS (ICSP2010), VOLS I-III, 2010, : 541 - 544
  • [8] Voice Interaction for Augmented Reality Navigation Interfaces with Natural Language Understanding
    Zhao, Junhong
    Parry, Christopher James
    dos Anjos, Rafael
    Anslow, Craig
    Rhee, Taehyun
    2020 35TH INTERNATIONAL CONFERENCE ON IMAGE AND VISION COMPUTING NEW ZEALAND (IVCNZ), 2020,
  • [9] Coding using Gaussian mixture and generalized Gaussian models
    Su, JK
    Mersereau, RM
    INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, PROCEEDINGS - VOL I, 1996, : 217 - 220
  • [10] Multimodal Interaction Concepts for Mobile Augmented Reality Applications
    Hurst, Wolfgang
    van Wezel, Casper
    ADVANCES IN MULTIMEDIA MODELING, PT II, 2011, 6524 : 157 - 167