Voice Interaction Using Gaussian Mixture Models for Augmented Reality Applications

Authors
Hamidia, Mahfoud [1, 2]
Zenati, Nadia [1 ]
Belghit, Hayet [1 ]
Guetiteni, Kamila [2 ]
Achour, Nouara [2 ]
Affiliations
[1] CDTA, BP 17, Algiers 16303, Algeria
[2] USTHB, Fac Elect & Comp Sci, Algiers 16111, Algeria
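Keywords
Augmented Reality (AR); voice interaction; Automatic Speech Recognition (ASR); ARToolKit; Gaussian Mixture Models (GMM)
DOI
Not available
Chinese Library Classification (CLC) codes
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification codes
0808; 0809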
Abstract
This paper addresses human-computer interaction techniques for Augmented Reality (AR) applications. AR aims to insert computer-generated 2D or 3D virtual objects into real video captured by a camera, and interaction in AR allows the user to take actions and control those virtual objects. In this work, an Automatic Speech Recognition (ASR) system based on Gaussian Mixture Models (GMM) is investigated for voice interaction in AR. Experimental results show good performance of the developed system. In addition, voice interaction provides an intuitive and natural workspace for interacting with the augmented environment.
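The abstract does not say which acoustic features, mixture sizes, or command vocabulary the authors used. As an illustration only, the following minimal sketch shows one common way GMM-based command recognition can be set up: one diagonal-covariance GMM is fitted per command, and an utterance is assigned to the command whose model gives the highest average log-likelihood. Everything named here is an assumption made for the example and not taken from the paper: the class `DiagGMM`, the helpers `fake_frames` and `recognize`, the command names, and the synthetic 2-D frames that stand in for real acoustic features such as MFCCs.

```python
import math
import random


def log_gauss_diag(x, mean, var):
    """Log density of a diagonal-covariance Gaussian at point x."""
    return -0.5 * sum(
        math.log(2 * math.pi * v) + (xi - m) ** 2 / v
        for xi, m, v in zip(x, mean, var)
    )


def logsumexp(values):
    """Numerically stable log(sum(exp(v))) over a list of floats."""
    top = max(values)
    return top + math.log(sum(math.exp(v - top) for v in values))


class DiagGMM:
    """Tiny diagonal-covariance GMM fitted with EM (illustrative only)."""

    def __init__(self, n_components, n_iter=30, seed=0):
        self.k = n_components
        self.n_iter = n_iter
        self.rng = random.Random(seed)

    def _frame_logs(self, x):
        # Per-component log(weight) + log density for one frame.
        return [math.log(w) + log_gauss_diag(x, m, v)
                for w, m, v in zip(self.weights, self.means, self.vars)]

    def fit(self, data):
        dim = len(data[0])
        self.weights = [1.0 / self.k] * self.k
        self.means = [list(x) for x in self.rng.sample(data, self.k)]
        self.vars = [[1.0] * dim for _ in range(self.k)]
        for _ in range(self.n_iter):
            # E-step: responsibility of each component for each frame.
            resp = []
            for x in data:
                logs = self._frame_logs(x)
                norm = logsumexp(logs)
                resp.append([math.exp(l - norm) for l in logs])
            # M-step: re-estimate weights, means and (floored) variances.
            for j in range(self.k):
                nj = sum(r[j] for r in resp) + 1e-10
                self.weights[j] = nj / len(data)
                self.means[j] = [
                    sum(r[j] * x[d] for r, x in zip(resp, data)) / nj
                    for d in range(dim)
                ]
                self.vars[j] = [
                    max(sum(r[j] * (x[d] - self.means[j][d]) ** 2
                            for r, x in zip(resp, data)) / nj, 1e-3)
                    for d in range(dim)
                ]
        return self

    def score(self, frames):
        """Average per-frame log-likelihood of an utterance."""
        return sum(logsumexp(self._frame_logs(x)) for x in frames) / len(frames)


def fake_frames(center, n, rng):
    """Synthetic stand-in for feature frames (assumption, not real audio)."""
    return [[rng.gauss(c, 0.5) for c in center] for _ in range(n)]


rng = random.Random(42)
# Hypothetical voice commands, each with a made-up feature cluster centre.
centers = {"rotate": [0.0, 0.0], "zoom": [4.0, 4.0], "move": [-4.0, 4.0]}
models = {cmd: DiagGMM(2).fit(fake_frames(c, 200, rng))
          for cmd, c in centers.items()}


def recognize(frames):
    """Return the command whose GMM scores the utterance highest."""
    return max(models, key=lambda cmd: models[cmd].score(frames))


print(recognize(fake_frames(centers["zoom"], 50, rng)))
```

In an AR setting, the label returned by `recognize` would then be mapped to an action on the virtual object. A real system would replace `fake_frames` with features extracted from microphone audio.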
Pages: 387 / +
Page count: 4