Vision-Based Technique and Issues for Multimodal Interaction in Augmented Reality

Cited by: 8
Authors
Ismail, Ajune Wanis [1]
Billinghurst, Mark [2]
Sunar, Mohd Shahrizal [1]
Affiliations
[1] Univ Teknol Malaysia, UTM IRDA Digital Media Ctr, MaGIC X Media & Games Innovat Ctr Excellence, Skudai Johor 81310, Malaysia
[2] Univ Canterbury, Human Interface Technol Lab New Zealand HITLabNZ, Christchurch 8041, New Zealand
Source
8TH INTERNATIONAL SYMPOSIUM ON VISUAL INFORMATION COMMUNICATION AND INTERACTION (VINCI 2015) | 2015
Keywords
Augmented Reality; Multimodal Interaction; Vision Technique; Computer
DOI
10.1145/2801040.2801058
Chinese Library Classification
TP301 [Theory, Methods]
Subject classification code
081202
Abstract
Although much progress has been made in multimodal interaction, most researchers still treat each modality, such as vision and speech, separately and integrate the results only at the application stage. This is because the roles of the individual modalities and their interactions have yet to be quantified and precisely understood. However, many issues remain even when each modality is handled individually. Based on our review of multimodal applications, this paper highlights the main vision problems. It also gives an overview of the Augmented Reality (AR) technologies that contribute to most recent multimodal applications. We cluster vision techniques according to the natural human senses, such as face, gesture, and speech, that are frequently used in multimodal applications. The main contribution of this paper is to consolidate some of the main issues and approaches in vision-based techniques, and to study some of the AR applications that have been developed within the context of multimodal interaction. We conclude with future directions.
Pages: 75-82
Page count: 8