Audio-visual perception-based multimodal HCI

Cited by: 2
Authors
Yang, Shu [1]
Guan, Ye-peng [1,2]
Affiliations
[1] Shanghai Univ, Sch Commun & Informat Engn, Shanghai 200444, Peoples R China
[2] Minist Educ, Key Lab Adv Displays & Syst Applicat, Shanghai 200444, Peoples R China
Source
JOURNAL OF ENGINEERING-JOE | 2018 / Issue 04
DOI
10.1049/joe.2017.0333
Chinese Library Classification (CLC) number
T [Industrial Technology]
Subject classification code
08
Abstract
Human-computer interaction (HCI) has great potential for applications in many fields. The diversity of users' interaction habits and low recognition rates are the main factors limiting its development. This paper constructs a framework for multi-modality-based HCI. The interactive target is determined from different modalities, including gaze, hand pointing and speech, in a non-contact and non-wearable way. The corresponding response is fed back to users promptly in audio-visual form, giving an immersive experience. In addition, a decision matrix-based fusion strategy is proposed to improve the system's accuracy and to adapt to different interaction habits. The system runs on ordinary hardware in a crowded scene and does not assume that the interactive user or that user's actions are known in advance. Comparative experiments indicate that the proposed method is more robust and has better real-time performance in real scenes than the methods it is compared against.
Pages: 190-198
Number of pages: 9
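
The abstract names a decision matrix-based fusion strategy but gives no details, so the following is only a generic illustration of decision-level fusion, not the authors' method. It assumes that each modality (gaze, hand pointing, speech) outputs one confidence score per candidate target, that these rows form a matrix, and that each modality has a fixed weight. The function names, weights and scores are all hypothetical.

```python
from typing import Dict, List, Optional


def fuse_decisions(
    scores: Dict[str, Optional[List[float]]],
    weights: Dict[str, float],
) -> List[float]:
    """Fuse per-modality target scores into one score per target.

    `scores` maps a modality name to its row of the decision matrix
    (one confidence per candidate target), or to None when that
    modality produced no decision. This weighted-sum rule is an
    assumption for illustration, not the rule used in the paper.
    """
    # Keep only the modalities that actually produced a decision.
    active = {m: row for m, row in scores.items() if row is not None}
    if not active:
        raise ValueError("no modality produced a decision")

    n_targets = len(next(iter(active.values())))
    if any(len(row) != n_targets for row in active.values()):
        raise ValueError("all rows must score the same set of targets")

    # Renormalize the weights over the active modalities so that a
    # user who only points, or only speaks, is still handled.
    total_weight = sum(weights[m] for m in active)
    fused = [0.0] * n_targets
    for modality, row in active.items():
        share = weights[modality] / total_weight
        for j, score in enumerate(row):
            fused[j] += share * score
    return fused


def pick_target(fused: List[float]) -> int:
    """Return the index of the target with the highest fused score."""
    return max(range(len(fused)), key=fused.__getitem__)


# Hypothetical weights and scores for three candidate targets.
weights = {"gaze": 0.5, "pointing": 0.3, "speech": 0.2}
without_speech = {
    "gaze": [0.7, 0.2, 0.1],
    "pointing": [0.2, 0.7, 0.1],
    "speech": None,  # the user said nothing
}
with_speech = dict(without_speech, speech=[0.0, 1.0, 0.0])

print(pick_target(fuse_decisions(without_speech, weights)))
print(pick_target(fuse_decisions(with_speech, weights)))
```

Renormalizing the weights over the modalities that are present is one simple way to tolerate different interaction habits. Whether the paper does something similar cannot be determined from the abstract.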