Multimodal approaches for emotion recognition: A survey

被引:70
作者
Sebe, N [1 ]
Cohen, I [1 ]
Gevers, T [1 ]
Huang, TS [1 ]
机构
[1] Univ Amsterdam, Fac Sci, NL-1012 WX Amsterdam, Netherlands
来源
INTERNET IMAGING VI | 2005年 / 5670卷
关键词
emotion recognition; multimodal approach; human-computer interaction;
D O I
10.1117/12.600746
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Recent technological advances have enabled human users to interact with computers in ways previously unimaginable. Beyond the confines of the keyboard and mouse, new modalities for human-computer interaction such as voice, gesture, and force-feedback are emerging. Despite important advances, one necessary ingredient for natural interaction is still missing-emotions. Emotions play an important role in human-to-human communication and interaction, allowing people to express themselves beyond the verbal domain. The ability to understand human emotions is desirable for the computer in several applications. This paper explores new ways of human-computer interaction that enable the computer to be more aware of the user's emotional and attentional expressions. We present the basic research in the field and the recent advances into the emotion recognition from facial, voice, and pshysiological signals, where the different modalities are treated independently. We then describe the challenging problem of multimodal emotion recognition and we advocate the use of probabilistic graphical models when fusing the different modalities. We also discuss the difficult issues of obtaining reliable affective data, obtaining ground truth for emotion recognition, and the use of unlabeled data.
引用
收藏
页码:56 / 67
页数:12
相关论文
共 47 条
[1]  
[Anonymous], 2000, THESIS U ILLINOIS UR
[2]  
[Anonymous], 1996, MEDIA EQUATION HOW P
[4]  
BLACK MJ, 1995, FIFTH INTERNATIONAL CONFERENCE ON COMPUTER VISION, PROCEEDINGS, P374, DOI 10.1109/ICCV.1995.466915
[5]   INFERRING PSYCHOLOGICAL SIGNIFICANCE FROM PHYSIOLOGICAL SIGNALS [J].
CACIOPPO, JT ;
TASSINARY, LG .
AMERICAN PSYCHOLOGIST, 1990, 45 (01) :16-28
[6]   The James-Lange theory of emotions: A critical examination and an alternative theory [J].
Cannon, WB .
AMERICAN JOURNAL OF PSYCHOLOGY, 1927, 39 :106-124
[7]  
Chen LS, 2000, 2000 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, PROCEEDINGS VOLS I-III, P423, DOI 10.1109/ICME.2000.869630
[8]   Emotion recognition from audiovisual information [J].
Chen, LS ;
Tao, H ;
Huang, TS ;
Miyasato, T ;
Nakatsu, R .
1998 IEEE SECOND WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING, 1998, :83-88
[9]  
Chiu C. C., 1994, 1994 International Computer Symposium Conference Proceedings, P83
[10]   Facial expression recognition from video sequences: temporal and static modeling [J].
Cohen, I ;
Sebe, N ;
Garg, A ;
Chen, LS ;
Huang, TS .
COMPUTER VISION AND IMAGE UNDERSTANDING, 2003, 91 (1-2) :160-187