Emotional facial sensing and multimodal fusion in a continuous 2D affective space

Times Cited: 10
Authors
Cerezo, Eva [1]
Hupont, Isabelle [2]
Baldassarri, Sandra [1]
Ballano, Sergio [2]
Affiliations
[1] Univ Zaragoza, Dept Informat & Ingn Sistemas, Zaragoza 50018, Spain
[2] ITA, Cuarte 22197, Huesca, Spain
Keywords
Affective Computing; Kansei (sense/emotion) engineering; Human factors; Facial expression analysis; Multimodal fusion; RECOGNITION; EXPRESSIONS;
DOI
10.1007/s12652-011-0087-6
CLC Number
TP18 [Artificial Intelligence Theory]
Discipline Classification Codes
081104; 0812; 0835; 1405
Abstract
This paper addresses two main research focuses in Affective Computing: facial emotion recognition and the multimodal fusion of affective information coming from different channels. The facial sensing system developed implements an emotional classification mechanism that combines, in a novel and robust manner, the five classifiers most commonly used in the field of affect sensing, outputting a weight that associates the facial expression with each of Ekman's six universal emotional categories plus neutral. The system can analyze any subject, male or female, of any age and ethnicity, and has been validated by means of statistical evaluation strategies such as cross-validation, classification accuracy ratios and confusion matrices. The categorical facial sensing system has subsequently been extended to a continuous 2D affective space, which also makes it possible to address the problem of multimodal human affect recognition. A novel fusion methodology is proposed that can fuse any number of affective modules, even when they have very different time scales and output labels. It relies on the 2D Whissell affective space and outputs a continuous emotional path characterizing the user's affective progress over time. A Kalman filtering technique controls this path in real time to ensure the temporal consistency and robustness of the system. Moreover, the methodology adapts to temporal changes in the reliability of the different inputs. The potential of the multimodal fusion methodology is demonstrated by fusing dynamic affective information extracted from different channels (video, typed-in text and emoticons) of an Instant Messaging tool.
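To illustrate the kind of temporal control the abstract describes, below is a minimal sketch (not the authors' implementation) of a Kalman filter smoothing a sequence of fused points in a 2D evaluation/activation space such as Whissell's. The constant-velocity state model, the class name and the noise values are assumptions chosen purely for illustration.

```python
# Sketch only: Kalman smoothing of a 2D affective path (evaluation, activation).
# Model choice (constant velocity) and noise magnitudes are illustrative assumptions.
import numpy as np

class AffectivePathFilter:
    def __init__(self, process_noise=1e-3, measurement_noise=1e-1):
        # State: [evaluation, activation, d_evaluation, d_activation]
        self.x = np.zeros(4)
        self.P = np.eye(4)
        self.F = np.eye(4)
        self.F[0, 2] = self.F[1, 3] = 1.0                   # constant-velocity transition
        self.H = np.hstack([np.eye(2), np.zeros((2, 2))])   # only the 2D position is observed
        self.Q = process_noise * np.eye(4)
        self.R = measurement_noise * np.eye(2)

    def update(self, z):
        """z: fused (evaluation, activation) measurement; returns the smoothed point."""
        # Predict
        self.x = self.F @ self.x
        self.P = self.F @ self.P @ self.F.T + self.Q
        # Correct
        y = np.asarray(z, dtype=float) - self.H @ self.x
        S = self.H @ self.P @ self.H.T + self.R
        K = self.P @ self.H.T @ np.linalg.inv(S)
        self.x = self.x + K @ y
        self.P = (np.eye(4) - K @ self.H) @ self.P
        return self.x[:2]

# Example: smooth a noisy path drifting from neutral toward a positive, aroused state.
kf = AffectivePathFilter()
for t in range(10):
    measurement = np.array([0.1 * t, 0.05 * t]) + np.random.normal(0, 0.1, 2)
    smoothed = kf.update(measurement)
```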
Pages: 31-46
Number of pages: 16