EmotiCon: Context-Aware Multimodal Emotion Recognition using Frege's Principle

被引：114

作者：

Mittal, Trisha ^{[1
]}

Guhan, Pooja ^{[1
]}

Bhattacharya, Uttaran ^{[1
]}

Chandra, Rohan ^{[1
]}

Bera, Aniket ^{[1
]}

Manocha, Dinesh ^{[1
,2
]}

机构：

[1] Univ Maryland, Dept Comp Sci, College Pk, MD 20742 USA

[2] Univ Maryland, Dept Elect & Comp Engn, College Pk, MD 20742 USA

来源：

2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020) | 2020年

关键词：

EXPRESSION; FUTURE; MODEL;

D O I：

10.1109/CVPR42600.2020.01424

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We present EmotiCon, a learning-based algorithm for context-aware perceived human emotion recognition from videos and images. Motivated by Frege's Context Principle from psychology, our approach combines three interpretations of context for emotion recognition. Our first interpretation is based on using multiple modalities (e.g. faces and gaits) for emotion recognition. For the second interpretation, we gather semantic context from the input image and use a self-attention-based CNN to encode this information. Finally, we use depth maps to model the third interpretation related to socio-dynamic interactions and proximity among agents. We demonstrate the efficiency of our network through experiments on EMOTIC, a benchmark dataset. We report an Average Precision (AP) score of 35.48 across 26 classes, which is an improvement of 7-8 over prior methods. We also introduce a new dataset, GroupWalk, which is a collection of videos captured in multiple real-world settings of people walking. We report an AP of 65.83 across 4 categories on GroupWalk, which is also an improvement over prior methods.

引用

页码：14222 / 14231

页数：10

共 60 条

[1]

Akputu Kingsley Oryina., 2013, 2nd International Conference on Machine Learning and Computer Science (IMLCS2013), P9

[2] The Future of Emotion Regulation Research: Capturing Context [J].

Aldao, Amelia .

PERSPECTIVES ON PSYCHOLOGICAL SCIENCE, 2013, 8 (02) :155-172

[3]

[Anonymous], 2013, IEEE TAC

[4]

[Anonymous], 2012, BODY CUES NOT FACIAL

[5]

[Anonymous], 2005, PNAS

[6]

Baltrusaitis Tadas, 2016, WACV, P1, DOI [DOI 10.1109/WACV.2016.7477553, 10.1109/WACV.2016.7477553]

[7]

Barrett F.L., 2010, The mind in context, P1

[8] Context in Emotion Perception [J].

Barrett, Lisa Feldman ;

Mesquita, Batja ;

Gendron, Maria .

CURRENT DIRECTIONS IN PSYCHOLOGICAL SCIENCE, 2011, 20 (05) :286-290

[9] Laplacian eigenmaps for dimensionality reduction and data representation [J].

Belkin, M ;

Niyogi, P .

NEURAL COMPUTATION, 2003, 15 (06) :1373-1396

[10] EmotioNet: An accurate, real-time algorithm for the automatic annotation of a million facial expressions in the wild [J].

Benitez-Quiroz, C. Fabian ;

Srinivasan, Ramprakash ;

Martinez, Aleix M. .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :5562-5570

← 1 2 3 4 5 6 →