Harnessing Label Uncertainty to Improve Modeling: An Application to Student Engagement Recognition

被引：11

作者：

Aung, Arkar Min ^{[1
]}

Whitehill, Jacob R. ^{[1
]}

机构：

[1] Worcester Polytech Inst, Worcester, MA 01609 USA

来源：

PROCEEDINGS 2018 13TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE & GESTURE RECOGNITION (FG 2018) | 2018年

基金：

美国国家科学基金会;

关键词：

data annotation; label regularization; automatic facial expression recognition; student engagement recognition; FACIAL EXPRESSIONS;

D O I：

10.1109/FG.2018.00033

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Automatic facial expression recognition systems are usually trained from target labels that model each example as belonging unambiguously to a single class (e.g., "non-engaged", "very engaged", etc.). However, in some settings, ground-truth labels can be more aptly modeled as probability distributions (e.g., [0.1, 0.1, 0.5, 0.3] over 4 engagement categories) that capture the uncertainty that can arise during the annotation process. In this paper, we explore how harnessing the full probability distribution of each label ("soft labels"), rather than just a scalar summary statistic ("hard labels", e.g., majority class or mean), can yield better recognition accuracy when training automated detectors. Our results on a face image dataset (10698 faces over 20 subjects) labeled for perceived student engagement suggest that training on soft labels can deliver engagement detectors that fit the data stat. sig. more accurately (lower cross-entropy for classification, higher Pearson correlation for regression) than when training on hard labels. Moreover, we explore possible reasons for this effect and provide evidence that it is due to implicit regularization that the soft labels enact on the trained engagement detector. This effect is similar to, but empirically seems stronger than, the "label smoothing" approach proposed by Szegedy, et al. [1].

引用

页码：166 / 170

页数：5

共 25 条

[1]

[Anonymous], 2009, Advances in Neural Information Processing Systems

[2]

[Anonymous], 2017, ARXIV170308774

[3] Reinterpreting the Application of Gabor Filters as a Manipulation of the Margin in Linear Support Vector Machines [J].

Ashraf, Ahmed Bilal ;

Lucey, Simon ;

Chen, Tsuhan .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2010, 32 (07) :1335-1341

[4]

Bosch N., 2015, P 20 INT C INT US IN, P379

[5] Using Video to Automatically Detect Learner Affect in Computer-Enabled Classrooms [J].

Bosch, Nigel ;

D'Mello, Sidney K. ;

Ocumpaugh, Jaclyn ;

Baker, Ryan S. ;

Shute, Valerie .

ACM TRANSACTIONS ON INTERACTIVE INTELLIGENT SYSTEMS, 2016, 6 (02)

[6] Do facial expressions signal specific emotions? Judging emotion from the face in context [J].

Carroll, JM ;

Russell, JA .

JOURNAL OF PERSONALITY AND SOCIAL PSYCHOLOGY, 1996, 70 (02) :205-218

[7]

Chittaranjan Gokul, 2011, Proceedings 2011 IEEE International Conference on Automatic Face & Gesture Recognition (FG 2011), P734, DOI 10.1109/FG.2011.5771339

[8] AN ARGUMENT FOR BASIC EMOTIONS [J].

EKMAN, P .

COGNITION & EMOTION, 1992, 6 (3-4) :169-200

[9]

Eyben F., 2011, Proceedings 2011 IEEE International Conference on Automatic Face & Gesture Recognition (FG 2011), P322, DOI 10.1109/FG.2011.5771417

[10]

Glorot X., 2010, P 13 INT C ART INT S, P249

← 1 2 3 →