Fast Adaptation of Deep Models for Facial Action Unit Detection Using Model-Agnostic Meta-Learning

被引：0

作者：

Lee, Mihee ^{[1
]}

Rudovic, Ognjen ^{[2
,3
]}

Pavlovic, Vladimir ^{[1
,4
]}

Pantic, Maja ^{[4
,5
]}

机构：

[1] Rutgers State Univ, Piscataway, NJ 08854 USA

[2] MIT, Media Lab, Cambridge, MA USA

[3] Augsburg Univ, Augsburg, Germany

[4] Samsung AI Ctr, Cambridge, England

[5] Imperial Coll, London, England

来源：

WORKSHOP ON ARTIFICIAL INTELLIGENCE IN AFFECTIVE COMPUTING, VOL 122 | 2019年 / 122卷

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Detecting facial action unit (AU) activations is one of the key steps in automatic recognition of facial expressions of human emotion and cognitive states. While there are different approaches proposed for this task, most of these are trained only for a specific (sub)set of AUs. As such, they cannot easily adapt to the task of detection of new AUs which are not initially used to train the target models. In this paper, we propose a deep learning approach for facial AU detection that can adapt to a new AU and/or target subject by leveraging only a few labeled samples from the new task (either an AU or subject). We use the notion of the model-agnostic meta-learning, originally proposed for the general image recognition/detection tasks, to design our deep learning models for AU detection. Specifically, each subject and/or AU is treated as a new learning task and the model learns to adapt based on the knowledge of the previously seen tasks. We show on two benchmark datasets (BP4D and DISFA) for facial AU detection that the proposed approach can easily be adapted to new tasks. By using as few as one or five labeled examples from the target task, our approach achieves large improvements over the baseline (non-adapted) deep models.

引用

页码：9 / 27

页数：19

共 20 条

[1] Selective Transfer Machine for Personalized Facial Expression Analysis
Chu, Wen-Sheng
De la Torre, Fernando
Cohn, Jeffrey F.
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (03) : 529 - 545
[2] Finn C, 2017, PR MACH LEARN RES, V70
[3] github, Deepfakes
[4] Biconvex sets and optimization with biconvex functions: a survey and extensions
Gorski, Jochen
Pfeuffer, Frank
Klamroth, Kathrin
[J]. MATHEMATICAL METHODS OF OPERATIONS RESEARCH, 2007, 66 (03) : 373 - 407
[5] Koch G, 2015, ICML DEEP LEARN WORK, V2
[6] Multi-Label Zero-Shot Learning with Structured Knowledge Graphs
Lee, Chung-Wei
Fang, Wei
Yeh, Chih-Kuan
Wang, Yu-Chiang Frank
[J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 1576 - 1585
[7] Action Unit Detection with Region Adaptation, Multi-labeling Learning and Optimal Temporal Fusing
Li, Wei
Abtahi, Farnaz
Zhu, Zhigang
[J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 6766 - 6775
[8] EAC-Net: A Region-based Deep Enhancing and Cropping Approach for Facial Action Unit Detection
Li, Wei
Abtahi, Farnaz
Zhu, Zhigang
Yin, Lijun
[J]. 2017 12TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2017), 2017, : 103 - 110
[9] Lucey Patrick, 2010, Proceedings of the 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops (CVPR Workshops), DOI 10.1109/CVPRW.2010.5543262
[10] DISFA: A Spontaneous Facial Action Intensity Database
Mavadati, S. Mohammad
Mahoor, Mohammad H.
Bartlett, Kevin
Trinh, Philip
Cohn, Jeffrey F.
[J]. IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2013, 4 (02) : 151 - 160

← 1 2 →