Fast Adaptation of Deep Models for Facial Action Unit Detection Using Model-Agnostic Meta-Learning

被引:0
作者
Lee, Mihee [1 ]
Rudovic, Ognjen [2 ,3 ]
Pavlovic, Vladimir [1 ,4 ]
Pantic, Maja [4 ,5 ]
机构
[1] Rutgers State Univ, Piscataway, NJ 08854 USA
[2] MIT, Media Lab, Cambridge, MA USA
[3] Augsburg Univ, Augsburg, Germany
[4] Samsung AI Ctr, Cambridge, England
[5] Imperial Coll, London, England
来源
WORKSHOP ON ARTIFICIAL INTELLIGENCE IN AFFECTIVE COMPUTING, VOL 122 | 2019年 / 122卷
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Detecting facial action unit (AU) activations is one of the key steps in automatic recognition of facial expressions of human emotion and cognitive states. While there are different approaches proposed for this task, most of these are trained only for a specific (sub)set of AUs. As such, they cannot easily adapt to the task of detection of new AUs which are not initially used to train the target models. In this paper, we propose a deep learning approach for facial AU detection that can adapt to a new AU and/or target subject by leveraging only a few labeled samples from the new task (either an AU or subject). We use the notion of the model-agnostic meta-learning, originally proposed for the general image recognition/detection tasks, to design our deep learning models for AU detection. Specifically, each subject and/or AU is treated as a new learning task and the model learns to adapt based on the knowledge of the previously seen tasks. We show on two benchmark datasets (BP4D and DISFA) for facial AU detection that the proposed approach can easily be adapted to new tasks. By using as few as one or five labeled examples from the target task, our approach achieves large improvements over the baseline (non-adapted) deep models.
引用
收藏
页码:9 / 27
页数:19
相关论文
共 20 条
  • [1] Selective Transfer Machine for Personalized Facial Expression Analysis
    Chu, Wen-Sheng
    De la Torre, Fernando
    Cohn, Jeffrey F.
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (03) : 529 - 545
  • [2] Finn C, 2017, PR MACH LEARN RES, V70
  • [3] github, Deepfakes
  • [4] Biconvex sets and optimization with biconvex functions: a survey and extensions
    Gorski, Jochen
    Pfeuffer, Frank
    Klamroth, Kathrin
    [J]. MATHEMATICAL METHODS OF OPERATIONS RESEARCH, 2007, 66 (03) : 373 - 407
  • [5] Koch G, 2015, ICML DEEP LEARN WORK, V2
  • [6] Multi-Label Zero-Shot Learning with Structured Knowledge Graphs
    Lee, Chung-Wei
    Fang, Wei
    Yeh, Chih-Kuan
    Wang, Yu-Chiang Frank
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 1576 - 1585
  • [7] Action Unit Detection with Region Adaptation, Multi-labeling Learning and Optimal Temporal Fusing
    Li, Wei
    Abtahi, Farnaz
    Zhu, Zhigang
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 6766 - 6775
  • [8] EAC-Net: A Region-based Deep Enhancing and Cropping Approach for Facial Action Unit Detection
    Li, Wei
    Abtahi, Farnaz
    Zhu, Zhigang
    Yin, Lijun
    [J]. 2017 12TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2017), 2017, : 103 - 110
  • [9] Lucey Patrick, 2010, Proceedings of the 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops (CVPR Workshops), DOI 10.1109/CVPRW.2010.5543262
  • [10] DISFA: A Spontaneous Facial Action Intensity Database
    Mavadati, S. Mohammad
    Mahoor, Mohammad H.
    Bartlett, Kevin
    Trinh, Philip
    Cohn, Jeffrey F.
    [J]. IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2013, 4 (02) : 151 - 160