Knowledge-Driven Self-Supervised Representation Learning for Facial Action Unit Recognition

被引:14
作者
Chang, Yanan [1 ]
Wang, Shangfei [1 ]
机构
[1] Univ Sci & Technol China, Hefei, Anhui, Peoples R China
来源
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022) | 2022年
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
D O I
10.1109/CVPR52688.2022.01977
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Facial action unit (AU) recognition is formulated as a supervised learning problem by recent works. However, the complex labeling process makes it challenging to provide AU annotations for large amounts of facial images. To remedy this, we utilize AU labeling rules defined by the Facial Action Coding System (FACS) to design a novel knowledge-driven self-supervised representation learning framework for AU recognition. The representation encoder is trained using large amounts of facial images without AU annotations. AU labeling rules are summarized from FACS to design facial partition manners and determine correlations between facial regions. The method utilizes a backbone network to extract local facial area representations and a project head to map the representations into a low-dimensional latent space. In the latent space, a contrastive learning component leverages the inter-area difference to learn AU-related local representations while maintaining intra-area instance discrimination. Correlations between facial regions summarized from AU labeling rules are also explored to further learn representations using a predicting learning component. Evaluation on two benchmark databases demonstrates that the learned representation is powerful and data-efficient for AU recognition.
引用
收藏
页码:20385 / 20394
页数:10
相关论文
共 30 条
  • [1] Chen T, 2020, PR MACH LEARN RES, V119
  • [2] Learning Spatial and Temporal Cues for Multi-label Facial Action Unit Detection
    Chu, Wen-Sheng
    De la Torre, Fernando
    Cohn, Jeffrey F.
    [J]. 2017 12TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2017), 2017, : 25 - 32
  • [3] THE SYMMETRY OF EMOTIONAL AND DELIBERATE FACIAL ACTIONS
    EKMAN, P
    HAGER, JC
    FRIESEN, WV
    [J]. PSYCHOPHYSIOLOGY, 1981, 18 (02) : 101 - 106
  • [4] Friesen, 1978, FACIAL ACTION CODING
  • [5] Optimizing Filter Size in Convolutional Neural Networks for Facial Action Unit Recognition
    Han, Shizhong
    Meng, Zibo
    Li, Zhiyuan
    O'Reilly, James
    Cai, Jie
    Wang, Xiaofeng
    Tong, Yan
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 5070 - 5078
  • [6] He KM, 2020, IEEE T PATTERN ANAL, V42, P386, DOI [10.1109/TPAMI.2018.2844175, 10.1109/ICCV.2017.322]
  • [7] Momentum Contrast for Unsupervised Visual Representation Learning
    He, Kaiming
    Fan, Haoqi
    Wu, Yuxin
    Xie, Saining
    Girshick, Ross
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, : 9726 - 9735
  • [8] Deep Residual Learning for Image Recognition
    He, Kaiming
    Zhang, Xiangyu
    Ren, Shaoqing
    Sun, Jian
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 770 - 778
  • [9] Henaff O., 2020, ABS190509272 CORR, P4182
  • [10] Speed/accuracy trade-offs for modern convolutional object detectors
    Huang, Jonathan
    Rathod, Vivek
    Sun, Chen
    Zhu, Menglong
    Korattikara, Anoop
    Fathi, Alireza
    Fischer, Ian
    Wojna, Zbigniew
    Song, Yang
    Guadarrama, Sergio
    Murphy, Kevin
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 3296 - +