Knowledge-Driven Self-Supervised Representation Learning for Facial Action Unit Recognition

被引：14

作者：

Chang, Yanan ^{[1
]}

Wang, Shangfei ^{[1
]}

机构：

[1] Univ Sci & Technol China, Hefei, Anhui, Peoples R China

来源：

2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022) | 2022年

基金：

国家重点研发计划; 中国国家自然科学基金;

关键词：

D O I：

10.1109/CVPR52688.2022.01977

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Facial action unit (AU) recognition is formulated as a supervised learning problem by recent works. However, the complex labeling process makes it challenging to provide AU annotations for large amounts of facial images. To remedy this, we utilize AU labeling rules defined by the Facial Action Coding System (FACS) to design a novel knowledge-driven self-supervised representation learning framework for AU recognition. The representation encoder is trained using large amounts of facial images without AU annotations. AU labeling rules are summarized from FACS to design facial partition manners and determine correlations between facial regions. The method utilizes a backbone network to extract local facial area representations and a project head to map the representations into a low-dimensional latent space. In the latent space, a contrastive learning component leverages the inter-area difference to learn AU-related local representations while maintaining intra-area instance discrimination. Correlations between facial regions summarized from AU labeling rules are also explored to further learn representations using a predicting learning component. Evaluation on two benchmark databases demonstrates that the learned representation is powerful and data-efficient for AU recognition.

引用

页码：20385 / 20394

页数：10

共 30 条

[1] Chen T, 2020, PR MACH LEARN RES, V119
[2] Learning Spatial and Temporal Cues for Multi-label Facial Action Unit Detection
Chu, Wen-Sheng
De la Torre, Fernando
Cohn, Jeffrey F.
[J]. 2017 12TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2017), 2017, : 25 - 32
[3] THE SYMMETRY OF EMOTIONAL AND DELIBERATE FACIAL ACTIONS
EKMAN, P
HAGER, JC
FRIESEN, WV
[J]. PSYCHOPHYSIOLOGY, 1981, 18 (02) : 101 - 106
[4] Friesen, 1978, FACIAL ACTION CODING
[5] Optimizing Filter Size in Convolutional Neural Networks for Facial Action Unit Recognition
Han, Shizhong
Meng, Zibo
Li, Zhiyuan
O'Reilly, James
Cai, Jie
Wang, Xiaofeng
Tong, Yan
[J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 5070 - 5078
[6] He KM, 2020, IEEE T PATTERN ANAL, V42, P386, DOI [10.1109/TPAMI.2018.2844175, 10.1109/ICCV.2017.322]
[7] Momentum Contrast for Unsupervised Visual Representation Learning
He, Kaiming
Fan, Haoqi
Wu, Yuxin
Xie, Saining
Girshick, Ross
[J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, : 9726 - 9735
[8] Deep Residual Learning for Image Recognition
He, Kaiming
Zhang, Xiangyu
Ren, Shaoqing
Sun, Jian
[J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 770 - 778
[9] Henaff O., 2020, ABS190509272 CORR, P4182
[10] Speed/accuracy trade-offs for modern convolutional object detectors
Huang, Jonathan
Rathod, Vivek
Sun, Chen
Zhu, Menglong
Korattikara, Anoop
Fathi, Alireza
Fischer, Ian
Wojna, Zbigniew
Song, Yang
Guadarrama, Sergio
Murphy, Kevin
[J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 3296 - +

← 1 2 3 →