Weakly Supervised Dual Learning for Facial Action Unit Recognition

被引：8

作者：

Wang, Shangfei ^{[1
]}

Peng, Guozhu ^{[1
]}

机构：

[1] Univ Sci & Technol China, Key Lab Comp & Commun Software Anhui Prov, Hefei 230027, Peoples R China

来源：

IEEE TRANSACTIONS ON MULTIMEDIA | 2019年 / 21卷 / 12期

基金：

中国国家自然科学基金;

关键词：

Action unit recognition; weakly-supervised; dual learning; EXPRESSION RECOGNITION; PAIN;

D O I：

10.1109/TMM.2019.2916063

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Current research on facial action unit (AU) recognition typically requires fully AU-annotated facial images. Compared to facial expression labeling, AU annotation is a time-consuming, expensive, and error-prone process. Inspired by dual learning, we propose a novel weakly supervised dual learning mechanism to train facial action unit classifiers from expression-annotated images. Specifically, we consider AU recognition from facial images as the main task, and face synthesis given AUs as the auxiliary task. For AU recognition, we force the recognized AUs to satisfy the expression-dependent and expression-independent AU dependencies, i.e., the domain knowledge about expressions and AUs. For face synthesis given AUs, we minimize the difference between the synthetic face and the ground truth face, which has identical recognized and given AUs. By optimizing the dual tasks simultaneously, we successfully leverage their intrinsic connections as well as domain knowledge about expressions and AUs to facilitate the learning of AU classifiers from expression-annotated image. Furthermore, we extend the proposed weakly supervised dual learning mechanism to semi-supervised dual learning scenarios with partially AU-annotated images. Experimental results on three benchmark databases demonstrate the effectiveness of the proposed approach for both tasks.

引用

页码：3218 / 3230

页数：13

共 50 条

[31] Spatial-temporal correlations learning and action-background jointed attention for weakly-supervised temporal action localization
Xia, Huifen
Zhan, Yongzhao
Cheng, Keyang
MULTIMEDIA SYSTEMS, 2022, 28 (04) : 1529 - 1541
[32] GLNet: Global Local Network for Weakly Supervised Action Localization
Zhang, Shiwei
Song, Lin
Gao, Changxin
Sang, Nong
IEEE TRANSACTIONS ON MULTIMEDIA, 2020, 22 (10) : 2610 - 2622
[33] Weakly Supervised Object Detection Based on Active Learning
Wang, Xiao
Xiang, Xiang
Zhang, Baochang
Liu, Xuhui
Zheng, Jianying
Hu, Qinglei
NEURAL PROCESSING LETTERS, 2022, 54 (06) : 5169 - 5183
[34] Image Piece Learning for Weakly Supervised Semantic Segmentation
Li, Yi
Guo, Yanqing
Kao, Yueying
He, Ran
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2017, 47 (04): : 648 - 659
[35] Weakly Supervised Object Detection Based on Active Learning
Xiao Wang
Xiang Xiang
Baochang Zhang
Xuhui Liu
Jianying Zheng
QingLei Hu
Neural Processing Letters, 2022, 54 : 5169 - 5183
[36] Facial Emotion Recognition with Inter-Modality-Attention-Transformer-Based Self-Supervised Learning
Chaudhari, Aayushi
Bhatt, Chintan
Krishna, Achyut
Travieso-Gonzalez, Carlos M.
ELECTRONICS, 2023, 12 (02)
[37] Deep cascaded action attention network for weakly-supervised temporal action localization
Xia, Hui-fen
Zhan, Yong-zhao
MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (19) : 29769 - 29787
[38] Deep cascaded action attention network for weakly-supervised temporal action localization
Hui-fen Xia
Yong-zhao Zhan
Multimedia Tools and Applications, 2023, 82 : 29769 - 29787
[39] A survey of automatic facial action units recognition
Zhao H.
Wang Z.
Liu Y.
Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2010, 22 (05): : 894 - 906
[40] Dual Semantic Reconstruction Network for Weakly Supervised Temporal Sentence Grounding
Tang, Kefan
He, Lihuo
Wang, Nannan
Gao, Xinbo
IEEE TRANSACTIONS ON MULTIMEDIA, 2025, 27 : 95 - 107

← 1 2 3 4 5 →