Weakly Supervised Dual Learning for Facial Action Unit Recognition

被引:8
作者
Wang, Shangfei [1 ]
Peng, Guozhu [1 ]
机构
[1] Univ Sci & Technol China, Key Lab Comp & Commun Software Anhui Prov, Hefei 230027, Peoples R China
基金
中国国家自然科学基金;
关键词
Action unit recognition; weakly-supervised; dual learning; EXPRESSION RECOGNITION; PAIN;
D O I
10.1109/TMM.2019.2916063
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Current research on facial action unit (AU) recognition typically requires fully AU-annotated facial images. Compared to facial expression labeling, AU annotation is a time-consuming, expensive, and error-prone process. Inspired by dual learning, we propose a novel weakly supervised dual learning mechanism to train facial action unit classifiers from expression-annotated images. Specifically, we consider AU recognition from facial images as the main task, and face synthesis given AUs as the auxiliary task. For AU recognition, we force the recognized AUs to satisfy the expression-dependent and expression-independent AU dependencies, i.e., the domain knowledge about expressions and AUs. For face synthesis given AUs, we minimize the difference between the synthetic face and the ground truth face, which has identical recognized and given AUs. By optimizing the dual tasks simultaneously, we successfully leverage their intrinsic connections as well as domain knowledge about expressions and AUs to facilitate the learning of AU classifiers from expression-annotated image. Furthermore, we extend the proposed weakly supervised dual learning mechanism to semi-supervised dual learning scenarios with partially AU-annotated images. Experimental results on three benchmark databases demonstrate the effectiveness of the proposed approach for both tasks.
引用
收藏
页码:3218 / 3230
页数:13
相关论文
共 50 条
  • [31] Spatial-temporal correlations learning and action-background jointed attention for weakly-supervised temporal action localization
    Xia, Huifen
    Zhan, Yongzhao
    Cheng, Keyang
    MULTIMEDIA SYSTEMS, 2022, 28 (04) : 1529 - 1541
  • [32] GLNet: Global Local Network for Weakly Supervised Action Localization
    Zhang, Shiwei
    Song, Lin
    Gao, Changxin
    Sang, Nong
    IEEE TRANSACTIONS ON MULTIMEDIA, 2020, 22 (10) : 2610 - 2622
  • [33] Weakly Supervised Object Detection Based on Active Learning
    Wang, Xiao
    Xiang, Xiang
    Zhang, Baochang
    Liu, Xuhui
    Zheng, Jianying
    Hu, Qinglei
    NEURAL PROCESSING LETTERS, 2022, 54 (06) : 5169 - 5183
  • [34] Image Piece Learning for Weakly Supervised Semantic Segmentation
    Li, Yi
    Guo, Yanqing
    Kao, Yueying
    He, Ran
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2017, 47 (04): : 648 - 659
  • [35] Weakly Supervised Object Detection Based on Active Learning
    Xiao Wang
    Xiang Xiang
    Baochang Zhang
    Xuhui Liu
    Jianying Zheng
    QingLei Hu
    Neural Processing Letters, 2022, 54 : 5169 - 5183
  • [36] Facial Emotion Recognition with Inter-Modality-Attention-Transformer-Based Self-Supervised Learning
    Chaudhari, Aayushi
    Bhatt, Chintan
    Krishna, Achyut
    Travieso-Gonzalez, Carlos M.
    ELECTRONICS, 2023, 12 (02)
  • [37] Deep cascaded action attention network for weakly-supervised temporal action localization
    Xia, Hui-fen
    Zhan, Yong-zhao
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (19) : 29769 - 29787
  • [38] Deep cascaded action attention network for weakly-supervised temporal action localization
    Hui-fen Xia
    Yong-zhao Zhan
    Multimedia Tools and Applications, 2023, 82 : 29769 - 29787
  • [39] A survey of automatic facial action units recognition
    Zhao H.
    Wang Z.
    Liu Y.
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2010, 22 (05): : 894 - 906
  • [40] Dual Semantic Reconstruction Network for Weakly Supervised Temporal Sentence Grounding
    Tang, Kefan
    He, Lihuo
    Wang, Nannan
    Gao, Xinbo
    IEEE TRANSACTIONS ON MULTIMEDIA, 2025, 27 : 95 - 107