Multi-task network with inter-task consistency learning for face parsing and facial expression recognition at real-time speed

被引:0
|
作者
Wang, Haoyu [1 ]
Song, Haiyu [2 ,3 ]
Li, Peihong [1 ]
机构
[1] Univ Sci & Technol China, Hefei, Peoples R China
[2] Zhejiang Univ Finance & Econ, Coll Informat Management & Artificial Intelligence, Hangzhou, Peoples R China
[3] Zhejiang Univ Technol, Binjiang Inst Artificial Intelligence, Hangzhou, Peoples R China
基金
中国国家自然科学基金;
关键词
Multi-task network; Face parsing; Real-time inference; Facial expression recognition; TRANSFORMER;
D O I
10.1016/j.jvcir.2024.104213
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In recent years, face parsing and facial expression recognition have attracted increasing interest. Even though there are relevant results about face parsing and face representation, these approaches seek accuracy at the expense of speed. In this paper, we design a novel multi-task learning network for face parsing and facial expression recognition (MPENet). Specifically, MPENet consists of shared encoders and three downstream branches. In the edge perceiving branch, we use category edge and binary edge to extract face boundary information and improve localization of face boundaries. In the segmentation branch, we use graph learning to fuse edge and semantic information of the image, analyze the relations between different feature regions, and capture more contextual relationships. Finally, we design a consistent learning loss function, forcing different branches to learn the same predictions. We have carried out experiments on face datasets, and found that it shows high precision and fast inference speed. Specifically, MPENet achieves F1 scores of 85.9 on the CelebAMask-HQ dataset and 92.9 on the Lapa dataset, with an inference speed of 92.9 FPS. Moreover, MPENet precisely delineates the semantic boundaries of facial regions and, through consistent multi-task learning, effectively facilitates synergy among various tasks.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Real-Time Facial Attribute Recognition Using Multi-Task Learning
    Yuan, Huaqing
    He, Yi
    Du, Peng
    Song, Lu
    Xu, Yanbin
    2024 IEEE INTERNATIONAL INSTRUMENTATION AND MEASUREMENT TECHNOLOGY CONFERENCE, I2MTC 2024, 2024,
  • [2] Facial Expression Recognition by Regional Attention and Multi-task Learning
    Cui, Longlei
    Tian, Ying
    ENGINEERING LETTERS, 2021, 29 (03) : 919 - 925
  • [3] Discriminative deep multi-task learning for facial expression recognition
    Zheng, Hao
    Wang, Ruili
    Ji, Wanting
    Zong, Ming
    Wong, Wai Keung
    Lai, Zhihui
    Lv, Hexin
    INFORMATION SCIENCES, 2020, 533 : 60 - 71
  • [4] A multi-task model for simultaneous face identification and facial expression recognition
    Zheng, Hao
    Geng, Xin
    Tao, Dacheng
    Jin, Zhong
    NEUROCOMPUTING, 2016, 171 : 515 - 523
  • [5] Real-Time Multi-Task Facial Analytics With Event Cameras
    Ryan, Cian
    Elrasad, Amr
    Shariff, Waseem
    Lemley, Joe
    Kielty, Paul
    Hurney, Patrick
    Corcoran, Peter
    IEEE ACCESS, 2023, 11 : 76964 - 76976
  • [6] Real-Time Multi-task Network for Autonomous Driving
    Dat, Vu Thanh
    Bao, Ngo Viet Hoai
    Hung, Phan Duy
    ADVANCES IN COMPUTING AND DATA SCIENCES (ICACDS 2022), PT I, 2022, 1613 : 207 - 218
  • [7] A REAL-TIME MULTI-TASK SINGLE SHOT FACE DETECTOR
    Chen, Jun-Cheng
    Lin, Wei-An
    Zheng, Jingxiao
    Chellappa, Rama
    2018 25TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2018, : 176 - 180
  • [8] Residual multi-task learning for facial landmark localization and expression recognition
    Chen, Boyu
    Guan, Wenlong
    Li, Peixia
    Ikeda, Naoki
    Hirasawa, Kosuke
    Lu, Huchuan
    PATTERN RECOGNITION, 2021, 115
  • [9] Parallel Multi-task Cascade Convolution Neural Network Optimization Algorithm for Real-time Dynamic Face Recognition
    Jiang, Bin
    Ren, Qiang
    Dai, Fei
    Zhou, Tian
    Gui, Guan
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2020, 14 (10): : 4117 - 4135
  • [10] Multi-Task Learning of Facial Landmarks and Expression
    Devries, Terrance
    Biswaranjan, Kumar
    Taylor, Graham W.
    2014 CANADIAN CONFERENCE ON COMPUTER AND ROBOT VISION (CRV), 2014, : 98 - 103