Facial Depression Recognition by Deep Joint Label Distribution and Metric Learning

Cited by: 29
Authors
Zhou, Xiuzhuang [1]
Wei, Zeqiang [1]
Xu, Min [2]
Qu, Shan [3]
Guo, Guodong [4, 5]
Affiliations
[1] Beijing Univ Posts & Telecommun, Sch Artificial Intelligence, Beijing 100876, Peoples R China
[2] Capital Normal Univ, Coll Informat & Engn, Beijing 100048, Peoples R China
[3] Peking Univ Peoples Hosp, Dept Psychiat, Beijing 100044, Peoples R China
[4] Baidu Res, Inst Deep Learning, Beijing, Peoples R China
[5] Natl Engn Lab Deep Learning Technol & Applicat, Beijing 100085, Peoples R China
Funding
Beijing Natural Science Foundation; National Natural Science Foundation of China
Keywords
Feature extraction; Face recognition; Measurement; Predictive models; Histograms; Spatiotemporal phenomena; Faces; Depression recognition; label distribution learning; metric learning; label-aware histogram loss; spatiotemporal feature; SCALE;
DOI
10.1109/TAFFC.2020.3022732
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
While existing prediction models built on popular deep architectures have shown promising results in facial depression recognition, they still lack sufficient discriminative power for two reasons: 1) the limited amount of labeled depression data available for deep representation learning, and 2) the large variation in facial expression across different persons with the same depression score, together with the subtle differences in facial expression across different depression levels. In this article, we formulate facial depression recognition as a label distribution learning (LDL) problem and propose a deep joint label distribution and metric learning (DJ-LDML) method to address these issues. In DJ-LDML, LDL exploits the label relevance inherent in depression data to implicitly increase the amount of training data associated with each depression level without actually enlarging the dataset, while deep metric learning (DML) learns a deep ordinal embedding with a specifically designed label-aware histogram loss, so that the semantic similarity between video sequences (described by ordinal labels) is preserved for discriminative feature learning. The two learning modules in DJ-LDML work collaboratively to enhance the representation ability and discriminative power of the deeply learned spatiotemporal feature, leading to improved depression prediction. We empirically evaluate our method on two benchmark datasets, and the results demonstrate the effectiveness of our formulation.
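The label distribution idea in the abstract can be illustrated with a minimal sketch. The abstract does not say how the distributions are constructed, so everything specific here is an assumption: a discretized Gaussian centered on the ground-truth score (a common choice in LDL work), 64 depression levels, a width `sigma` of 2.0, and a KL-divergence training objective. The function names are hypothetical, and the label-aware histogram loss of the metric learning module is not shown.

```python
import math


def score_to_label_distribution(score, num_levels=64, sigma=2.0):
    """Spread one scalar score over neighboring levels (assumed Gaussian shape).

    num_levels and sigma are illustrative assumptions, not values from the paper.
    """
    weights = [
        math.exp(-((level - score) ** 2) / (2.0 * sigma ** 2))
        for level in range(num_levels)
    ]
    total = sum(weights)
    # Normalize so the weights form a probability distribution.
    return [w / total for w in weights]


def expected_score(distribution):
    """Recover a scalar prediction as the expectation over levels."""
    return sum(level * p for level, p in enumerate(distribution))


def kl_divergence(target, predicted, eps=1e-12):
    """KL(target || predicted), one possible loss for distribution outputs."""
    return sum(
        t * math.log((t + eps) / (p + eps))
        for t, p in zip(target, predicted)
    )


# One sample with score 20 now also carries weight for levels 18, 19, 21, 22, ...
target = score_to_label_distribution(score=20)
uniform = [1.0 / len(target)] * len(target)

print(expected_score(target))
print(kl_divergence(target, uniform))
```

Because each training sample contributes probability mass to the levels adjacent to its own score, every level is supervised by more samples than carry that exact label, which is the sense in which the training data per level grows without enlarging the dataset.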
Pages: 1605-1618
Number of pages: 14