Facial Depression Recognition by Deep Joint Label Distribution and Metric Learning

Cited by: 29
Authors
Zhou, Xiuzhuang [1]
Wei, Zeqiang [1]
Xu, Min [2]
Qu, Shan [3]
Guo, Guodong [4, 5]
Affiliations
[1] Beijing Univ Posts & Telecommun, Sch Artificial Intelligence, Beijing 100876, Peoples R China
[2] Capital Normal Univ, Coll Informat & Engn, Beijing 100048, Peoples R China
[3] Peking Univ Peoples Hosp, Dept Psychiat, Beijing 100044, Peoples R China
[4] Baidu Res, Inst Deep Learning, Beijing, Peoples R China
[5] Natl Engn Lab Deep Learning Technol & Applicat, Beijing 100085, Peoples R China
Funding
National Natural Science Foundation of China; Beijing Natural Science Foundation;
Keywords
Feature extraction; Face recognition; Measurement; Predictive models; Histograms; Spatiotemporal phenomena; Faces; Depression recognition; label distribution learning; metric learning; label-aware histogram loss; spatiotemporal feature; SCALE;
DOI
10.1109/TAFFC.2020.3022732
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
While existing prediction models built on popular deep architectures have shown promising results in facial depression recognition, they still lack sufficient discriminative power because of two issues: 1) the limited amount of labeled depression data available for deep representation learning, and 2) the large variation in facial expression across different persons with the same depression score, together with the subtle differences in facial expression across different depression levels. In this article, we formulate facial depression recognition as a label distribution learning (LDL) problem and propose a deep joint label distribution and metric learning (DJ-LDML) method to address these issues. In DJ-LDML, LDL exploits the label relevance inherent in depression data to implicitly increase the amount of training data associated with each depression level without actually enlarging the dataset, while deep metric learning (DML) learns a deep ordinal embedding with a specifically designed label-aware histogram loss, so that the semantic similarity between video sequences (described by ordinal labels) is preserved for discriminative feature learning. The two learning modules in DJ-LDML work collaboratively to enhance the representation ability and discriminative power of the deeply learned spatiotemporal feature, leading to improved depression prediction. We empirically evaluate our method on two benchmark datasets, and the results demonstrate the effectiveness of our formulation.
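The label distribution idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the Gaussian shape, the 64 discrete levels, the width `sigma`, the KL-divergence comparison, and the function names `label_distribution`, `expected_score`, and `kl_divergence` are all assumptions made here for illustration. The sketch only shows how a single score can be spread over neighboring levels so that one sample contributes to several adjacent labels.

```python
import math


def label_distribution(score, num_levels=64, sigma=2.0):
    """Spread one score over levels 0..num_levels-1 as a normalized,
    discretized Gaussian (assumed shape; not taken from the paper)."""
    weights = [
        math.exp(-((k - score) ** 2) / (2.0 * sigma ** 2))
        for k in range(num_levels)
    ]
    total = sum(weights)
    return [w / total for w in weights]


def expected_score(dist):
    """Recover a scalar prediction as the expectation over levels."""
    return sum(k * p for k, p in enumerate(dist))


def kl_divergence(target, predicted, eps=1e-12):
    """KL(target || predicted), one common way to compare distributions
    (assumed here; the paper's actual objective may differ)."""
    return sum(
        t * math.log((t + eps) / (p + eps))
        for t, p in zip(target, predicted)
    )


# Hypothetical usage: a sample labeled 20 also gives mass to levels 19 and 21.
target = label_distribution(20)
shifted = label_distribution(22)
print(round(expected_score(target), 3))
print(kl_divergence(target, shifted) > kl_divergence(target, target))
```

Because neighboring levels receive non-zero mass, each training sample also informs nearby depression levels, which is one way to read the abstract's claim of increasing the effective training data per level without enlarging the dataset.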
Pages: 1605-1618
Number of pages: 14