Facial Depression Recognition by Deep Joint Label Distribution and Metric Learning

Cited by: 29
Authors
Zhou, Xiuzhuang [1]
Wei, Zeqiang [1]
Xu, Min [2]
Qu, Shan [3]
Guo, Guodong [4, 5]
Affiliations
[1] Beijing Univ Posts & Telecommun, Sch Artificial Intelligence, Beijing 100876, Peoples R China
[2] Capital Normal Univ, Coll Informat & Engn, Beijing 100048, Peoples R China
[3] Peking Univ Peoples Hosp, Dept Psychiat, Beijing 100044, Peoples R China
[4] Baidu Res, Inst Deep Learning, Beijing, Peoples R China
[5] Natl Engn Lab Deep Learning Technol & Applicat, Beijing 100085, Peoples R China
Funding
Beijing Natural Science Foundation; National Natural Science Foundation of China
Keywords
Feature extraction; Face recognition; Measurement; Predictive models; Histograms; Spatiotemporal phenomena; Faces; Depression recognition; label distribution learning; metric learning; label-aware histogram loss; spatiotemporal feature; SCALE;
DOI
10.1109/TAFFC.2020.3022732
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
While existing prediction models built on popular deep architectures have shown promising results in facial depression recognition, they still lack sufficient discriminative power because of 1) the limited amount of labeled depression data available for deep representation learning and 2) the large variation in facial expression across different persons with the same depression score, together with the subtle differences in facial expression across different depression levels. In this article, we formulate facial depression recognition as a label distribution learning (LDL) problem and propose a deep joint label distribution and metric learning (DJ-LDML) method to address these issues. In DJ-LDML, LDL exploits the label relevance inherent in depression data to implicitly increase the amount of training data associated with each depression level without actually enlarging the dataset, while deep metric learning (DML) learns a deep ordinal embedding with a specifically designed label-aware histogram loss, so that the semantic similarity between video sequences (described by ordinal labels) is preserved for discriminative feature learning. The two learning modules in DJ-LDML work collaboratively to enhance the representation ability and discriminative power of the deeply learned spatiotemporal feature, leading to improved depression prediction. We empirically evaluate our method on two benchmark datasets, and the results demonstrate the effectiveness of our formulation.
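The label distribution idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the Gaussian shape, the width `sigma`, the 64 score levels, and the function names `score_to_distribution`, `kl_divergence`, and `expected_score` are all assumptions made for illustration. It shows how a single scalar score might be spread over neighbouring levels so that one sample also contributes to adjacent depression levels, and how a predicted distribution could be compared with that target and mapped back to a score. The label-aware histogram loss is not sketched here because the record does not define it.

```python
import numpy as np


def score_to_distribution(score, num_levels=64, sigma=2.0):
    """Spread a scalar score over discrete levels with a Gaussian.

    num_levels and sigma are illustrative assumptions, not values
    taken from the paper.
    """
    levels = np.arange(num_levels)
    logits = -0.5 * ((levels - score) / sigma) ** 2
    probs = np.exp(logits - logits.max())  # subtract max for numerical stability
    return probs / probs.sum()             # normalise so the entries sum to 1


def kl_divergence(target, predicted, eps=1e-12):
    """KL(target || predicted), a common training loss for LDL."""
    return float(np.sum(target * (np.log(target + eps) - np.log(predicted + eps))))


def expected_score(distribution):
    """Map a label distribution back to a scalar score via its expectation."""
    levels = np.arange(len(distribution))
    return float(np.dot(levels, distribution))


# Usage: build a target for score 20 and compare it with a uniform guess.
target = score_to_distribution(20)
uniform = np.full_like(target, 1.0 / len(target))
loss_against_uniform = kl_divergence(target, uniform)
recovered = expected_score(target)
```

In this sketch, levels near the true score receive non-zero probability mass, which is one simple way to realise the "implicit" increase in training data per level that the abstract describes.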
Pages: 1605-1618
Number of pages: 14