Facial Depression Recognition by Deep Joint Label Distribution and Metric Learning

Cited by: 29

Authors:

Zhou, Xiuzhuang [1]

Wei, Zeqiang [1]

Xu, Min [2]

Qu, Shan [3]

Guo, Guodong [4,5]

Affiliations:

[1] Beijing Univ Posts & Telecommun, Sch Artificial Intelligence, Beijing 100876, Peoples R China

[2] Capital Normal Univ, Coll Informat & Engn, Beijing 100048, Peoples R China

[3] Peking Univ Peoples Hosp, Dept Psychiat, Beijing 100044, Peoples R China

[4] Baidu Res, Inst Deep Learning, Beijing, Peoples R China

[5] Natl Engn Lab Deep Learning Technol & Applicat, Beijing 100085, Peoples R China

Source:

IEEE TRANSACTIONS ON AFFECTIVE COMPUTING | 2022 / Vol. 13 / Issue 03

Funding:

National Natural Science Foundation of China; Beijing Natural Science Foundation;

Keywords:

Feature extraction; Face recognition; Measurement; Predictive models; Histograms; Spatiotemporal phenomena; Faces; Depression recognition; label distribution learning; metric learning; label-aware histogram loss; spatiotemporal feature; SCALE;

DOI:

10.1109/TAFFC.2020.3022732

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

While existing prediction models built on popular deep architectures have shown promising results in facial depression recognition, they still lack sufficient discriminative power because of two issues: 1) the limited amount of labeled depression data available for deep representation learning, and 2) the large variation in facial expression across different persons with the same depression score, together with the subtle difference in facial expression across different depression levels. In this article, we formulate facial depression recognition as a label distribution learning (LDL) problem, and propose a deep joint label distribution and metric learning (DJ-LDML) method to address these issues. In DJ-LDML, LDL exploits the label relevance inherent in depression data to implicitly increase the amount of training data associated with each depression level without actually enlarging the dataset, while deep metric learning (DML) aims at learning a deep ordinal embedding with a specifically designed label-aware histogram loss, allowing the semantic similarity between video sequences (described by ordinal labels) to be preserved for discriminative feature learning. The two learning modules in DJ-LDML work collaboratively to enhance the representation ability and discriminative power of the deeply learned spatiotemporal feature, leading to improved depression prediction. We empirically evaluate our method on two benchmark datasets, and the results demonstrate the effectiveness of our formulation.
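
The label distribution idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the Gaussian shape, the width `sigma`, the 64 discrete levels, the KL-divergence objective, and all function names below are assumptions chosen for illustration, since the abstract does not specify them. The sketch shows how one scalar score can be spread over neighbouring levels, so that a single sample also contributes training signal to adjacent depression levels.

```python
import numpy as np


def score_to_label_distribution(score, num_levels=64, sigma=2.0):
    """Spread a scalar score over levels 0..num_levels-1.

    Assumption: a discretized Gaussian centred on the score; the
    record does not say which distribution the paper uses.
    """
    levels = np.arange(num_levels, dtype=np.float64)
    weights = np.exp(-0.5 * ((levels - score) / sigma) ** 2)
    return weights / weights.sum()  # normalize so the masses sum to 1


def kl_divergence(target, predicted, eps=1e-12):
    """KL(target || predicted), a common (assumed) LDL training objective."""
    target = np.clip(target, eps, 1.0)
    predicted = np.clip(predicted, eps, 1.0)
    return float(np.sum(target * np.log(target / predicted)))


def expected_score(distribution):
    """Decode a distribution back to a scalar as its expected level."""
    levels = np.arange(distribution.shape[0], dtype=np.float64)
    return float(np.dot(levels, distribution))


if __name__ == "__main__":
    target = score_to_label_distribution(20)
    shifted = score_to_label_distribution(23)
    print("mass on neighbouring level 19:", target[19])
    print("KL to a prediction shifted by 3 levels:", kl_divergence(target, shifted))
    print("decoded score:", expected_score(target))
```

Under these assumptions, levels adjacent to the annotated score receive non-zero probability mass, which is one way to read the abstract's claim that LDL increases the data associated with each level without enlarging the dataset.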

Pages: 1605-1618

Page count: 14
