Visually Interpretable Representation Learning for Depression Recognition from Facial Images

被引:165
作者
Zhou, Xiuzhuang [1 ]
Jin, Kai [2 ]
Shang, Yuanyuan [2 ]
Guo, Guodong [3 ]
机构
[1] Beijing Univ Posts & Telecommun, Sch Automat, Beijing 100876, Peoples R China
[2] Capital Normal Univ, Coll Informat Engn, Beijing 100048, Peoples R China
[3] West Virginia Univ, Lane Dept Comp Sci & Elect Engn, Morgantown, WV 26506 USA
基金
高等学校博士学科点专项科研基金; 中国国家自然科学基金;
关键词
Visualization; Feature extraction; Videos; Face recognition; Face; Computer architecture; Image recognition; Depression recognition; face recognition; deep convolutional neural network; depression activation map; DISEASE;
D O I
10.1109/TAFFC.2018.2828819
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent evidence in mental health assessment have demonstrated that facial appearance could be highly indicative of depressive disorder. While previous methods based on the facial analysis promise to advance clinical diagnosis of depressive disorder in a more efficient and objective manner, challenges in visual representation of complex depression pattern prevent widespread practice of automated depression diagnosis. In this paper, we present a deep regression network termed DepressNet to learn a depression representation with visual explanation. Specifically, a deep convolutional neural network equipped with a global average pooling layer is first trained with facial depression data, which allows for identifying salient regions of input image in terms of its severity score based on the generated depression activation map (DAM). We then propose a multi-region DepressNet, with which multiple local deep regression models for different face regions are jointly leaned and their responses are fused to improve the overall recognition performance. We evaluate our method on two benchmark datasets, and the results show that our method significantly boosts state-of-the-art performance of the visual-based depression recognition. Most importantly, the DAM induced by our learned deep model may help reveal the visual depression pattern on faces and understand the insights of automated depression diagnosis.
引用
收藏
页码:542 / 552
页数:11
相关论文
共 49 条
[1]  
A. Bosch, 2007, P 6 ACM I NT C IM VI, P401
[2]  
Abadi M, 2016, ACM SIGPLAN NOTICES, V51, P1, DOI [10.1145/2951913.2976746, 10.1145/3022670.2976746]
[3]   Multimodal Depression Detection: Fusion Analysis of Paralinguistic, Head Pose and Eye Gaze Behaviors [J].
Alghowinem, Sharifa ;
Goecke, Roland ;
Wagner, Michael ;
Epps, Julien ;
Hyett, Matthew ;
Parker, Gordon ;
Breakspear, Michael .
IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2018, 9 (04) :478-490
[4]   Local Gabor Binary Patterns from Three Orthogonal Planes for Automatic Facial Expression Recognition [J].
Almaev, Timur R. ;
Valstar, Michel F. .
2013 HUMAINE ASSOCIATION CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION (ACII), 2013, :356-361
[5]  
[Anonymous], 2014, P INT C LEARN REPR I
[6]  
[Anonymous], 2014, ISTANB J SOCIOL STUD
[7]  
[Anonymous], 2015, LANCET, DOI DOI 10.1016/S0140-6736(15)60692-4
[8]  
[Anonymous], 2018, IEEE T AFFECT COMPUT, DOI DOI 10.1109/TAFFC.2017.2650899
[9]   Comparison of Beck Depression Inventories-IA and -II in psychiatric outpatients [J].
Beck, AT ;
Steer, RA ;
Ball, R ;
Ranieri, WF .
JOURNAL OF PERSONALITY ASSESSMENT, 1996, 67 (03) :588-597
[10]   Major depressive disorder [J].
Otte, Christian ;
Gold, Stefan M. ;
Penninx, Brenda W. ;
Pariante, Carmine M. ;
Etkin, Amit ;
Fava, Maurizio ;
Mohr, David C. ;
Schatzberg, Alan F. .
NATURE REVIEWS DISEASE PRIMERS, 2016, 2