Automated Depression Diagnosis Based on Deep Networks to Encode Facial Appearance and Dynamics

被引:168
作者
Zhu, Yu [1 ]
Shang, Yuanyuan [2 ,3 ]
Shao, Zhuhong [2 ,3 ]
Guo, Guodong [1 ,2 ]
机构
[1] West Virginia Univ, Lane Dept Comp Sci & Elect Engn, Morgantown, WV 26506 USA
[2] Beijing Adv Innovat Ctr Imaging Technol, Beijing 100048, Peoples R China
[3] Capital Normal Univ, Coll Informat Engn, Beijing 100048, Peoples R China
关键词
Automated depression diagnosis; nonverbal behavior; deep convolutional neural networks; flow dynamics; SEVERITY;
D O I
10.1109/TAFFC.2017.2650899
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As a severe psychiatric disorder disease, depression is a state of low mood and aversion to activity, which prevents a person from functioning normally in both work and daily lives. The study on automated mental health assessment has been given increasing attentions in recent years. In this paper, we study the problem of automatic diagnosis of depression. A new approach to predict the Beck Depression Inventory II (BDI-II) values from video data is proposed based on the deep networks. The proposed framework is designed in a two stream manner, aiming at capturing both the facial appearance and dynamics. Further, we employ joint tuning layers that can implicitly integrate the appearance and dynamic information. Experiments are conducted on two depression databases, AVEC2013 and AVEC2014. The experimental results show that our proposed approach significantly improve the depression prediction performance, compared to other visual-based approaches.
引用
收藏
页码:578 / 584
页数:7
相关论文
共 36 条
[1]   Face description with local binary patterns:: Application to face recognition [J].
Ahonen, Timo ;
Hadid, Abdenour ;
Pietikainen, Matti .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2006, 28 (12) :2037-2041
[2]  
Alghowinem S, 2013, IEEE IMAGE PROC, P4220, DOI 10.1109/ICIP.2013.6738869
[3]   Local Gabor Binary Patterns from Three Orthogonal Planes for Automatic Facial Expression Recognition [J].
Almaev, Timur R. ;
Valstar, Michel F. .
2013 HUMAINE ASSOCIATION CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION (ACII), 2013, :356-361
[4]  
[Anonymous], 2014, LEARNING FACE REPRES
[5]  
[Anonymous], 2007, P 6 ACM INT C IM VID, DOI [DOI 10.1145/1282280.1282340, 10.1145/1282280.1282340]
[6]  
[Anonymous], 2009, P 2009 3 INT C AFF C
[7]   Comparison of Beck Depression Inventories-IA and -II in psychiatric outpatients [J].
Beck, AT ;
Steer, RA ;
Ball, R ;
Ranieri, WF .
JOURNAL OF PERSONALITY ASSESSMENT, 1996, 67 (03) :588-597
[8]  
Belmaker RH, 2008, NEW ENGL J MED, V358, P55, DOI [10.1056/NEJMra073096, 10.1038/nrdp.2016.65]
[9]  
Birdwhistell RayL., 1974, Nonverbal communi- cation, P134
[10]   High accuracy optical flow estimation based on a theory for warping [J].
Brox, T ;
Bruhn, A ;
Papenberg, N ;
Weickert, J .
COMPUTER VISION - ECCV 2004, PT 4, 2004, 2034 :25-36