Spectral Representation of Behaviour Primitives for Depression Analysis

被引:64
作者
Song, Siyang [1 ,2 ]
Jaiswal, Shashank [2 ]
Shen, Linlin [1 ,3 ,4 ]
Valstar, Michel [2 ]
机构
[1] Shenzhen Univ, Coll Comp Sci & Software Engn, Comp Vis Inst, Shenzhen 518060, Peoples R China
[2] Univ Nottingham, Sch Comp Sci, Comp Vis Lab, Nottingham NG8 1BB, England
[3] Shenzhen Inst Artificial Intelligence & Robot Soc, Shenzhen 518060, Peoples R China
[4] Shenzhen Univ, Natl Engn Lab Big Data Syst Comp Technol, Shenzhen 518060, Peoples R China
基金
中国国家自然科学基金;
关键词
Depression; Videos; Task analysis; Interviews; Feature extraction; Magnetic heads; Neural networks; Automatic depression analysis; fourier transform; spectral representation; time-frequency analysis; convolution neural networks; FACIAL APPEARANCE; EXPRESSION; NETWORKS;
D O I
10.1109/TAFFC.2020.2970712
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Depression is a serious mental disorder affecting millions of people all over the world. Traditional clinical diagnosis methods are subjective, complicated and require extensive participation of clinicians. Recent advances in automatic depression analysis systems promise a future where these shortcomings are addressed by objective, repeatable, and readily available diagnostic tools to aid health professionals in their work. Yet there remain a number of barriers to the development of such tools. One barrier is that existing automatic depression analysis algorithms base their predictions on very brief sequential segments, sometimes as little as one frame. Another barrier is that existing methods do not take into account what the context of the measured behaviour is. In this article, we extract multi-scale video-level features for video-based automatic depression analysis. We propose to use automatically detected human behaviour primitives as the low-dimensional descriptor for each frame. We also propose two novel spectral representations, i.e., spectral heatmaps and spectral vectors, to represent video-level multi-scale temporal dynamics of expressive behaviour. Constructed spectral representations are fed to Convolution Neural Networks (CNNs) and Artificial Neural Networks (ANNs) for depression analysis. We conducted experiments on the AVEC 2013 and AVEC 2014 benchmark datasets to investigate the influence of interview tasks on depression analysis. In addition to achieving state of the art accuracy in severity of depression estimation, we show that the task conducted by the user matters, that fusion of a combination of tasks reaches highest accuracy, and that longer tasks are more informative than shorter tasks, up to a point.
引用
收藏
页码:829 / 844
页数:16
相关论文
共 71 条
[1]   Video-Based Depression Level Analysis by Encoding Deep Spatiotemporal Features [J].
Al Jazaery, Mohamad ;
Guo, Guodong .
IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2021, 12 (01) :262-268
[2]   Detecting Depression with Audio/Text Sequence Modeling of Interviews [J].
Alhanai, Tuka ;
Ghassemi, Mohammad ;
Glass, James .
19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, :1716-1720
[3]  
[Anonymous], 2013, People with mental health problems still waiting over a year for talking treatments
[4]  
Bai S., 2018, ARXIV180301271
[5]   OpenFace 2.0: Facial Behavior Analysis Toolkit [J].
Baltrusaitis, Tadas ;
Zadeh, Amir ;
Lim, Yao Chong ;
Morency, Louis-Philippe .
PROCEEDINGS 2018 13TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE & GESTURE RECOGNITION (FG 2018), 2018, :59-66
[6]  
Brozgold A Z, 1998, Appl Neuropsychol, V5, P15, DOI 10.1207/s15324826an0501_2
[7]   Further Evidence for the Cultural Norm Hypothesis: Positive Emotion in Depressed and Control European American and Asian American Women [J].
Chentsova-Dutton, Yulia E. ;
Tsai, Jeanne L. ;
Gotlib, Ian H. .
CULTURAL DIVERSITY & ETHNIC MINORITY PSYCHOLOGY, 2010, 16 (02) :284-295
[8]  
Cohn Jeffrey F., 2009, 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops (CVPR Workshops), P1, DOI 10.1109/CVPR.2009.5204260
[9]   Psychiatric diagnosis: impersonal, imperfect and important [J].
Craddock, Nick ;
Mynors-Wallis, Laurence .
BRITISH JOURNAL OF PSYCHIATRY, 2014, 204 (02) :93-95
[10]  
Cummins N., 2013, PROC 3 ACM INT WORKS, P11, DOI 10.1145/2512530.2512535