Decision Tree Based Depression Classification from Audio Video and Language Information

Cited by: 89
Authors
Yang, Le [1 ]
Jiang, Dongmei [1 ]
He, Lang [1 ]
Pei, Ercheng [1 ]
Oveneke, Meshia Cedric [2 ]
Sahli, Hichem [3 ,4 ]
Affiliations
[1] Northwestern Polytech Univ, Sch Comp Sci, NPU VUB Joint AVSP Lab, 127 Youyi Xilu, Xian 710072, Peoples R China
[2] Vrije Univ Brussel, ETRO, Dept Elect & Informat, NPU VUB Joint AVSP Lab, Pl Laan 2, B-1050 Brussels, Belgium
[3] VUB, Dept ETRO, NPU VUB Joint AVSP Lab, Pl Laan 2, B-1050 Brussels, Belgium
[4] Interuniv Microelect Ctr, Kepeldreef 75, B-3001 Heverlee, Belgium
Source
PROCEEDINGS OF THE 6TH INTERNATIONAL WORKSHOP ON AUDIO/VISUAL EMOTION CHALLENGE (AVEC'16) | 2016
Keywords
Depression classification; decision tree; multi-modal
DOI
10.1145/2988257.2988269
CLC number
O42 [Acoustics]
Subject classification codes
070206; 082403
Abstract
To improve the recognition accuracy in the Depression Classification Sub-Challenge (DCC) of AVEC 2016, in this paper we propose a decision tree for depression classification. The decision tree is constructed according to the distribution of the multimodal predictions of the PHQ-8 scores and of participants' characteristics (PTSD/depression diagnosis, sleep status, feeling and personality) obtained by analyzing the participants' transcript files. The proposed gender-specific decision tree provides a way of fusing the upper-level language information with the results obtained using low-level audio and visual features. Experiments are carried out on the Distress Analysis Interview Corpus - Wizard of Oz (DAIC-WOZ) database. The results show that the proposed depression classification schemes obtain very promising results on the development set, with an F1 score of 0.857 for the depressed class and 0.964 for the not-depressed class. Despite the over-fitting problem in training the PHQ-8 score prediction models, the classification schemes still obtain satisfactory performance on the test set. The F1 score reaches 0.571 for the depressed class and 0.877 for the not-depressed class, with an average of 0.724, which is higher than the baseline result of 0.700.
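The following is a minimal, self-contained sketch (not the authors' implementation) of the fusion idea described in the abstract: a gender-specific decision tree that combines a multimodal PHQ-8 score prediction with transcript-derived participant characteristics. The synthetic data, the feature layout, and the PHQ-8 >= 10 cut-off for the depressed class are illustrative assumptions; only the overall scheme follows the paper.

# Hedged sketch of gender-specific decision-tree fusion for depression
# classification. Features and data are synthetic stand-ins for the paper's
# multimodal PHQ-8 prediction and transcript-derived characteristics.
import numpy as np
from sklearn.tree import DecisionTreeClassifier
from sklearn.metrics import f1_score

rng = np.random.default_rng(0)
n = 100  # hypothetical number of participants

# Columns (assumed for illustration): multimodal PHQ-8 prediction (0-24),
# PTSD/depression-diagnosis flag, poor-sleep flag, negative-feeling flag.
X = np.column_stack([
    rng.uniform(0, 24, n),
    rng.integers(0, 2, n),
    rng.integers(0, 2, n),
    rng.integers(0, 2, n),
])
gender = rng.integers(0, 2, n)       # 0 = female, 1 = male
y = (X[:, 0] >= 10).astype(int)      # PHQ-8 >= 10 taken here as "depressed"

# Train one shallow tree per gender, mirroring the gender-specific design,
# and report per-class F1 as in the paper's evaluation protocol.
for g in (0, 1):
    mask = gender == g
    tree = DecisionTreeClassifier(max_depth=3, random_state=0)
    tree.fit(X[mask], y[mask])
    pred = tree.predict(X[mask])
    print(f"gender={g}  F1(depressed)={f1_score(y[mask], pred):.3f}  "
          f"F1(not depressed)={f1_score(y[mask], pred, pos_label=0):.3f}")

On real data the tree would be fit on development-set predictions and transcript cues rather than on synthetic features, and the split points would correspond to the characteristic thresholds the paper derives from the PHQ-8 score distribution.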
Pages: 89-96
Number of pages: 8