Automatic depression recognition using CNN with attention mechanism from videos

被引:103
作者
He, Lang [1 ,2 ]
Chan, Jonathan Cheung-Wai [3 ]
Wang, Zhongmin [1 ,2 ]
机构
[1] Xian Univ Posts & Telecommun, Sch Comp Sci & Technol, Xian 710121, Shaanxi, Peoples R China
[2] Xian Univ Posts & Telecommun, Shaanxi Key Lab Network Data Anal & Intelligent P, Xian 710121, Shaanxi, Peoples R China
[3] Vrije Univ Brussel VUB, Dept Elect & Informat, B-1050 Brussels, Belgium
关键词
Depression; CNN with attention mechanism; Local Attention based CNN (LA-CNN); Global Attention based CNN (GA-CNN);
D O I
10.1016/j.neucom.2020.10.015
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Artificial intelligence (AI) has incorporated various automatic systems and frameworks to diagnose the severity of depression using hand-crafted features. However, process of feature selection needs domain knowledge and is still time-consuming and subjective. Deep learning technology has been successfully adopted for depression recognition. Most previous works pre-train the deep models on large databases followed by fine-tuning with depression databases (i.e., AVEC2013, AVEC2014). In the present paper we propose an integrated framework - Deep Local Global Attention Convolutional Neural Network (DLGA-CNN) for depression recognition, which adopts CNN with attention mechanism as well as weighted spatial pyramid pooling (WSPP) to learn a deep and global representation. Two branches are introduced: Local Attention based CNN (LA-CNN) focuses on the local patches, while Global Attention based CNN (GA-CNN) learns the global patterns from the entire facial region. To capture the complementary information between the two branches, Local-Global Attention-based CNN (LGA-CNN) is proposed. After feature aggregation, WSPP is used to learn the depression patterns. Comprehensive experiments on AVEC2013 and AVEC2014 depression databases have demonstrated that the proposed method is capable of mining the underlying depression patterns of facial videos and outperforms the most of the state-of the-art video-based depression recognition approaches. (c) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页码:165 / 175
页数:11
相关论文
共 39 条
[1]  
Abadi M, 2016, PROCEEDINGS OF OSDI'16: 12TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, P265
[2]  
AL CHANTI D.A., IEEE T AFFECTIVE COM
[3]  
Al Jazaery M., IEEE T AFFECTIVE COM
[4]  
[Anonymous], 2015, ACS SYM SER
[5]  
[Anonymous], 2013, P 3 ACM INT WORKSH A
[6]  
Baltrusaitis T, 2016, IEEE WINT CONF APPL
[7]   Comparison of Beck Depression Inventories-IA and -II in psychiatric outpatients [J].
Beck, AT ;
Steer, RA ;
Ball, R ;
Ranieri, WF .
JOURNAL OF PERSONALITY ASSESSMENT, 1996, 67 (03) :588-597
[8]  
Cummins N., 2013, P 3 ACM INT WORKSH A, P11
[9]  
de Melo W.C., 2019, ICIP
[10]  
de Melo W.C., 2019, COMBINING GLOBAL LOC