Intelligent system for depression scale estimation with facial expressions and case study in industrial intelligence

被引:39
作者
He, Lang [1 ,2 ]
Guo, Chenguang [3 ]
Tiwari, Prayag [4 ]
Pandey, Hari Mohan [5 ]
Dang, Wei [6 ]
机构
[1] Xian Univ Posts & Telecommun, Sch Comp Sci & Technol, Comp Sci, Xian, Shaanxi, Peoples R China
[2] Xian Univ Posts & Telecommun, Shaanxi Key Lab Network Data Anal & Intelligent P, Xian, Shaanxi, Peoples R China
[3] Northwestern Polytech Univ, Sch Elect & Informat, Dept Elect & Informat Engn, Xian 710072, Peoples R China
[4] Aalto Univ, Dept Comp Sci, Espoo, Finland
[5] Edge Hill Univ, Dept Comp Sci, Ormskirk L39 4QP, England
[6] Shaanxi Mental Hlth Ctr, Xian, Shaanxi, Peoples R China
关键词
3D‐ CNN; depression; industrial intelligent system; pattern recognition; vector of local aggregated descriptors;
D O I
10.1002/int.22426
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As a mental disorder, depression has affected people's lives, works, and so on. Researchers have proposed various industrial intelligent systems in the pattern recognition field for audiovisual depression detection. This paper presents an end-to-end trainable intelligent system to generate high-level representations over the entire video clip. Specifically, a three-dimensional (3D) convolutional neural network equipped with a module spatiotemporal feature aggregation module (STFAM) is trained from scratch on audio/visual emotion challenge (AVEC)2013 and AVEC2014 data, which can model the discriminative patterns closely related to depression. In the STFAM, channel and spatial attention mechanism and an aggregation method, namely 3D DEP-NetVLAD, are integrated to learn the compact characteristic based on the feature maps. Extensive experiments on the two databases (i.e., AVEC2013 and AVEC2014) are illustrated that the proposed intelligent system can efficiently model the underlying depression patterns and obtain better performances over the most video-based depression recognition approaches. Case studies are presented to describes the applicability of the proposed intelligent system for industrial intelligence.
引用
收藏
页码:10140 / 10157
页数:18
相关论文
共 38 条
[1]   Video-Based Depression Level Analysis by Encoding Deep Spatiotemporal Features [J].
Al Jazaery, Mohamad ;
Guo, Guodong .
IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2021, 12 (01) :262-268
[2]   Emergency decision support modeling for COVID-19 based on spherical fuzzy information [J].
Ashraf, Shahzaib ;
Abdullah, Saleem .
INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2020, 35 (11) :1601-1645
[3]   Comparison of Beck Depression Inventories-IA and -II in psychiatric outpatients [J].
Beck, AT ;
Steer, RA ;
Ball, R ;
Ranieri, WF .
JOURNAL OF PERSONALITY ASSESSMENT, 1996, 67 (03) :588-597
[4]  
Boerma, 2020, DEPRESSION OTHER COM
[5]  
Bogduk, 2013, DIAGN STAT MAN MENT
[6]   Z-uncertain probabilistic linguistic variables and its application in emergency decision making for treatment of COVID-19 patients [J].
Chai, Jiahui ;
Xian, Sidong ;
Lu, Sichong .
INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2021, 36 (01) :362-402
[7]  
Cummins N., 2013, P 3 ACM INT WORKSH A, P11
[8]  
de Melo WC, 2019, IEEE IMAGE PROC, P4544, DOI [10.1109/ICIP.2019.8803467, 10.1109/icip.2019.8803467]
[9]  
de Melo WC, 2019, IEEE INT CONF AUTOMA, P554, DOI 10.1109/fg.2019.8756568
[10]  
Dhall A, 2015, INT CONF AFFECT, P255, DOI 10.1109/ACII.2015.7344580