Artificial Intelligent System for Automatic Depression Level Analysis Through Visual and Vocal Expressions

被引:135
作者
Jan, Asim [1 ]
Meng, Hongying [1 ]
Gaus, Yona Falinie Binti A. [1 ]
Zhang, Fan [1 ]
机构
[1] Brunel Univ London, Dept Elect & Comp Engn, Uxbridge UB8 3PH, Middx, England
基金
中国国家自然科学基金;
关键词
Artificial system; Beck depression inventory (BDI); deep learning; depression; facial expression; regression; vocal expression; TEXTURE CLASSIFICATION; FACIAL EXPRESSION; RECOGNITION;
D O I
10.1109/TCDS.2017.2721552
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A human being's cognitive system can be simulated by artificial intelligent systems. Machines and robots equipped with cognitive capability can automatically recognize a humans mental state through their gestures and facial expressions. In this paper, an artificial intelligent system is proposed to monitor depression. It can predict the scales of Beck depression inventory II (BDI-11) from vocal and visual expressions. First, different visual features are extracted from facial expression images. Deep learning method is utilized to extract key visual features from the facial expression frames. Second, spectral low-level descriptors and mel-frequency cepstral coefficients features arc extracted from short audio segments to capture the vocal expressions. Third, feature dynamic history histogram (FDHH) is proposed to capture the temporal movement on the feature space. Finally, these FDHH and audio features are fused using regression techniques for the prediction of the BDI-II scales. The proposed method has been tested on the public Audio/Visual Emotion Challenges 2014 dataset as it is tuned to be more focused on the study of depression. The results outperform all the other existing methods on the same dataset.
引用
收藏
页码:668 / 680
页数:13
相关论文
共 62 条
[1]   Face description with local binary patterns:: Application to face recognition [J].
Ahonen, Timo ;
Hadid, Abdenour ;
Pietikainen, Matti .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2006, 28 (12) :2037-2041
[2]   Local Gabor Binary Patterns from Three Orthogonal Planes for Automatic Facial Expression Recognition [J].
Almaev, Timur R. ;
Valstar, Michel F. .
2013 HUMAINE ASSOCIATION CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION (ACII), 2013, :356-361
[3]  
[Anonymous], 2014, P 4 INT WORKSHOP AUD
[4]  
[Anonymous], 2015, MATCONVNET CONVOLUTI
[5]  
[Anonymous], 2008, Tech. rep.
[6]  
[Anonymous], 2007, 2007 IEEE C COMPUTER
[7]  
[Anonymous], 2015, P 11 IEEE INT C WORK
[8]  
[Anonymous], 2009, P 2009 3 INT C AFF C
[9]  
[Anonymous], 2014, P 4 INT WORKSH AUD V, DOI [10.1145/2661806.2661815, DOI 10.1145/2661806.2661815]
[10]  
[Anonymous], 1990, Advances in neural information processing systems