Artificial Intelligent System for Automatic Depression Level Analysis Through Visual and Vocal Expressions

被引:135
作者
Jan, Asim [1 ]
Meng, Hongying [1 ]
Gaus, Yona Falinie Binti A. [1 ]
Zhang, Fan [1 ]
机构
[1] Brunel Univ London, Dept Elect & Comp Engn, Uxbridge UB8 3PH, Middx, England
基金
中国国家自然科学基金;
关键词
Artificial system; Beck depression inventory (BDI); deep learning; depression; facial expression; regression; vocal expression; TEXTURE CLASSIFICATION; FACIAL EXPRESSION; RECOGNITION;
D O I
10.1109/TCDS.2017.2721552
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A human being's cognitive system can be simulated by artificial intelligent systems. Machines and robots equipped with cognitive capability can automatically recognize a humans mental state through their gestures and facial expressions. In this paper, an artificial intelligent system is proposed to monitor depression. It can predict the scales of Beck depression inventory II (BDI-11) from vocal and visual expressions. First, different visual features are extracted from facial expression images. Deep learning method is utilized to extract key visual features from the facial expression frames. Second, spectral low-level descriptors and mel-frequency cepstral coefficients features arc extracted from short audio segments to capture the vocal expressions. Third, feature dynamic history histogram (FDHH) is proposed to capture the temporal movement on the feature space. Finally, these FDHH and audio features are fused using regression techniques for the prediction of the BDI-II scales. The proposed method has been tested on the public Audio/Visual Emotion Challenges 2014 dataset as it is tuned to be more focused on the study of depression. The results outperform all the other existing methods on the same dataset.
引用
收藏
页码:668 / 680
页数:13
相关论文
共 62 条
[21]  
Friesen E., 1978, FACIAL ACTION CODING, V3, P5
[22]  
Girard JM, 2013, IEEE INT CONF AUTOMA
[23]  
Gupta R., 2014, P 4 INT WORKSH AUD V, P33, DOI [DOI 10.1145/2661806.2661810, 10.1145/2661806]
[24]   Learning Computational Models of Video Memorability from fMRI Brain Imaging [J].
Han, Junwei ;
Chen, Changyuan ;
Shao, Ling ;
Hu, Xintao ;
Han, Jungong ;
Liu, Tianming .
IEEE TRANSACTIONS ON CYBERNETICS, 2015, 45 (08) :1692-1703
[25]   Video abstraction based on fMRI-driven visual attention model [J].
Han, Junwei ;
Li, Kaiming ;
Shao, Ling ;
Hu, Xintao ;
He, Sheng ;
Guo, Lei ;
Han, Jungong ;
Liu, Tianming .
INFORMATION SCIENCES, 2014, 281 :781-796
[26]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778
[27]  
Hongying Meng, 2011, Proceedings 2011 IEEE International Conference on Automatic Face & Gesture Recognition (FG 2011), P854, DOI 10.1109/FG.2011.5771362
[28]  
Jain V., 2014, 2014 ACM INT WORKSH, P87
[29]  
Jan Asim., 2014, Proceedings of the ACM 4th International Workshop on Audio/Visual Emotion Challenge, P73
[30]  
Jenkins Emily, 2012, Depress Res Treat, V2012, P469680, DOI 10.1155/2012/469680