INFLUENCE OF ACOUSTIC LOW-LEVEL DESCRIPTORS IN THE DETECTION OF CLINICAL DEPRESSION IN ADOLESCENTS

被引:48
作者
Low, Lu-Shih Alex [1 ]
Maddage, Namunu C. [1 ]
Lech, Margaret [1 ]
Sheeber, Lisa [3 ]
Allen, Nicholas [2 ]
机构
[1] RMIT Univ, Sch Elect & Comp Engn, Melbourne, Vic 3001, Australia
[2] Univ Melbourne, ORYGEN Res Ctr & Dept Psychol, Melbourne, Vic 3010, Australia
[3] Oregon Res Inst, Eugene, OR 97403 USA
来源
2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING | 2010年
基金
澳大利亚研究理事会;
关键词
Clinical depression; prosodic feature; spectral feature; acoustic features; Gaussian Mixture Model; SPEECH; CLASSIFICATION; INDICATORS;
D O I
10.1109/ICASSP.2010.5495018
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, we report the influence that classification accuracies have in speech analysis from a clinical dataset by adding acoustic low-level descriptors (LLD) belonging to prosodic (i.e. pitch, formants, energy, jitter, shimmer) and spectral features (i.e. spectral flux, centroid, entropy and roll-off) along with their delta (Delta) and delta-delta (Delta-Delta) coefficients to two baseline features of Mel frequency cepstral coefficients and Teager energy critical-band based autocorrelation envelope. Extracted acoustic low-level descriptors (LLD) that display an increase in accuracy after being added to these baseline features were finally modeled together using Gaussian mixture models and tested. A clinical data set of speech from 139 adolescents, including 68 (49 girls and 19 boys) diagnosed as clinically depressed, was used in the classification experiments. For male subjects, the combination of (TEO-CB-Auto-Env + Delta + Delta-Delta) + F0 + (LogE + Delta + Delta-Delta) + (Shimmer + Delta) + Spectral Flux + Spectral Roll-off gave the highest classification rate of 77.82% while for the female subjects, using TEO-CB-Auto-Env gave an accuracy of 74.74%.
引用
收藏
页码:5154 / 5157
页数:4
相关论文
共 12 条
[1]  
Australian Bureau of Statistics, 2008, CAUS DEATH AUSTR 200
[2]  
Childers D.G., 2000, Speech processing and synthesis toolboxes
[3]   Vocal indicators of mood change in depression [J].
Ellgring, H ;
Scherer, KR .
JOURNAL OF NONVERBAL BEHAVIOR, 1996, 20 (02) :83-110
[4]   Acoustical properties of speech as indicators of depression and suicidal risk [J].
France, DJ ;
Shiavi, RG ;
Silverman, S ;
Silverman, M ;
Wilkes, DM .
IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2000, 47 (07) :829-837
[5]  
Hops H., 2003, OREGON RES I UNPUB
[6]  
Low L. A., 2009, EUSIPCO AUG GLASG SC
[7]   Critical analysis of the impact of glottal features in the classification of clinical depression in speech [J].
Moore, Elliot, II ;
Clements, Mark A. ;
Peifer, John W. ;
Weisser, Lydia .
IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2008, 55 (01) :96-107
[8]  
*NHMRC, 1997, DEPR YOUNG PEOPL GUI
[9]   MEASURING THE RATE OF CHANGE OF VOICE FUNDAMENTAL-FREQUENCY IN FLUENT SPEECH DURING MENTAL DEPRESSION [J].
NILSONNE, A ;
SUNDBERG, J ;
TERNSTROM, S ;
ASKENFELT, A .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1988, 83 (02) :716-728
[10]   Interactional processes in families with depressed and non-depressed adolescents: reinforcement of depressive behavior [J].
Sheeber, L ;
Hops, H ;
Andrews, J ;
Alpert, T ;
Davis, B .
BEHAVIOUR RESEARCH AND THERAPY, 1998, 36 (04) :417-427