Depression assessment in people with Parkinson's disease: The combination of acoustic features and natural language processing

被引:5
作者
Andrea Perez-Toro, Paula [1 ,2 ]
Arias-Vergara, Tomas [1 ,2 ,3 ]
Klumpp, Philipp [1 ]
Camilo Vasquez-Correa, Juan [2 ]
Schuster, Maria [3 ]
Noeth, Elmar [1 ]
Rafael Orozco-Arroyave, Juan [1 ,2 ]
机构
[1] Friedrich Alexander Univ, Pattern Recognit Lab, Erlangen, Germany
[2] Univ Antioquia UdeA, Fac Ingn, Calle 70 52-21, Medellin, Colombia
[3] Ludwig Maximilians Univ Munchen, Dept Otorhinolaryngol Head & Neck Surg, Munich, Germany
关键词
Parkinson?s disease; Natural language processing; Acoustics analysis; Depression; QUALITY-OF-LIFE; PSYCHOMOTOR RETARDATION; EMOTION RECOGNITION; DISORDERS; SYMPTOMS; MODELS; AUDIO;
D O I
10.1016/j.specom.2022.09.001
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Parkinson's disease produces motor impairments such as bradykinesia, rigidity, and different speech impair-ments, as same as non-motor symptoms like cognitive decline and depression disturbances. Most studies are focused on the analysis of motor symptoms, and only few works study non-motor impairments. Depression is one of the typical non-motor symptoms developed by many Parkinson's patients. Impairments in speech production together with depression produce negative effects in the communication capabilities and social interaction of patients. This study proposes a combination of speech analysis and natural language processing methods to extract features from spontaneous speech utterances and their transcripts. We consider state-of-the-art word-embedding methods like Bidirectional Encoder Representations from Transformer (BERT) to process the transcripts, and traditional acoustic features such as Bark band energies and Mel frequency cepstral coefficients to model the speech signals. The features are processed with supervectors generated by Gaussian Mixture Model-Universal Background Model (GMM-UBM) and Support Vector Machine (SVM) classifiers. The dataset consists of 60 Parkinson's patients divided into two classes according to the depression item of the MDS-UPDRS. The automatic classification of depressed and non-depressed Parkinson's patients showed F-scores of up to 0.77, which confirms that acoustic and linguistic information embedded in language production can be used for depression analysis in Parkinson's patients. We present one of the few studies that evaluates depression in Parkinson's patients considering the combination of acoustic and linguistic information.
引用
收藏
页码:10 / 20
页数:11
相关论文
共 59 条
[1]  
[Anonymous], 1989, Cognition and Emotion, DOI [10.1080/02699938908408075, DOI 10.1080/02699938908408075]
[2]  
Antoniou P, 2017, COUNS PSYCHOTHER RES, V17, P137, DOI 10.1002/capr.12116
[3]   Speaker models for monitoring Parkinson's disease progression considering different communication channels and acoustic conditions [J].
Arias-Vergara, T. ;
Vasquez-Correa, J. C. ;
Orozco-Arroyave, J. R. ;
Noeth, E. .
SPEECH COMMUNICATION, 2018, 101 :11-25
[4]   Fully automated assessment of the severity of Parkinson's disease from speech [J].
Bayestehtashk, Alireza ;
Asgari, Meysam ;
Shafran, Izhak ;
McNames, James .
COMPUTER SPEECH AND LANGUAGE, 2015, 29 (01) :172-185
[5]   Deep neural networks for emotion recognition combining audio and transcripts [J].
Cho, Jaejin ;
Pappagari, Raghavendra ;
Kulkarni, Purva ;
Villalba, Jesus ;
Carmiel, Yishay ;
Dehak, Najim .
19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, :247-251
[6]   INTERVIEWER EFFECT ON RESPONSES TO A QUESTIONNAIRE RELATING TO MOOD [J].
CHOI, IC ;
COMSTOCK, GW .
AMERICAN JOURNAL OF EPIDEMIOLOGY, 1975, 101 (01) :84-92
[7]   Analysis of acoustic space variability in speech affected by depression [J].
Cummins, Nicholas ;
Sethu, Vidhyasaharan ;
Epps, Julien ;
Schnieder, Sebastian ;
Krajewski, Jarek .
SPEECH COMMUNICATION, 2015, 75 :27-49
[8]   A review of depression and suicide risk assessment using speech analysis [J].
Cummins, Nicholas ;
Scherer, Stefan ;
Krajewski, Jarek ;
Schnieder, Sebastian ;
Epps, Julien ;
Quatieri, Thomas F. .
SPEECH COMMUNICATION, 2015, 71 :10-49
[9]   Modeling prosodic features with joint factor analysis for speaker verification [J].
Dehak, Najim ;
Dumouchel, Pierre ;
Kenny, Patrick .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (07) :2095-2103
[10]  
Devlin J, 2019, Arxiv, DOI arXiv:1810.04805