Artificial Intelligence-Enabled End-To-End Detection and Assessment of Alzheimer's Disease Using Voice

被引:23
作者
Agbavor, Felix [1 ]
Liang, Hualou [1 ]
机构
[1] Drexel Univ, Sch Biomed Engn Sci & Hlth Syst, Philadelphia, PA 19104 USA
基金
美国国家卫生研究院;
关键词
Alzheimer's disease; dementia; end-to-end; data2vec; large language models; speech and language;
D O I
10.3390/brainsci13010028
中图分类号
Q189 [神经科学];
学科分类号
071006 ;
摘要
There is currently no simple, widely available screening method for Alzheimer's disease (AD), partly because the diagnosis of AD is complex and typically involves expensive and sometimes invasive tests not commonly available outside highly specialized clinical settings. Here, we developed an artificial intelligence (AI)-powered end-to-end system to detect AD and predict its severity directly from voice recordings. At the core of our system is the pre-trained data2vec model, the first high-performance self-supervised algorithm that works for speech, vision, and text. Our model was internally evaluated on the ADReSSo (Alzheimer's Dementia Recognition through Spontaneous Speech only) dataset containing voice recordings of subjects describing the Cookie Theft picture, and externally validated on a test dataset from DementiaBank. The AI model can detect AD with average area under the curve (AUC) of 0.846 and 0.835 on held-out and external test set, respectively. The model was well-calibrated (Hosmer-Lemeshow goodness-of-fit p-value = 0.9616). Moreover, the model can reliably predict the subject's cognitive testing score solely based on raw voice recordings. Our study demonstrates the feasibility of using the AI-powered end-to-end model for early AD diagnosis and severity prediction directly based on voice, showing its potential for screening Alzheimer's disease in a community setting.
引用
收藏
页数:13
相关论文
共 44 条
[1]   Predicting dementia from spontaneous speech using large language models [J].
Agbavor, Felix ;
Liang, Hualou .
PLOS DIGITAL HEALTH, 2022, 1 (12)
[2]   Automated detection of mild cognitive impairment and dementia from voice recordings: A natural language processing approach [J].
Amini, Samad ;
Hao, Boran ;
Zhang, Lifu ;
Song, Mengting ;
Gupta, Aman ;
Karjadi, Cody ;
Kolachalama, Vijaya B. ;
Au, Rhoda ;
Paschalidis, Ioannis Ch .
ALZHEIMERS & DEMENTIA, 2023, 19 (03) :946-955
[3]  
[Anonymous], 1993, An introduction to the bootstrap, monographs on statistics and applied probability
[4]  
Baevski A, 2020, ADV NEUR IN, V33
[5]  
Baevski A, 2022, Arxiv, DOI [arXiv:2202.03555, DOI 10.48550/ARXIV.2202.03555]
[6]  
Balagopalan A, 2021, Arxiv, DOI arXiv:2106.01555
[7]  
Balagopalan A, 2020, Arxiv, DOI arXiv:2008.01551
[8]   THE NATURAL-HISTORY OF ALZHEIMERS-DISEASE - DESCRIPTION OF STUDY COHORT AND ACCURACY OF DIAGNOSIS [J].
BECKER, JT ;
BOLLER, F ;
LOPEZ, OL ;
SAXTON, J ;
MCGONIGLE, KL ;
MOOSSY, J ;
HANIN, I ;
WOLFSON, SK ;
DETRE, K ;
HOLLAND, A ;
GUR, D ;
LATCHAW, R ;
BRENNER, R .
ARCHIVES OF NEUROLOGY, 1994, 51 (06) :585-594
[9]  
DEGROOT MH, 1983, J ROY STAT SOC D-STA, V32, P12
[10]   COMPARING THE AREAS UNDER 2 OR MORE CORRELATED RECEIVER OPERATING CHARACTERISTIC CURVES - A NONPARAMETRIC APPROACH [J].
DELONG, ER ;
DELONG, DM ;
CLARKEPEARSON, DI .
BIOMETRICS, 1988, 44 (03) :837-845