Artificial Intelligence-Enabled End-To-End Detection and Assessment of Alzheimer's Disease Using Voice

被引:23
作者
Agbavor, Felix [1 ]
Liang, Hualou [1 ]
机构
[1] Drexel Univ, Sch Biomed Engn Sci & Hlth Syst, Philadelphia, PA 19104 USA
基金
美国国家卫生研究院;
关键词
Alzheimer's disease; dementia; end-to-end; data2vec; large language models; speech and language;
D O I
10.3390/brainsci13010028
中图分类号
Q189 [神经科学];
学科分类号
071006 ;
摘要
There is currently no simple, widely available screening method for Alzheimer's disease (AD), partly because the diagnosis of AD is complex and typically involves expensive and sometimes invasive tests not commonly available outside highly specialized clinical settings. Here, we developed an artificial intelligence (AI)-powered end-to-end system to detect AD and predict its severity directly from voice recordings. At the core of our system is the pre-trained data2vec model, the first high-performance self-supervised algorithm that works for speech, vision, and text. Our model was internally evaluated on the ADReSSo (Alzheimer's Dementia Recognition through Spontaneous Speech only) dataset containing voice recordings of subjects describing the Cookie Theft picture, and externally validated on a test dataset from DementiaBank. The AI model can detect AD with average area under the curve (AUC) of 0.846 and 0.835 on held-out and external test set, respectively. The model was well-calibrated (Hosmer-Lemeshow goodness-of-fit p-value = 0.9616). Moreover, the model can reliably predict the subject's cognitive testing score solely based on raw voice recordings. Our study demonstrates the feasibility of using the AI-powered end-to-end model for early AD diagnosis and severity prediction directly based on voice, showing its potential for screening Alzheimer's disease in a community setting.
引用
收藏
页数:13
相关论文
共 44 条
[31]  
Meek PD, 1998, PHARMACOTHERAPY, V18, P68
[32]  
Murphy A. H., 1977, Journal of The Royal Statistical Society Series C-applied Statistics, V26, P41
[33]   The montreal cognitive assessment, MoCA:: A brief screening tool for mild cognitive impairment [J].
Nasreddine, ZS ;
Phillips, NA ;
Bédirian, V ;
Charbonneau, S ;
Whitehead, V ;
Collin, I ;
Cummings, JL ;
Chertkow, H .
JOURNAL OF THE AMERICAN GERIATRICS SOCIETY, 2005, 53 (04) :695-699
[34]   Using the Outputs of Different Automatic Speech Recognition Paradigms for Acoustic- and BERT-based Alzheimer's Dementia Detection through Spontaneous Speech [J].
Pan, Yilin ;
Mirheidari, Bahman ;
Harris, Jennifer M. ;
Thompson, Jennifer C. ;
Jones, Matthew ;
Snowden, Julie S. ;
Blackburn, Daniel ;
Christensen, Heidi .
INTERSPEECH 2021, 2021, :3810-3814
[35]  
Panayotov V, 2015, INT CONF ACOUST SPEE, P5206, DOI 10.1109/ICASSP.2015.7178964
[36]  
ROBERTSON T., 1988, Order restricted statistical inference
[37]   THE CENTRAL ROLE OF THE PROPENSITY SCORE IN OBSERVATIONAL STUDIES FOR CAUSAL EFFECTS [J].
ROSENBAUM, PR ;
RUBIN, DB .
BIOMETRIKA, 1983, 70 (01) :41-55
[38]  
Seeley W.W., 2018, Harrison's Principles of Internal Medicine, 20e, V20
[39]   Mini-Cog for the diagnosis of Alzheimer's disease dementia and other dementias within a primary care setting [J].
Seitz, Dallas P. ;
Chan, Calvin C. H. ;
Newton, Hailey T. ;
Gill, Sudeep S. ;
Herrmann, Nathan ;
Smailagic, Nadja ;
Nikolaou, Vasilis ;
Fage, Bruce A. .
COCHRANE DATABASE OF SYSTEMATIC REVIEWS, 2018, (02)
[40]  
Weiner MW., 2013, ALZHEIMERS DEMENT, V9, pe111