Extraction of sleep information from clinical notes of Alzheimer's disease patients using natural language processing

被引:1
作者
Sivarajkumar, Sonish [1 ]
Tam, Thomas Yu Chow [2 ]
Mohammad, Haneef Ahamed [2 ]
Viggiano, Samuel [2 ]
Oniani, David [2 ]
Visweswaran, Shyam [1 ,3 ]
Wang, Yanshan [1 ,2 ,3 ,4 ]
机构
[1] Univ Pittsburgh, Intelligent Syst Program, Pittsburgh, PA 15260 USA
[2] Univ Pittsburgh, Dept Hlth Informat Management, 6026 Forbes Tower, Pittsburgh, PA 15260 USA
[3] Univ Pittsburgh, Dept Biomed Informat, Pittsburgh, PA 15260 USA
[4] Univ Pittsburgh, Clin & Translat Sci Inst, Pittsburgh, PA 15260 USA
基金
美国国家卫生研究院;
关键词
natural language processing; Alzheimer's disease; sleep; electronic health records; information extraction; clinical notes; ELECTRONIC HEALTH RECORDS; COGNITIVE DECLINE; DAYTIME SLEEPINESS; DEMENTIA; RISK; DISORDERS; DURATION; LIFE;
D O I
10.1093/jamia/ocae177
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Objectives Alzheimer's disease (AD) is the most common form of dementia in the United States. Sleep is one of the lifestyle-related factors that has been shown critical for optimal cognitive function in old age. However, there is a lack of research studying the association between sleep and AD incidence. A major bottleneck for conducting such research is that the traditional way to acquire sleep information is time-consuming, inefficient, non-scalable, and limited to patients' subjective experience. We aim to automate the extraction of specific sleep-related patterns, such as snoring, napping, poor sleep quality, daytime sleepiness, night wakings, other sleep problems, and sleep duration, from clinical notes of AD patients. These sleep patterns are hypothesized to play a role in the incidence of AD, providing insight into the relationship between sleep and AD onset and progression.Materials and Methods A gold standard dataset is created from manual annotation of 570 randomly sampled clinical note documents from the adSLEEP, a corpus of 192 000 de-identified clinical notes of 7266 AD patients retrieved from the University of Pittsburgh Medical Center (UPMC). We developed a rule-based natural language processing (NLP) algorithm, machine learning models, and large language model (LLM)-based NLP algorithms to automate the extraction of sleep-related concepts, including snoring, napping, sleep problem, bad sleep quality, daytime sleepiness, night wakings, and sleep duration, from the gold standard dataset.Results The annotated dataset of 482 patients comprised a predominantly White (89.2%), older adult population with an average age of 84.7 years, where females represented 64.1%, and a vast majority were non-Hispanic or Latino (94.6%). Rule-based NLP algorithm achieved the best performance of F1 across all sleep-related concepts. In terms of positive predictive value (PPV), the rule-based NLP algorithm achieved the highest PPV scores for daytime sleepiness (1.00) and sleep duration (1.00), while the machine learning models had the highest PPV for napping (0.95) and bad sleep quality (0.86), and LLAMA2 with finetuning had the highest PPV for night wakings (0.93) and sleep problem (0.89).Discussion Although sleep information is infrequently documented in the clinical notes, the proposed rule-based NLP algorithm and LLM-based NLP algorithms still achieved promising results. In comparison, the machine learning-based approaches did not achieve good results, which is due to the small size of sleep information in the training data.Conclusion The results show that the rule-based NLP algorithm consistently achieved the best performance for all sleep concepts. This study focused on the clinical notes of patients with AD but could be extended to general sleep information extraction for other diseases.
引用
收藏
页码:2217 / 2227
页数:11
相关论文
共 47 条
  • [1] 2018 Alzheimer's disease facts and figures
    不详
    [J]. ALZHEIMERS & DEMENTIA, 2018, 14 (03) : 367 - 425
  • [2] Cross-sectional and longitudinal association of sleep and Alzheimer biomarkers in cognitively unimpaired adults
    Blackman, Jonathan
    Stankeviciute, Laura
    Arenaza-Urquijo, Eider M.
    Suarez-Calvet, Marc
    Sanchez-Benavides, Gonzalo
    Vilor-Tejedor, Natalia
    Iranzo, Alejandro
    Molinuevo, Jose Luis
    Gispert, Juan Domingo
    Coulthard, Elizabeth
    Grau-Rivera, Oriol
    [J]. BRAIN COMMUNICATIONS, 2022, 4 (06)
  • [3] Launching HITECH
    Blumenthal, David
    [J]. NEW ENGLAND JOURNAL OF MEDICINE, 2010, 362 (05) : 382 - 385
  • [4] Psychosocial risk factors and Alzheimer's disease: the associative effect of depression, sleep disturbance, and anxiety
    Burke, Shanna L.
    Cadet, Tamara
    Alcide, Amary
    O'Driscoll, Janice
    Maramaldi, Peter
    [J]. AGING & MENTAL HEALTH, 2018, 22 (12) : 1577 - 1584
  • [5] Health care resource utilisation in primary care prior to and after a diagnosis of Alzheimer's disease: a retrospective, matched case-control study in the United Kingdom
    Chen, Lei
    Reed, Catherine
    Happich, Michael
    Nyhuis, Allen
    Lenox-Smith, Alan
    [J]. BMC GERIATRICS, 2014, 14
  • [6] Longitudinal associations between sleep duration and cognitive impairment in Chinese elderly
    Chen, Wei-chao
    Wang, Xiao-yan
    [J]. FRONTIERS IN AGING NEUROSCIENCE, 2022, 14
  • [7] The impact of insomnia on cognitive functioning in older adults
    Cricco, M
    Simonsick, EM
    Foley, DJ
    [J]. JOURNAL OF THE AMERICAN GERIATRICS SOCIETY, 2001, 49 (09) : 1185 - 1189
  • [8] Using weak supervision and deep learning to classify clinical notes for identification of current suicidal ideation
    Cusick, Marika
    Adekkanattu, Prakash
    Campion, Thomas R., Jr.
    Sholle, Evan T.
    Myers, Annie
    Banerjee, Samprit
    Alexopoulos, George
    Wang, Yanshan
    Pathak, Jyotishman
    [J]. JOURNAL OF PSYCHIATRIC RESEARCH, 2021, 136 : 95 - 102
  • [9] Sleep Duration in Relation to Cognitive Function among Older Adults: A Systematic Review of Observational Studies
    Devore, Elizabeth E.
    Grodstein, Francine
    Schernhammer, Eva S.
    [J]. NEUROEPIDEMIOLOGY, 2016, 46 (01) : 57 - 78
  • [10] General Symptom Extraction from VA Electronic Medical Notes
    Divita, Guy
    Luo, Gang
    Tran, Le-Thuy T.
    Workman, Elizabeth
    Gundlapalli, Adi V.
    Samore, Matthew H.
    [J]. MEDINFO 2017: PRECISION HEALTHCARE THROUGH INFORMATICS, 2017, 245 : 356 - 360