Predicting future falls in older people using natural language processing of general practitioners' clinical notes

Cited by: 7
Authors
Dormosh, Noman [1, 2]
Schut, Martijn C. [1, 3, 4]
Heymans, Martijn W. [5, 6]
Maarsingh, Otto [7, 8]
Bouman, Jonathan [9]
van der Velde, Nathalie [10, 11]
Abu-Hanna, Ameen [1, 2]
Affiliations
[1] Univ Amsterdam, Amsterdam UMC, Dept Med Informat, Amsterdam, Netherlands
[2] Amsterdam Publ Hlth, Aging & Later Life & Methodol Amsterdam, Amsterdam, Netherlands
[3] Vrije Univ Amsterdam, Amsterdam UMC, Dept Clin Chem, Amsterdam, Netherlands
[4] Amsterdam Publ Hlth, Methodol & Qual Care, Amsterdam, Netherlands
[5] Vrije Univ Amsterdam, Amsterdam UMC, Dept Epidemiol & Data Sci, Amsterdam, Netherlands
[6] Amsterdam Publ Hlth, Methodol & Personalized Med, Amsterdam, Netherlands
[7] Vrije Univ Amsterdam, Amsterdam UMC, Dept Gen Practice, Amsterdam, Netherlands
[8] Amsterdam Publ Hlth, Aging & Later Life & Mental Hlth, Amsterdam, Netherlands
[9] Univ Amsterdam, Amsterdam UMC, Dept Gen Practice, Amsterdam, Netherlands
[10] Univ Amsterdam, Amsterdam UMC, Dept Internal Med, Sect Geriatr Med, Amsterdam, Netherlands
[11] Amsterdam Publ Hlth, Aging & Later Life, Amsterdam, Netherlands
Keywords
accidental falls; fall prediction; natural language processing; electronic health records; free text; topic modelling; older people; RISK-FACTORS; ADULTS; CARE; CONSEQUENCES; INFORMATION; CHALLENGES; INJURIES; MODELS;
DOI
10.1093/ageing/afad046
Chinese Library Classification (CLC) number
R592 [Geriatrics]; C [Social Sciences, General]
Discipline classification code
03; 0303; 100203
Abstract
Background: Falls in older people are common and morbid. Prediction models can help identify individuals at higher fall risk. Electronic health records (EHR) offer an opportunity to develop automated prediction tools that may help to identify fall-prone individuals and lower clinical workload. However, existing models primarily utilise structured EHR data and neglect information in unstructured data. Using machine learning and natural language processing (NLP), we aimed to examine the predictive performance provided by unstructured clinical notes, and their incremental performance over structured data, in predicting falls.
Methods: We used primary care EHR data of people aged 65 or over. We developed three logistic regression models using the least absolute shrinkage and selection operator (LASSO): one using structured clinical variables (Baseline), one using topics extracted from unstructured clinical notes (Topic-based) and one adding clinical variables to the extracted topics (Combi). Model performance was assessed in terms of discrimination, using the area under the receiver operating characteristic curve (AUC), and calibration, using calibration plots. We used 10-fold cross-validation to validate the approach.
Results: Data of 35,357 individuals were analysed, of whom 4,734 experienced falls. Our NLP topic modelling technique discovered 151 topics from the unstructured clinical notes. AUCs and 95% confidence intervals of the Baseline, Topic-based and Combi models were 0.709 (0.700-0.719), 0.685 (0.676-0.694) and 0.718 (0.708-0.727), respectively. All the models showed good calibration.
Conclusions: Unstructured clinical notes are an additional viable data source to develop and improve prediction models for falls compared to traditional prediction models, but the clinical relevance remains limited.
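The evaluation scheme described in the Methods (an L1-penalised logistic regression scored by AUC under 10-fold cross-validation) can be illustrated with a minimal sketch. This is not the authors' code: the data are synthetic, the topic-extraction step is not shown, and the fitting routine is a plain proximal gradient method chosen only to keep the example dependency-free. All function names (`auc`, `fit_l1_logistic`, `cross_validated_auc`, `make_synthetic`) and all hyperparameter values are hypothetical.

```python
import math
import random


def auc(labels, scores):
    """AUC as the share of positive/negative pairs ranked correctly (ties count 0.5)."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def sigmoid(z):
    """Numerically stable logistic function."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def soft_threshold(value, threshold):
    """Proximal operator of the L1 penalty: shrink towards zero by `threshold`."""
    return math.copysign(max(abs(value) - threshold, 0.0), value)


def fit_l1_logistic(X, y, lam=0.01, lr=0.5, epochs=200):
    """L1-penalised logistic regression via proximal gradient descent (assumed settings)."""
    n, d = len(X), len(X[0])
    w, b = [0.0] * d, 0.0
    for _ in range(epochs):
        grad_w, grad_b = [0.0] * d, 0.0
        for xi, yi in zip(X, y):
            err = sigmoid(b + sum(wj * xj for wj, xj in zip(w, xi))) - yi
            grad_b += err
            for j in range(d):
                grad_w[j] += err * xi[j]
        b -= lr * grad_b / n  # the intercept is not penalised
        w = [soft_threshold(w[j] - lr * grad_w[j] / n, lr * lam) for j in range(d)]
    return w, b


def predict_proba(w, b, x):
    return sigmoid(b + sum(wj * xj for wj, xj in zip(w, x)))


def cross_validated_auc(X, y, k=10, seed=0):
    """Pool out-of-fold predictions from k folds and compute a single AUC."""
    idx = list(range(len(X)))
    random.Random(seed).shuffle(idx)
    folds = [idx[i::k] for i in range(k)]
    out_of_fold = [0.0] * len(X)
    for fold in folds:
        held_out = set(fold)
        train = [i for i in idx if i not in held_out]
        w, b = fit_l1_logistic([X[i] for i in train], [y[i] for i in train])
        for i in fold:
            out_of_fold[i] = predict_proba(w, b, X[i])
    return auc(y, out_of_fold)


def make_synthetic(n=200, seed=1):
    """Synthetic stand-in data: feature 0 is informative, features 1 and 2 are noise."""
    rng = random.Random(seed)
    X, y = [], []
    for _ in range(n):
        x = [rng.gauss(0, 1) for _ in range(3)]
        X.append(x)
        y.append(1 if x[0] + 0.5 * rng.gauss(0, 1) > 0 else 0)
    return X, y


if __name__ == "__main__":
    X, y = make_synthetic()
    print(round(cross_validated_auc(X, y), 3))
```

In this sketch each row of `X` would stand for one person's feature vector (clinical variables, topic weights, or both), so the Baseline, Topic-based and Combi comparison amounts to calling `cross_validated_auc` on three different feature sets. Confidence intervals and calibration plots, which the abstract also reports, are not covered here.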
Pages: 11