Predicting future falls in older people using natural language processing of general practitioners' clinical notes

被引:7
|
作者
Dormosh, Noman [1 ,2 ]
Schut, Martijn C. [1 ,3 ,4 ]
Heymans, Martijn W. [5 ,6 ]
Maarsingh, Otto [7 ,8 ]
Bouman, Jonathan [9 ]
van der Velde, Nathalie [10 ,11 ]
Abu-Hanna, Ameen [1 ,2 ]
机构
[1] Univ Amsterdam, Amsterdam UMC, Dept Med Informat, Amsterdam, Netherlands
[2] Amsterdam Publ Hlth, Aging & Later Life & Methodol Amsterdam, Amsterdam, Netherlands
[3] Vrije Univ Amsterdam, Amsterdam UMC, Dept Clin Chem, Amsterdam, Netherlands
[4] Amsterdam Publ Hlth, Methodol & Qual Care, Amsterdam, Netherlands
[5] Vrije Univ Amsterdam, Amsterdam UMC, Dept Epidemiol & Data Sci, Amsterdam, Netherlands
[6] Amsterdam Publ Hlth, Methodol & Personalized Med, Amsterdam, Netherlands
[7] Vrije Univ Amsterdam, Amsterdam UMC, Dept Gen practice, Amsterdam, Netherlands
[8] Amsterdam Publ Hlth, Aging & Later Life & Mental Hlth, Amsterdam, Netherlands
[9] Univ Amsterdam, Amsterdam UMC, Dept Gen Practice, Amsterdam, Netherlands
[10] Univ Amsterdam, Amsterdam UMC, Dept Internal Med, Sect Geriatr Med, Amsterdam, Netherlands
[11] Amsterdam Publ Hlth, Aging & Later Life, Amsterdam, Netherlands
关键词
accidental falls; fall prediction; natural language processing; electronic health records; free text; topic modelling; older people; RISK-FACTORS; ADULTS; CARE; CONSEQUENCES; INFORMATION; CHALLENGES; INJURIES; MODELS;
D O I
10.1093/ageing/afad046
中图分类号
R592 [老年病学]; C [社会科学总论];
学科分类号
03 ; 0303 ; 100203 ;
摘要
Background Falls in older people are common and morbid. Prediction models can help identifying individuals at higher fall risk. Electronic health records (EHR) offer an opportunity to develop automated prediction tools that may help to identify fall-prone individuals and lower clinical workload. However, existing models primarily utilise structured EHR data and neglect information in unstructured data. Using machine learning and natural language processing (NLP), we aimed to examine the predictive performance provided by unstructured clinical notes, and their incremental performance over structured data to predict falls. Methods We used primary care EHR data of people aged 65 or over. We developed three logistic regression models using the least absolute shrinkage and selection operator: one using structured clinical variables (Baseline), one with topics extracted from unstructured clinical notes (Topic-based) and one by adding clinical variables to the extracted topics (Combi). Model performance was assessed in terms of discrimination using the area under the receiver operating characteristic curve (AUC), and calibration by calibration plots. We used 10-fold cross-validation to validate the approach. Results Data of 35,357 individuals were analysed, of which 4,734 experienced falls. Our NLP topic modelling technique discovered 151 topics from the unstructured clinical notes. AUCs and 95% confidence intervals of the Baseline, Topic-based and Combi models were 0.709 (0.700-0.719), 0.685 (0.676-0.694) and 0.718 (0.708-0.727), respectively. All the models showed good calibration. Conclusions Unstructured clinical notes are an additional viable data source to develop and improve prediction models for falls compared to traditional prediction models, but the clinical relevance remains limited.
引用
收藏
页数:11
相关论文
共 50 条
  • [41] Automated identification of wound information in clinical notes of patients with heart diseases: Developing and validating a natural language processing application
    Topaz, Maxim
    Lai, Kenneth
    Dowding, Dawn
    Lei, Victor J.
    Zisberg, Anna
    Bowles, Kathryn H.
    Zhou, Li
    INTERNATIONAL JOURNAL OF NURSING STUDIES, 2016, 64 : 25 - 31
  • [42] Adapting existing natural language processing resources for cardiovascular risk factors identification in clinical notes
    Khalifa, Abdulrahman
    Meystre, Stephane
    JOURNAL OF BIOMEDICAL INFORMATICS, 2015, 58 : S128 - S132
  • [43] Predicting Severity in People with Aphasia: A Natural Language Processing and Machine Learning Approach
    Day, Marjory
    Dey, Rupam Kumar
    Baucum, Matthew
    Paek, Eun Jin
    Park, Hyejin
    Khojandi, Anahita
    2021 43RD ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE & BIOLOGY SOCIETY (EMBC), 2021, : 2299 - 2302
  • [44] Collaboration, not Confrontation: Understanding General Practitioners' Attitudes Towards Natural Language and Text Automation in Clinical Practice
    Navarro, David Fraile
    Kocaballi, A. Baki
    Dras, Mark
    Berkovsky, Shlomo
    ACM TRANSACTIONS ON COMPUTER-HUMAN INTERACTION, 2023, 30 (02)
  • [45] A Natural Language Processing System That Links Medical Terms in Electronic Health Record Notes to Lay Definitions: System Development Using Physician Reviews
    Chen, Jinying
    Druhl, Emily
    Ramesh, Balaji Polepalli
    Houston, Thomas K.
    Brandt, Cynthia A.
    Zulman, Donna M.
    Vimalananda, Varsha G.
    Malkani, Samir
    Yu, Hong
    JOURNAL OF MEDICAL INTERNET RESEARCH, 2018, 20 (01)
  • [46] Predicting Math Performance Using Natural Language Processing Tools
    Crossley, Scott
    Liu, Ran
    McNamara, Danielle
    SEVENTH INTERNATIONAL LEARNING ANALYTICS & KNOWLEDGE CONFERENCE (LAK'17), 2017, : 339 - 347
  • [47] Security in IoT systems using natural language processing: Future challenges and directions
    Kumar, Yogendra
    Kumar, Vijay
    INTERNET TECHNOLOGY LETTERS, 2023, 6 (04)
  • [48] Using Natural Language Processing and Machine Learning to Identify Gout Flares From Electronic Clinical Notes
    Zheng, Chengyi
    Rashid, Nazia
    Wu, Yi-Lin
    Koblick, River
    Lin, Antony T.
    Levy, Gerald D.
    Cheetham, T. Craig
    ARTHRITIS CARE & RESEARCH, 2014, 66 (11) : 1740 - 1748
  • [49] Detecting inpatient falls by using natural language processing of electronic medical records
    Shin-ichi Toyabe
    BMC Health Services Research, 12
  • [50] Natural Language Processing Methods to Extract Lifestyle Exposures for Alzheimer's Disease from Clinical Notes
    Yi, Yoonkwon
    Shen, Zitao
    Anusha, Bompelli
    Fang, Yu
    Wang, Yanshan
    Zhang, Rui
    2020 8TH IEEE INTERNATIONAL CONFERENCE ON HEALTHCARE INFORMATICS (ICHI 2020), 2020, : 535 - 536