Using machine learning to detect sarcopenia from electronic health records

被引:15
作者
Luo, Xiao [1 ]
Ding, Haoran [1 ]
Broyles, Andrea [2 ]
Warden, Stuart J. [3 ,4 ]
Moorthi, Ranjani N. [4 ,5 ]
Imel, Erik A. [4 ,5 ]
机构
[1] Indiana Univ Purdue Univ Indianapolis, Sch Engn & Technol, Indianapolis, IN USA
[2] Regenstrief Inst Hlth Care, Indianapolis, IN USA
[3] Indiana Univ, Sch Hlth & Human Sci, Dept Phys Therapy, Indianapolis, IN USA
[4] Indiana Univ Sch Med, Indiana Ctr Musculoskeletal Hlth, Indianapolis, IN 46202 USA
[5] Indiana Univ Sch Med, Dept Med, 1120 West Michigan St,CL 380, Indianapolis, IN 46202 USA
关键词
Sarcopenia; machine learning; health informatics; musculoskeletal; OLDER-ADULTS; FRAILTY; MUSCLE; PREVALENCE; INTERVENTIONS; DEFINITION; DISABILITY; MORTALITY; CONSENSUS; STRENGTH;
D O I
10.1177/20552076231197098
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Introduction: Sarcopenia (low muscle mass and strength) causes dysmobility and loss of independence. Sarcopenia is often not directly coded or described in electronic health records (EHR). The objective was to improve sarcopenia detection using structured data from EHR. Methods: Adults undergoing musculoskeletal testing (December 2017-March 2020) were classified as meeting sarcopenia thresholds for 0 (controls), >= 1 (Sarcopenia-1), or >= 2 (Sarcopenia-2) tests. Electronic health record diagnoses, medications, and laboratory testing were extracted from the Indiana Network for Patient Care. Five machine learning models were applied to EHR data for predicting sarcopenia. Results: Of 1304 participants, 1055 were controls, 249 met Sarcopenia-1 and 76 met Sarcopenia-2. Sarcopenic participants were older, with higher fat mass, Charlson Comorbidity Index, and more chronic diseases. All models performed better for Sarcopenia-2 than Sarcopenia-1. The top performing models for Sarcopenia-1 were Logistic Regression [area under the curve (AUC) 71.59 (95% confidence interval [CI], 71.51-71.66)] and Multi-Layer Perceptron [AUC 71.48 (95%CI, 71.00-71.97)]. The top performing models for Sarcopenia-2 were Logistic Regression [AUC 91.44 (95%CI, 91.28-91.60)] and Support Vector Machine [AUC 90.81 (95%CI, 88.41-93.20)]. For the best Logistic Regression Model, important sarcopenia predictors included diabetes mellitus, digestive system complaints, signs and symptoms involving the nervous, musculoskeletal and respiratory systems, metabolic disorders, and kidney or urinary tract disorders. Opioids, corticosteroids, and anti-hyperlipidemic drugs were also more common among sarcopenic participants. Conclusions: Applying machine learning models, sarcopenia can be predicted from structured data in EHR, which may be developed through future studies to facilitate large-scale early detection and intervention in clinical populations.
引用
收藏
页数:13
相关论文
共 50 条
[31]   Predicting Glaucoma Surgical Outcomes Using Neural Networks and Machine Learning on Electronic Health Records [J].
Barry, Samuel ;
Wang, Sophia Y. .
TRANSLATIONAL VISION SCIENCE & TECHNOLOGY, 2024, 13 (06)
[32]   Prediction of Endometrial Carcinoma Using the Combination of Electronic Health Records and an Ensemble Machine Learning Method [J].
Wang, Wenwen ;
Xu, Yang ;
Yuan, Suzhen ;
Li, Zhiying ;
Zhu, Xin ;
Zhou, Qin ;
Shen, Wenfeng ;
Wang, Shixuan .
FRONTIERS IN MEDICINE, 2022, 9
[33]   Predicting neurodevelopmental disorders using machine learning models and electronic health records - status of the field [J].
Rajagopalan, Shyam Sundar ;
Tammimies, Kristiina .
JOURNAL OF NEURODEVELOPMENTAL DISORDERS, 2024, 16 (01)
[34]   Dynamic Delirium Prediction in the Intensive Care Unit using Machine Learning on Electronic Health Records [J].
Contreras, Miguel ;
Silva, Brandon ;
Shickel, Benjamin ;
Bandyopadhyay, Sabyasachi ;
Guan, Ziyuan ;
Ren, Yuanfang ;
Ozrazgat-Baslanti, Tezcan ;
Khezeli, Kia ;
Bihorac, Azra ;
Rashidi, Parisa .
2023 IEEE EMBS INTERNATIONAL CONFERENCE ON BIOMEDICAL AND HEALTH INFORMATICS, BHI, 2023,
[35]   Prediction and diagnosis of depression using machine learning with electronic health records data: a systematic review [J].
David Nickson ;
Caroline Meyer ;
Lukasz Walasek ;
Carla Toro .
BMC Medical Informatics and Decision Making, 23
[36]   Machine learning approaches for electronic health records phenotyping: a methodical review [J].
Yang, Siyue ;
Varghese, Paul ;
Stephenson, Ellen ;
Tu, Karen ;
Gronsbell, Jessica .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2023, 30 (02) :367-381
[37]   Machine Learning Algorithms to Predict Breast Cancer Recurrence Using Structured and Unstructured Sources from Electronic Health Records [J].
Gonzalez-Castro, Lorena ;
Chavez, Marcela ;
Duflot, Patrick ;
Bleret, Valerie ;
Martin, Alistair G. ;
Zobel, Marc ;
Nateqi, Jama ;
Lin, Simon ;
Pazos-Arias, Jose J. ;
Del Fiol, Guilherme ;
Lopez-Nores, Martin .
CANCERS, 2023, 15 (10)
[38]   Machine Learning-Based Identification of Obesity from Positive and Unlabelled Electronic Health Records [J].
Blanes-Selva, Vicent ;
Tortajada, Salvador ;
Vilar, Ruth ;
Valdivieso, Bernardo ;
Garcia-Gomez, Juan M. .
DIGITAL PERSONALIZED HEALTH AND MEDICINE, 2020, 270 :864-868
[39]   Bias or biology? Importance of model interpretation in machine learning studies from electronic health records [J].
Momenzadeh, Amanda ;
Shamsa, Ali ;
Meyer, Jesse G. .
JAMIA OPEN, 2022, 5 (03)
[40]   Machine Learning Prediction of Hypoglycemia and Hyperglycemia From Electronic Health Records: Algorithm Development and Validation [J].
Witte, Harald ;
Nakas, Christos ;
Bally, Lia ;
Leichtle, Alexander Benedikt .
JMIR FORMATIVE RESEARCH, 2022, 6 (07)