Using machine learning to detect sarcopenia from electronic health records

被引:15
作者
Luo, Xiao [1 ]
Ding, Haoran [1 ]
Broyles, Andrea [2 ]
Warden, Stuart J. [3 ,4 ]
Moorthi, Ranjani N. [4 ,5 ]
Imel, Erik A. [4 ,5 ]
机构
[1] Indiana Univ Purdue Univ Indianapolis, Sch Engn & Technol, Indianapolis, IN USA
[2] Regenstrief Inst Hlth Care, Indianapolis, IN USA
[3] Indiana Univ, Sch Hlth & Human Sci, Dept Phys Therapy, Indianapolis, IN USA
[4] Indiana Univ Sch Med, Indiana Ctr Musculoskeletal Hlth, Indianapolis, IN 46202 USA
[5] Indiana Univ Sch Med, Dept Med, 1120 West Michigan St,CL 380, Indianapolis, IN 46202 USA
关键词
Sarcopenia; machine learning; health informatics; musculoskeletal; OLDER-ADULTS; FRAILTY; MUSCLE; PREVALENCE; INTERVENTIONS; DEFINITION; DISABILITY; MORTALITY; CONSENSUS; STRENGTH;
D O I
10.1177/20552076231197098
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Introduction: Sarcopenia (low muscle mass and strength) causes dysmobility and loss of independence. Sarcopenia is often not directly coded or described in electronic health records (EHR). The objective was to improve sarcopenia detection using structured data from EHR. Methods: Adults undergoing musculoskeletal testing (December 2017-March 2020) were classified as meeting sarcopenia thresholds for 0 (controls), >= 1 (Sarcopenia-1), or >= 2 (Sarcopenia-2) tests. Electronic health record diagnoses, medications, and laboratory testing were extracted from the Indiana Network for Patient Care. Five machine learning models were applied to EHR data for predicting sarcopenia. Results: Of 1304 participants, 1055 were controls, 249 met Sarcopenia-1 and 76 met Sarcopenia-2. Sarcopenic participants were older, with higher fat mass, Charlson Comorbidity Index, and more chronic diseases. All models performed better for Sarcopenia-2 than Sarcopenia-1. The top performing models for Sarcopenia-1 were Logistic Regression [area under the curve (AUC) 71.59 (95% confidence interval [CI], 71.51-71.66)] and Multi-Layer Perceptron [AUC 71.48 (95%CI, 71.00-71.97)]. The top performing models for Sarcopenia-2 were Logistic Regression [AUC 91.44 (95%CI, 91.28-91.60)] and Support Vector Machine [AUC 90.81 (95%CI, 88.41-93.20)]. For the best Logistic Regression Model, important sarcopenia predictors included diabetes mellitus, digestive system complaints, signs and symptoms involving the nervous, musculoskeletal and respiratory systems, metabolic disorders, and kidney or urinary tract disorders. Opioids, corticosteroids, and anti-hyperlipidemic drugs were also more common among sarcopenic participants. Conclusions: Applying machine learning models, sarcopenia can be predicted from structured data in EHR, which may be developed through future studies to facilitate large-scale early detection and intervention in clinical populations.
引用
收藏
页数:13
相关论文
共 50 条
[21]   Predicting polycystic ovary syndrome with machine learning algorithms from electronic health records [J].
Zad, Zahra ;
Jiang, Victoria S. ;
Wolf, Amber T. ;
Wang, Taiyao ;
Cheng, J. Jojo ;
Paschalidis, Ioannis Ch. ;
Mahalingaiah, Shruthi .
FRONTIERS IN ENDOCRINOLOGY, 2024, 15
[22]   A Machine Learning Algorithm for Identifying Atopic Dermatitis in Adults from Electronic Health Records [J].
Gustafson, Erin ;
Pacheco, Jennifer ;
Wehbe, Firas ;
Silverberg, Jonathan ;
Thompson, William .
2017 IEEE INTERNATIONAL CONFERENCE ON HEALTHCARE INFORMATICS (ICHI), 2017, :83-90
[23]   Machine learning functional impairment classification with electronic health record data [J].
Pavon, Juliessa M. ;
Previll, Laura ;
Woo, Myung ;
Henao, Ricardo ;
Solomon, Mary ;
Rogers, Ursula ;
Olson, Andrew ;
Fischer, Jonathan ;
Leo, Christopher ;
Fillenbaum, Gerda ;
Hoenig, Helen ;
Casarett, David .
JOURNAL OF THE AMERICAN GERIATRICS SOCIETY, 2023, 71 (09) :2822-2833
[24]   Dense phenotyping from electronic health records enables machine learning-based prediction of preterm birth [J].
Abraham, Abin ;
Le, Brian ;
Kosti, Idit ;
Straub, Peter ;
Velez-Edwards, Digna R. ;
Davis, Lea K. ;
Newton, J. M. ;
Muglia, Louis J. ;
Rokas, Antonis ;
Bejan, Cosmin A. ;
Sirota, Marina ;
Capra, John A. .
BMC MEDICINE, 2022, 20 (01)
[25]   Machine Learning for Multimodal Electronic Health Records-Based Research: Challenges and Perspectives [J].
Liu, Ziyi ;
Zhang, Jiaqi ;
Hou, Yongshuai ;
Zhang, Xinran ;
Li, Ge ;
Xiang, Yang .
HEALTH INFORMATION PROCESSING, CHIP 2022, 2023, 1772 :135-155
[26]   Aging Health Behind an Image: Quantifying Sarcopenia and Associated Risk Factors from Advanced CT Analysis and Machine Learning Technologies [J].
Recenti, Marco ;
Gislason, Magnus K. ;
Edmunds, Kyle J. ;
Gargiulo, Paolo .
COMPUTER METHODS, IMAGING AND VISUALIZATION IN BIOMECHANICS AND BIOMEDICAL ENGINEERING, 2020, 36 :188-197
[27]   A deep learning method to detect opioid prescription and opioid use disorder from electronic health records [J].
Kashyap, Aditya ;
Callison-Burch, Chris ;
Boland, Mary Regina .
INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2023, 171
[28]   Machine Learning-Driven Models to Predict Prognostic Outcomes in Patients Hospitalized With Heart Failure Using Electronic Health Records: Retrospective Study [J].
Lv, Haichen ;
Yang, Xiaolei ;
Wang, Bingyi ;
Wang, Shaobo ;
Du, Xiaoyan ;
Tan, Qian ;
Hao, Zhujing ;
Liu, Ying ;
Yan, Jun ;
Xia, Yunlong .
JOURNAL OF MEDICAL INTERNET RESEARCH, 2021, 23 (04)
[29]   Prediction of Venous Thromboembolism in Diverse Populations Using Machine Learning and Structured Electronic Health Records [J].
Chen, Robert ;
Petrazzini, Ben Omega ;
Malick, Waqas A. ;
Rosenson, Robert S. ;
Do, Ron .
ARTERIOSCLEROSIS THROMBOSIS AND VASCULAR BIOLOGY, 2024, 44 (02) :491-504
[30]   Prediction and diagnosis of depression using machine learning with electronic health records data: a systematic review [J].
Nickson, David ;
Meyer, Caroline ;
Walasek, Lukasz ;
Toro, Carla .
BMC MEDICAL INFORMATICS AND DECISION MAKING, 2023, 23 (01)