Sarcopenia feature selection and risk prediction using machine learning A cross-sectional study

被引:31
|
作者
Kang, Yang-Jae [1 ,2 ]
Yoo, Jun-Il [3 ]
Ha, Yong-chan [4 ]
机构
[1] Gyeongsang Natl Univ Hosp, PMBBRC, Div Appl Life Sci Dept, Jinju, South Korea
[2] Gyeongsang Natl Univ Hosp, Div Life Sci Dept, Jinju, South Korea
[3] Gyeongsang Natl Univ Hosp, Dept Orthopaed Surg, 90 Chilamdong, Jinju 660702, Gyeongnamdo, South Korea
[4] Chung Ang Univ, Coll Med, Dept Orthopaed Surg, Seoul, South Korea
关键词
feature selection; machine learning; risk prediction; sarcopenia; MUSCLE MASS; HEALTH;
D O I
10.1097/MD.0000000000017699
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
The purpose of this study was to verify the usefulness of machine learning (ML) for selection of risk factors and development of predictive models for patients with sarcopenia. We collected medical records from Korean postmenopausal women based on Korea National Health and Nutrition Examination Surveys. A training data set compiled from simple survey data was used to construct models based on popular ML algorithms (e.g., support vector machine, random forest [RF], and logistic regression). A total of 4020 patients >= 65 years of age were enrolled in this study. The study population consisted of 1698 (42.2%) male and 2322 (57.8%) female patients. The 10 most important risk factors in men were bodymass index (BMI), red blood cell (RBC) count, blood urea nitrogen (BUN), vitamin D, ferritin, fiber intake (g/d), primary diastolic blood pressure, white blood cell (WBC) count, fat intake (g/d), age, glutamic-pyruvic transaminase, niacin intake (mg/d), protein intake (g/d), fasting blood sugar, and water intake (g/d). The 10 most important risk factors in women were BMI, water intake (g/d), WBC, RBC count, iron intake (mg/d), BUN, high-density lipoprotein, protein intake (g/d), fiber consumption (g/d), vitamin C intake (mg/d), parathyroid hormone, niacin intake (mg/d), carotene intake (mg/d), potassiumintake (mg/d), calcium intake (mg/d), sodiumintake (mg/d), retinol intake (mg/d), and age. A receiver operating characteristic (ROC) curve analysis found that the area under the ROC curve for each ML model was not significantly different within a gender. The most cost-effective method in clinical practice is to make feature selection using RF models and expert knowledge and to make disease prediction using verification by several ML models. However, the developed prediction model should be validated using additional studies.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] Sarcopenia feature selection and risk prediction using machine learning
    Yoo, Jun-Il
    Park, Chan-Ho
    Kim, Hyeonmok
    JOURNAL OF BONE AND MINERAL RESEARCH, 2019, 34 : 145 - 145
  • [2] Sarcopenia risk prediction and feature selection by using quantum machine learning algorithms
    Ullah, Ubaid
    Maheshwari, Danyal
    Castillo Olea, Cristian
    Zapirain, Begonya Garcia
    QUANTUM MACHINE INTELLIGENCE, 2024, 6 (02)
  • [3] A Cross-Sectional Machine Learning Approach for Hedge Fund Return Prediction and Selection
    Wu, Wenbo
    Chen, Jiaqi
    Yang, Zhibin
    Tindall, Michael L.
    MANAGEMENT SCIENCE, 2021, 67 (07) : 4577 - 4601
  • [4] Prediction of Diabetes Using Data Mining and Machine Learning Algorithms: A Cross-Sectional Study
    Shojaee-Mend, Hassan
    Velayati, Farnia
    Tayefi, Batool
    Babaee, Ebrahim
    HEALTHCARE INFORMATICS RESEARCH, 2024, 30 (01) : 73 - 82
  • [5] Prediction of Suicidal Ideation among Korean Adults Using Machine Learning: A Cross-Sectional Study
    Oh, Bumjo
    Yun, Je-Yeon
    Yeo, Eun Chong
    Kim, Dong-Hoi
    Kim, Jin
    Cho, Bum-Joo
    PSYCHIATRY INVESTIGATION, 2020, 17 (04) : 331 - 340
  • [6] Diabetic retinopathy risk prediction for fundus examination using sparse learning: a cross-sectional study
    Oh, Ein
    Yoo, Tae Keun
    Park, Eun-Cheol
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2013, 13
  • [7] Diabetic retinopathy risk prediction for fundus examination using sparse learning: a cross-sectional study
    Ein Oh
    Tae Keun Yoo
    Eun-Cheol Park
    BMC Medical Informatics and Decision Making, 13
  • [8] Machine learning techniques for cross-sectional equity returns' prediction
    Fieberg, Christian
    Metko, Daniel
    Poddig, Thorsten
    Loy, Thomas
    OR SPECTRUM, 2023, 45 (01) : 289 - 323
  • [9] Machine learning techniques for cross-sectional equity returns’ prediction
    Christian Fieberg
    Daniel Metko
    Thorsten Poddig
    Thomas Loy
    OR Spectrum, 2023, 45 : 289 - 323
  • [10] Comparative study on risk prediction model of type 2 diabetes based on machine learning theory: a cross-sectional study
    Wang, Shu
    Chen, Rong
    Wang, Shuang
    Kong, Danli
    Cao, Rudai
    Lin, Chunwen
    Luo, Ling
    Huang, Jialu
    Zhang, Qiaoli
    Yu, Haibing
    Ding, Yuan Lin
    BMJ OPEN, 2023, 13 (08):