Machine Learning Approach to Metabolomic Data Predicts Type 2 Diabetes Mellitus Incidence

被引:3
|
作者
Leiherer, Andreas [1 ,2 ,3 ]
Muendlein, Axel [1 ]
Mink, Sylvia [2 ,3 ]
Mader, Arthur [1 ,4 ]
Saely, Christoph H. [1 ,3 ,4 ]
Festa, Andreas [1 ]
Fraunberger, Peter [2 ,3 ]
Drexel, Heinz [1 ,3 ,5 ,6 ]
机构
[1] Vorarlberg Inst Vasc Invest & Treatment VIVIT, A-6800 Feldkirch, Austria
[2] Cent Med Labs, A-6800 Feldkirch, Austria
[3] Private Univ Principal Liechtenstein, Fac Med Sci, FL-9495 Triesen, Liechtenstein
[4] Acad Teaching Hosp Feldkirch, Dept Internal Med 3, A-6800 Feldkirch, Austria
[5] Acad Teaching Hosp Feldkirch, Vorarlberger Landeskrankenhausbetriebsgesell, A-6800 Feldkirch, Austria
[6] Drexel Univ, Coll Med, Philadelphia, PA 19129 USA
关键词
ML; machine learning; artificial intelligence; diabetes; incidence; metabolomics; support vector machine; accuracy; CERAMIDES; MODEL;
D O I
10.3390/ijms25105331
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Metabolomics, with its wealth of data, offers a valuable avenue for enhancing predictions and decision-making in diabetes. This observational study aimed to leverage machine learning (ML) algorithms to predict the 4-year risk of developing type 2 diabetes mellitus (T2DM) using targeted quantitative metabolomics data. A cohort of 279 cardiovascular risk patients who underwent coronary angiography and who were initially free of T2DM according to American Diabetes Association (ADA) criteria was analyzed at baseline, including anthropometric data and targeted metabolomics, using liquid chromatography (LC)-mass spectroscopy (MS) and flow injection analysis (FIA)-MS, respectively. All patients were followed for four years. During this time, 11.5% of the patients developed T2DM. After data preprocessing, 362 variables were used for ML, employing the Caret package in R. The dataset was divided into training and test sets (75:25 ratio) and we used an oversampling approach to address the classifier imbalance of T2DM incidence. After an additional recursive feature elimination step, identifying a set of 77 variables that were the most valuable for model generation, a Support Vector Machine (SVM) model with a linear kernel demonstrated the most promising predictive capabilities, exhibiting an F1 score of 50%, a specificity of 93%, and balanced and unbalanced accuracies of 72% and 88%, respectively. The top-ranked features were bile acids, ceramides, amino acids, and hexoses, whereas anthropometric features such as age, sex, waist circumference, or body mass index had no contribution. In conclusion, ML analysis of metabolomics data is a promising tool for identifying individuals at risk of developing T2DM and opens avenues for personalized and early intervention strategies.
引用
收藏
页数:14
相关论文
共 50 条
  • [32] Use of statins and the incidence of type 2 diabetes mellitus
    Bernardi, Andre
    Rocha, Viviane Zorzanelli
    Faria-Neto, Jose Rocha
    REVISTA DA ASSOCIACAO MEDICA BRASILEIRA, 2015, 61 (04): : 375 - 380
  • [33] Periodontal disease and incidence of type 2 diabetes mellitus
    Joshipura, Kaumudi J.
    Li, Tricia Y.
    Perez, Cynthia
    Hu, Frank
    DIABETES, 2008, 57 : A257 - A257
  • [34] Optimized Machine Learning Approach for the Prediction of Diabetes-Mellitus
    Challa, Manoj
    Chinnaiyan, R.
    COMPUTATIONAL VISION AND BIO-INSPIRED COMPUTING, 2020, 1108 : 321 - 328
  • [35] Diabetes Type 2: Poincare Data Preprocessing for Quantum Machine Learning
    Sierra-Sosa, Daniel
    Arcila-Moreno, Juan D.
    Garcia-Zapirain, Begonya
    Elmaghraby, Adel
    CMC-COMPUTERS MATERIALS & CONTINUA, 2021, 67 (02): : 1849 - 1861
  • [36] Identifying diagnostic indicators for type 2 diabetes mellitus from physical examination using interpretable machine learning approach
    Lv, Xiang
    Luo, Jiesi
    Huang, Wei
    Guo, Hui
    Bai, Xue
    Yan, Pijun
    Jiang, Zongzhe
    Zhang, Yonglin
    Jing, Runyu
    Chen, Qi
    Li, Menglong
    FRONTIERS IN ENDOCRINOLOGY, 2024, 15
  • [37] Roles of Anxiety and Depression in Predicting Cardiovascular Disease Among Patients With Type 2 Diabetes Mellitus: A Machine Learning Approach
    Chu, Haiyun
    Chen, Lu
    Yang, Xiuxian
    Qiu, Xiaohui
    Qiao, Zhengxue
    Song, Xuejia
    Zhao, Erying
    Zhou, Jiawei
    Zhang, Wenxin
    Mehmood, Anam
    Pan, Hui
    Yang, Yanjie
    FRONTIERS IN PSYCHOLOGY, 2021, 12
  • [38] Using Machine Learning to Predict CKD upon Type 2 Diabetes Mellitus Diagnosis
    Allen, Angier O.
    Iqbal, Zohora
    Green-Saxena, Abigail
    Das, Ritankar
    JOURNAL OF THE AMERICAN SOCIETY OF NEPHROLOGY, 2021, 32 (10): : 268 - 268
  • [39] Machine-learning algorithms in screening for type 2 diabetes mellitus: Data from Fasa Adults Cohort Study
    Karmand, Hanieh
    Andishgar, Aref
    Tabrizi, Reza
    Sadeghi, Alireza
    Pezeshki, Babak
    Ravankhah, Mahdi
    Taherifard, Erfan
    Ahmadizar, Fariba
    ENDOCRINOLOGY DIABETES & METABOLISM, 2024, 7 (02)
  • [40] An Osteopathic Approach to Type 2 Diabetes Mellitus
    Shubrook, Jay H., Jr.
    Johnson, Amy W.
    JOURNAL OF THE AMERICAN OSTEOPATHIC ASSOCIATION, 2011, 111 (09): : 531 - 537