Machine learning approach on plasma proteomics identifies signatures associated with obesity in the KORA FF4 cohort

被引:0
作者
Niu, Jiefei [1 ,2 ,3 ]
Adam, Jonathan [1 ,2 ,4 ]
Skurk, Thomas [5 ,6 ]
Seissler, Jochen [4 ,7 ,8 ,9 ]
Dong, Qiuling [1 ,2 ,3 ]
Efiong, Esienanwan [1 ,2 ,10 ,11 ]
Gieger, Christian [1 ,2 ,4 ]
Peters, Annette [2 ,4 ,12 ]
Sharma, Sapna [1 ,2 ,4 ]
Grallert, Harald [1 ,2 ,4 ]
机构
[1] Helmholtz Zentrum Munchen, Res Unit Mol Epidemiol, D-85764 Neuherberg, Germany
[2] Helmholtz Zentrum Munchen, Inst Epidemiol, D-85764 Neuherberg, Germany
[3] Ludwig Maximilians Univ Munchen, Fac Med, Munich, Germany
[4] German Ctr Diabet Res DZD, Neuherberg, Germany
[5] Tech Univ Munich, Sch Med, Munich, Germany
[6] Tech Univ Munich, ZIEL Inst Food & Hlth, Core Facil Human Studies, Freising Weihenstephan, Germany
[7] Klinikum Ludwig Maximilians Univ, Med Klin & Poliklin 4, Munich, Germany
[8] Ludwig Maximilians Univ Munchen, Clin Cooperat Grp Diabet, Munich, Germany
[9] Helmholtz Zentrum Munchen, Munich, Germany
[10] Fac Pharmaceut Biomed & Vet Sci, Dept Pharmaceut Sci, Campus Drie Eiken,Univ Pl 1, Antwerp, Belgium
[11] Fed Univ Lafia, Fac Sci, Dept Phys, Lafia, Nigeria
[12] Ludwig Maximilians Univ Munchen, Inst Med Informat Proc Biometry & Epidemiol IBE, Fac Med, Munich, Germany
关键词
cohort study; machine learning; obesity; proteomics; BODY-MASS INDEX; MENDELIAN RANDOMIZATION;
D O I
10.1111/dom.16264
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Aims: This study investigated the role of plasma proteins in obesity to identify predictive biomarkers and explore underlying biological mechanisms. Methods: In the Cooperative Health Research in the Region of Augsburg (KORA) FF4 study, 809 proteins were measured in 2045 individuals (564 obese and 1481 non-obese). Multivariate logistic regression adjusted for confounders (basic and full models) was used to identify obesity-associated proteins. Priority-Lasso was applied for feature selection, followed by machine learning models (support vector machine [SVM], random forest [RF], k-nearest neighbour [KNN] and adaptive boosting [Adaboost]) for prediction. Correlation and enrichment analyses were performed to elucidate relationships between protein biomarkers, obesity risk factors and perturbed pathways. Mendelian randomisation (MR) assessed causal links between proteins and obesity. Results: A total of 16 proteins were identified as significantly associated with obesity through multivariable logistic regression in the basic model and subsequent Priority-Lasso analysis. Enrichment analyses highlighted immune response, lipid metabolism and inflammation regulation were linked to obesity. Machine learning models demonstrated robust predictive performance with area under the curves (AUC) of 0.820 (SVM), 0.805 (RF), 0.791 (KNN) and 0.819 (Adaboost). All 16 proteins correlated with obesity-related risk factors such as blood pressure and lipid levels. MR analysis identified AFM, CRP and CFH as causal and potentially modifiable proteins. Conclusions: The protein signatures identified in our study showed promising predictive potential for obesity. These findings warrant further investigation to evaluate their clinical applicability, offering insights into obesity prevention and treatment strategies.
引用
收藏
页码:2626 / 2636
页数:11
相关论文
共 46 条
  • [1] Roberto C.A., Swinburn B., Hawkes C., Et al., Patchy progress on obesity prevention: emerging examples, entrenched barriers, and new thinking, Lancet, 385, 9985, pp. 2400-2409, (2015)
  • [2] Jaacks L.M., Vandevijvere S., Pan A., Et al., The obesity transition: stages of the global epidemic, Lancet Diabetes Endocrinol, 7, 3, pp. 231-240, (2019)
  • [3] Trends in adult body-mass index in 200 countries from 1975 to 2014: a pooled analysis of 1698 population-based measurement studies with 19.2 million participants, Lancet, 387, 10026, pp. 1377-1396, (2016)
  • [4] Prospective Studies C., Whitlock G., Lewington S., Et al., Body-mass index and cause-specific mortality in 900 000 adults: collaborative analyses of 57 prospective studies, Lancet, 373, 9669, pp. 1083-1096, (2009)
  • [5] Swinburn B.A., Sacks G., Hall K.D., Et al., The global obesity pandemic: shaped by global drivers and local environments, Lancet, 378, 9793, pp. 804-814, (2011)
  • [6] Shah A.M., Myhre P.L., Arthur V., Et al., Large scale plasma proteomics identifies novel proteins and protein networks associated with heart failure development, Nat Commun, 15, 1, (2024)
  • [7] Swinney D.C., Anthony J., How were new medicines discovered?, Nat Rev Drug Discov, 10, 7, pp. 507-519, (2011)
  • [8] Kang C., Lee Y., Lee J.E., Recent advances in mass spectrometry-based proteomics of gastric cancer, World J Gastroenterol, 22, 37, pp. 8283-8293, (2016)
  • [9] Doerr A., Mass spectrometry-based targeted proteomics, Nat Methods, 10, 1, (2013)
  • [10] Geyer P.E., Wewer Albrechtsen N.J., Tyanova S., Et al., Proteomics reveals the effects of sustained weight loss on the human plasma proteome, Mol Syst Biol, 12, 12, (2016)