Integration of protein context improves protein-based COVID-19 patient stratification

Cited by: 3
Authors
Gao, Jinlong [1, 2]
He, Jiale [1, 2]
Zhang, Fangfei [1, 2]
Xiao, Qi [1, 2]
Cai, Xue [1, 2]
Yi, Xiao [1, 2]
Zheng, Siqi [1, 2]
Zhang, Ying [3]
Wang, Donglian [3]
Zhu, Guangjun [3]
Wang, Jing [3]
Shen, Bo [3]
Ralser, Markus [4, 5, 6, 7]
Guo, Tiannan [1, 2]
Zhu, Yi [1, 2]
Affiliations
[1] Westlake Univ, Sch Life Sci, Westlake Lab Life Sci & Biomed, Key Lab Struct Biol Zhejiang Prov, Hangzhou, Zhejiang, Peoples R China
[2] Westlake Inst Adv Study, Inst Basic Med Sci, Hangzhou, Zhejiang, Peoples R China
[3] Wenzhou Med Univ, Taizhou Hosp, Linhai, Zhejiang, Peoples R China
[4] Francis Crick Inst, Mol Biol Metab Lab, London, England
[5] Charite Univ Med Berlin, Dept Biochem, Berlin, Germany
[6] Free Univ Berlin, Berlin, Germany
[7] Humboldt Univ, Berlin, Germany
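Funding
National Natural Science Foundation of China; Wellcome Trust (UK); National Key Research and Development Program of China
Keywords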
COVID-19; Severe cases; Proteomics; Protein complex; Stoichiometric ratio; INFLAMMATION; NETWORK; DISEASE;
DOI
10.1186/s12014-022-09370-0
Chinese Library Classification number
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
Background: Classification of disease severity is crucial for the management of COVID-19. Several studies have shown that individual proteins can be used to classify the severity of COVID-19. Here, we investigated whether integrating four types of protein context data (protein complexes, stoichiometric ratios, pathways, and network degrees) improves the severity classification of COVID-19.
Methods: We performed machine learning on three previously published datasets. The first was a SWATH (sequential window acquisition of all theoretical fragment ion spectra) MS (mass spectrometry)-based proteomic dataset. The second was a TMTpro 16plex-labeled shotgun proteomics dataset. The third was a SWATH dataset from an independent patient cohort.
Results: In addition to twelve proteins, machine learning prioritized two complexes, one stoichiometric ratio, five pathways, and five network degrees, resulting in a 25-feature panel. A model based on these 25 features classified severe cases effectively, with an AUC of 0.965, outperforming the protein-only models. Complement component C9, transthyretin (TTR) and the TTR-RBP (transthyretin-retinol binding protein) complex, the stoichiometric ratio of SAA2 (serum amyloid A protein 2)/YLPM1 (YLP Motif Containing 1), and the network degrees of SIRT7 (Sirtuin 7) and A2M (alpha-2-macroglobulin) were highlighted as potential markers by this classifier. The classifier was further validated with a TMT-based proteomic dataset from the same cohort (test dataset 1) and an independent SWATH-based proteomic dataset from Germany (test dataset 2), reaching AUCs of 0.900 and 0.908, respectively. Machine learning models integrating protein context information achieved higher AUCs than models with only one feature type.
Conclusion: Our results show that integrating protein context, including protein complexes, stoichiometric ratios, pathways, and network degrees, with protein abundances improves phenotype prediction.
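As a rough illustration of two ideas named in the abstract, the sketch below derives a stoichiometric-ratio feature from two protein abundances and scores it with a rank-based AUC. It is not the authors' pipeline: the log2 form of the ratio, the protein names "A" and "B", the helper function names, and all abundance values are assumptions made up for this example.

```python
from math import log2


def stoichiometric_ratio(abundance_a, abundance_b):
    """Log2 ratio of two protein abundances from the same sample.

    Using log2 is an assumption of this sketch, not a detail from the abstract.
    """
    return log2(abundance_a / abundance_b)


def auc(scores, labels):
    """Rank-based AUC: fraction of (positive, negative) pairs in which the
    positive sample has the higher score; ties count as half a win."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative sample")
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in pos
        for n in neg
    )
    return wins / (len(pos) * len(neg))


# Toy, made-up abundances for two hypothetical proteins (NOT study data).
samples = [
    {"A": 8.0, "B": 1.0, "severe": 1},
    {"A": 4.0, "B": 1.0, "severe": 1},
    {"A": 2.0, "B": 2.0, "severe": 0},
    {"A": 8.0, "B": 2.0, "severe": 0},
]

ratios = [stoichiometric_ratio(s["A"], s["B"]) for s in samples]
labels = [s["severe"] for s in samples]
print(auc(ratios, labels))  # 0.875 for this toy data
```

In a setting like the one the abstract describes, such a ratio would be one column in a feature table alongside protein abundances and the other context features, and the AUC would be computed on held-out test data rather than on the samples used to build the feature.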
Pages: 13
Related papers
50 in total
  • [1] Integration of protein context improves protein-based COVID-19 patient stratification
    Jinlong Gao
    Jiale He
    Fangfei Zhang
    Qi Xiao
    Xue Cai
    Xiao Yi
    Siqi Zheng
    Ying Zhang
    Donglian Wang
    Guangjun Zhu
    Jing Wang
    Bo Shen
    Markus Ralser
    Tiannan Guo
    Yi Zhu
    Clinical Proteomics, 2022, 19
  • [2] Protein-based lateral flow assays for COVID-19 detection
    Mahmoudinobar, Farbod
    Britton, Dustin
    Montclare, Jin Kim
    PROTEIN ENGINEERING DESIGN & SELECTION, 2021, 34
  • [3] Formulation Development of a COVID-19 Recombinant Spike Protein-Based Vaccine
    Xiao, Emily
    Mirabel, Clementine
    Clenet, Didier
    Zhu, Shaolong
    James, Andrew
    Ettorre, Luciano
    Williams, Trevor
    Szeto, Jason
    Rahman, Nausheen
    Ausar, Salvador Fernando
    VACCINES, 2024, 12 (08)
  • [4] COVID-19 Diagnostic Strategies Part II: Protein-Based Technologies
    Shaffaf, Tina
    Ghafar-Zadeh, Ebrahim
    BIOENGINEERING-BASEL, 2021, 8 (05)
  • [5] A Review of Protein-Based COVID-19 Vaccines: From Monovalent to Multivalent Formulations
    Qian, Gui
    Gao, Cuige
    Zhang, Miaomiao
    Chen, Yuanxin
    Xie, Liangzhi
    VACCINES, 2024, 12 (06)
  • [6] Unfolded protein response in the COVID-19 context
    Barabutis, Nektarios
    AGING AND HEALTH RESEARCH, 2021, 1 (01)
  • [7] Protein Posttranslational Signatures Identified in COVID-19 Patient Plasma
    Vedula, Pavan
    Tang, Hsin-Yao
    Speicher, David W.
    Kashina, Anna
    FRONTIERS IN CELL AND DEVELOPMENTAL BIOLOGY, 2022, 10
  • [8] Fusion protein-based COVID-19 vaccines exemplified by a chimeric vaccine based on a single fusion protein (W-PreS-O)
    Gattinger, Pia
    Kozlovskaya, Luibov I.
    Lunin, Alexander S.
    Gancharova, Olga S.
    Sirazova, Dina I.
    Apolokhov, Vasiliy D.
    Chekina, Egor S.
    Gordeychuk, Ilya V.
    Karaulov, Alexander V.
    Valenta, Rudolf
    Ishmukhametov, Aydar A.
    FRONTIERS IN IMMUNOLOGY, 2025, 16
  • [9] Reactogenicity Differences between Adjuvanted, Protein-Based and Messenger Ribonucleic Acid (mRNA)-Based COVID-19 Vaccines
    Rousculp, Matthew D.
    Hollis, Kelly
    Ziemiecki, Ryan
    Odom, Dawn
    Marchese, Anthony M.
    Montazeri, Mitra
    Odak, Shardul
    Jackson, Laurin
    Beyhaghi, Hadi
    Toback, Seth
    VACCINES, 2024, 12 (07)
  • [10] Pre-Clinical Safety and Immunogenicity Study of a Coronavirus Protein-Based Subunit Vaccine for COVID-19
    Shorayeva, Kamshat
    Nakhanov, Aziz
    Nurpeisova, Ainur
    Chervyakova, Olga
    Jekebekov, Kuanysh
    Abay, Zhandos
    Assanzhanova, Nurika
    Sadikaliyeva, Sandugash
    Kalimolda, Elina
    Terebay, Aibol
    Moldagulova, Sabina
    Absatova, Zharkinay
    Tulendibayev, Ali
    Kopeyev, Syrym
    Nakhanova, Gulnur
    Issabek, Aisha
    Nurabayev, Sergazy
    Kerimbayev, Aslan
    Kutumbetov, Lespek
    Abduraimov, Yergali
    Kassenov, Markhabat
    Orynbayev, Mukhit
    Zakarya, Kunsulu
    Tortorella, Domenico
    VACCINES, 2023, 11 (12)