Poly-omic risk scores predict inflammatory bowel disease diagnosis

被引:6
作者
Arehart, Christopher H. [1 ,2 ,3 ]
Sterrett, John D. [1 ,4 ]
Garris, Rosanna L. [1 ,5 ]
Quispe-Pilco, Ruth E. [1 ,2 ]
Gignoux, Christopher R. [6 ]
Evans, Luke M. [2 ,3 ]
Stanislawski, Maggie A. [6 ]
机构
[1] Univ Colorado, Interdisciplinary Quantitat Biol PhD Program, Boulder, CO 80309 USA
[2] Univ Colorado, Dept Ecol & Evolutionary Biol, Boulder, CO USA
[3] Univ Colorado, Inst Behav Genet, Boulder, CO USA
[4] Univ Colorado, Dept Integrat Physiol, Boulder, CO 80309 USA
[5] Univ Colorado, Dept Biochem, Boulder, CO USA
[6] Univ Colorado Anschutz Med Campus, Dept Biomed Informat, Aurora, CO USA
基金
美国国家科学基金会;
关键词
omics; inflammatory bowel disease; gut microbiome; metabolomics; metatranscriptomics; viromics; multi-omics; T-CELLS; PREVALENCE; ACIDS;
D O I
10.1128/msystems.00677-23
中图分类号
Q93 [微生物学];
学科分类号
071005 ; 100705 ;
摘要
Inflammatory bowel disease (IBD) is characterized by complex etiology and a disrupted colonic ecosystem. We provide a framework for the analysis of multi-omic data, which we apply to study the gut ecosystem in IBD. Specifically, we train and validate models using data on the metagenome, metatranscriptome, virome, and metabolome from the Human Microbiome Project 2 IBD multi-omic database, with 1,785 repeated samples from 130 individuals (103 cases and 27 controls). After splitting the participants into training and testing groups, we used mixed-effects least absolute shrinkage and selection operator regression to select features for each omic. These features, with demographic covariates, were used to generate separate single-omic prediction scores. All four single-omic scores were then combined into a final regression to assess the relative importance of the individual omics and the predictive benefits when considered together. We identified several species, pathways, and metabolites known to be associated with IBD risk, and we explored the connections between data sets. Individually, metabolomic and viromic scores were more predictive than metagenomics or metatranscriptomics, and when all four scores were combined, we predicted disease diagnosis with a Nagelkerke's R2 of 0.46 and an area under the curve of 0.80 (95% confidence interval: 0.63, 0.98). Our work supports that some single-omic models for complex traits are more predictive than others, that incorporating multiple omic data sets may improve prediction, and that each omic data type provides a combination of unique and redundant information. This modeling framework can be extended to other complex traits and multi-omic data sets.IMPORTANCEComplex traits are characterized by many biological and environmental factors, such that multi-omic data sets are well-positioned to help us understand their underlying etiologies. We applied a prediction framework across multiple omics (metagenomics, metatranscriptomics, metabolomics, and viromics) from the gut ecosystem to predict inflammatory bowel disease (IBD) diagnosis. The predicted scores from our models highlighted key features and allowed us to compare the relative utility of each omic data set in single-omic versus multi-omic models. Our results emphasized the importance of metabolomics and viromics over metagenomics and metatranscriptomics for predicting IBD status. The greater predictive capability of metabolomics and viromics is likely because these omics serve as markers of lifestyle factors such as diet. This study provides a modeling framework for multi-omic data, and our results show the utility of combining multiple omic data types to disentangle complex disease etiologies and biological signatures. Complex traits are characterized by many biological and environmental factors, such that multi-omic data sets are well-positioned to help us understand their underlying etiologies. We applied a prediction framework across multiple omics (metagenomics, metatranscriptomics, metabolomics, and viromics) from the gut ecosystem to predict inflammatory bowel disease (IBD) diagnosis. The predicted scores from our models highlighted key features and allowed us to compare the relative utility of each omic data set in single-omic versus multi-omic models. Our results emphasized the importance of metabolomics and viromics over metagenomics and metatranscriptomics for predicting IBD status. The greater predictive capability of metabolomics and viromics is likely because these omics serve as markers of lifestyle factors such as diet. This study provides a modeling framework for multi-omic data, and our results show the utility of combining multiple omic data types to disentangle complex disease etiologies and biological signatures.
引用
收藏
页数:17
相关论文
共 56 条
[1]   The global, regional, and national burden of inflammatory bowel disease in 195 countries and territories, 1990-2017: a systematic analysis for the Global Burden of Disease Study 2017 [J].
Alatab, Sudabeh ;
Sepanlou, Sadaf G. ;
Ikuta, Kevin ;
Vahedi, Homayoon ;
Bisignano, Catherine ;
Safiri, Saeid ;
Sadeghi, Anahita ;
Nixon, Molly R. ;
Abdoli, Amir ;
Abolhassani, Hassan ;
Alipour, Vahid ;
Almadi, Majid A. H. ;
Almasi-Hashiani, Amir ;
Anushiravani, Amir ;
Arabloo, Jalal ;
Atique, Suleman ;
Awasthi, Ashish ;
Badawi, Alaa ;
Baig, Atif A. A. ;
Bhala, Neeraj ;
Bijani, Ali ;
Biondi, Antonio ;
Borzi, Antonio M. ;
Burke, Kristin E. ;
Carvalho, Felix ;
Daryani, Ahmad ;
Dubey, Manisha ;
Eftekhari, Aziz ;
Fernandes, Eduarda ;
Fernandes, Joao C. ;
Fischer, Florian ;
Haj-Mirzaian, Arvin ;
Haj-Mirzaian, Arya ;
Hasanzadeh, Amir ;
Hashemian, Maryam ;
Hay, Simon, I ;
Hoang, Chi L. ;
Househ, Mowafa ;
Ilesanmi, Olayinka S. ;
Balalami, Nader Jafari ;
James, Spencer L. ;
Kengne, Andre P. ;
Malekzadeh, Masoud M. ;
Merat, Shahin ;
Meretoja, Tuomo J. ;
Mestrovic, Tomislav ;
Mirrakhimov, Erkin M. ;
Mirzaei, Hamed ;
Mohammad, Karzan A. ;
Mokdad, Ali H. .
LANCET GASTROENTEROLOGY & HEPATOLOGY, 2020, 5 (01) :17-30
[2]   Cutting edge:: Human γδ T cells are activated by intermediates of the 2-C-methyl-D-erythritol 4-phosphate pathway of isoprenoid biosynthesis [J].
Altincicek, B ;
Moll, J ;
Campos, N ;
Foerster, G ;
Beck, E ;
Hoeffler, JF ;
Grosdemange-Billiard, C ;
Rodríguez-Concepción, M ;
Rohmer, M ;
Boronat, A ;
Eberl, M ;
Jomaa, H .
JOURNAL OF IMMUNOLOGY, 2001, 166 (06) :3655-3658
[3]   Fitting Linear Mixed-Effects Models Using lme4 [J].
Bates, Douglas ;
Maechler, Martin ;
Bolker, Benjamin M. ;
Walker, Steven C. .
JOURNAL OF STATISTICAL SOFTWARE, 2015, 67 (01) :1-48
[4]   Asthma, Type 1 and Type 2 Diabetes Mellitus, and Inflammatory Bowel Disease amongst South Asian Immigrants to Canada and Their Children: A Population-Based Cohort Study [J].
Benchimol, Eric I. ;
Manuel, Douglas G. ;
To, Teresa ;
Mack, David R. ;
Nguyen, Geoffrey C. ;
Gommerman, Jennifer L. ;
Croitoru, Kenneth ;
Mojaverian, Nassim ;
Wang, Xuesong ;
Quach, Pauline ;
Guttmann, Astrid .
PLOS ONE, 2015, 10 (04)
[5]   Utilizing machine learning with knockoff filtering to extract significant metabolites in Crohn's disease with a publicly available untargeted metabolomics dataset [J].
Bin Masud, Shoaib ;
Jenkins, Conor ;
Hussey, Erika ;
Elkin-Frankston, Seth ;
Mach, Phillip ;
Dhummakupt, Elizabeth ;
Aeron, Shuchin .
PLOS ONE, 2021, 16 (07)
[6]   Multi-"-Omics" Profiling in Patients With Quiescent Inflammatory Bowel Disease Identifies Biomarkers Predicting Relapse [J].
Borren, Nienke Z. ;
Plichta, Damian ;
Joshi, Amit D. ;
Bonilla, Gracia ;
Sadreyev, Ruslan ;
Vlamakis, Hera ;
Xavier, Ramnik J. ;
Ananthakrishnan, Ashwin N. .
INFLAMMATORY BOWEL DISEASES, 2020, 26 (10) :1524-1532
[7]   Microbial pathways in colonic sulfur metabolism and links with health and disease [J].
Carbonero, Franck ;
Benefiel, Ann C. ;
Alizadeh-Ghamsari, Amir H. ;
Gaskins, H. Rex .
FRONTIERS IN PHYSIOLOGY, 2012, 3
[8]   Identifying environmental risk factors for inflammatory bowel diseases: a Mendelian randomization study [J].
Carreras-Torres, Robert ;
Ibanez-Sanz, Gemma ;
Obon-Santacana, Mireia ;
Duell, Eric J. ;
Moreno, Victor .
SCIENTIFIC REPORTS, 2020, 10 (01)
[9]   Gammadelta T Cells in Crohn's Disease: A New Player in the Disease Pathogenesis? [J].
Catalan-Serra, Ignacio ;
Sandvik, Arne Kristian ;
Bruland, Torunn ;
Carlos Andreu-Ballester, Juan .
JOURNAL OF CROHNS & COLITIS, 2017, 11 (09) :1135-1145
[10]   Estimation and partitioning of (co)heritability of inflammatory bowel disease from GWAS and immunochip data [J].
Chen, Guo-Bo ;
Lee, Sang Hong ;
Brion, Marie-Jo A. ;
Montgomery, Grant W. ;
Wray, Naomi R. ;
Radford-Smith, Graham L. ;
Visscher, Peter M. .
HUMAN MOLECULAR GENETICS, 2014, 23 (17) :4710-4720