MULTICATEGORY CLASSIFICATION METHODS;
HUMAN GUT MICROBIOME;
COMPREHENSIVE EVALUATION;
FECAL MICROBIOTA;
GENE-EXPRESSION;
VALIDATION;
PREDICTION;
REGRESSION;
SELECTION;
D O I:
10.1371/journal.pcbi.1004977
中图分类号:
Q5 [生物化学];
学科分类号:
071010 ;
081704 ;
摘要:
Shotgun metagenomic analysis of the human associated microbiome provides a rich set of microbial features for prediction and biomarker discovery in the context of human diseases and health conditions. However, the use of such high-resolution microbial features presents new challenges, and validated computational tools for learning tasks are lacking. Moreover, classification rules have scarcely been validated in independent studies, posing questions about the generality and generalization of disease-predictive models across cohorts. In this paper, we comprehensively assess approaches to metagenomics-based prediction tasks and for quantitative assessment of the strength of potential microbiome-phenotype associations. We develop a computational framework for prediction tasks using quantitative microbiome profiles, including species-level relative abundances and presence of strain-specific markers. A comprehensive meta-analysis, with particular emphasis on generalization across cohorts, was performed in a collection of 2424 publicly available metagenomic samples from eight large-scale studies. Cross-validation revealed good disease-prediction capabilities, which were in general improved by feature selection and use of strain-specific markers instead of species-level taxonomic abundance. In cross-study analysis, models transferred between studies were in some cases less accurate than models tested by within-study cross-validation. Interestingly, the addition of healthy (control) samples from other studies to training sets improved disease prediction capabilities. Some microbial species (most notably Streptococcus anginosus) seem to characterize general dysbiotic states of the microbiome rather than connections with a specific disease. Our results in modelling features of the "healthy" microbiome can be considered a first step toward defining general microbial dysbiosis. The software framework, microbiome profiles, and metadata for thousands of samples are publicly available at http://segatalab.cibio.unitn.it/tools/metaml.
机构:
Univ Utah, Coll Hlth, Dept Nutr & Integrat Physiol, Salt Lake City, UT 84112 USA
Univ Utah, Dept Internal Med, Div Nephrol & Hypertens, Salt Lake City, UT 84132 USAUniv Utah, Coll Hlth, Dept Nutr & Integrat Physiol, Salt Lake City, UT 84112 USA
Aalami, Amir Hossein
Rahimi, Mohammad
论文数: 0引用数: 0
h-index: 0
机构:
McMaster Univ, Dept Mech Engn, Hamilton, ON L8S 4L7, CanadaUniv Utah, Coll Hlth, Dept Nutr & Integrat Physiol, Salt Lake City, UT 84112 USA
Rahimi, Mohammad
Sahebkar, Amirhossein
论文数: 0引用数: 0
h-index: 0
机构:
Saveetha Univ, Saveetha Med Coll & Hosp, Saveetha Inst Med & Tech Sci, Ctr Global Hlth Res, Chennai, IndiaUniv Utah, Coll Hlth, Dept Nutr & Integrat Physiol, Salt Lake City, UT 84112 USA
机构:
Inst Potosino Invest Cient & Tecnol, Col Lomas Secc 4ta, AC Div Geociencias Aplicadas, Camino Presa San Jose 2055, San Luis Potosi 78216, Spl, MexicoInst Potosino Invest Cient & Tecnol, Col Lomas Secc 4ta, AC Div Geociencias Aplicadas, Camino Presa San Jose 2055, San Luis Potosi 78216, Spl, Mexico
Uc-Castillo, Jose Luis
Marin-Celestino, Ana Elizabeth
论文数: 0引用数: 0
h-index: 0
机构:
CONAHCYT Inst Potosino Invest Cient & Tecnol, AC Div Geociencias Aplicadas, Camino Presa San Jose 2055, San Luis Potosi 78216, Spl, MexicoInst Potosino Invest Cient & Tecnol, Col Lomas Secc 4ta, AC Div Geociencias Aplicadas, Camino Presa San Jose 2055, San Luis Potosi 78216, Spl, Mexico
Marin-Celestino, Ana Elizabeth
Martinez-Cruz, Diego Armando
论文数: 0引用数: 0
h-index: 0
机构:
CONAHCYT Ctr Invest Mat Avanzados, SC Calle CIMAV 110,Ejido Arroyo Seco,Col 15 Mayo, Durango 34147, Dgo, MexicoInst Potosino Invest Cient & Tecnol, Col Lomas Secc 4ta, AC Div Geociencias Aplicadas, Camino Presa San Jose 2055, San Luis Potosi 78216, Spl, Mexico
Martinez-Cruz, Diego Armando
Tuxpan-Vargas, Jose
论文数: 0引用数: 0
h-index: 0
机构:
CONAHCYT Inst Potosino Invest Cient & Tecnol, AC Div Geociencias Aplicadas, Camino Presa San Jose 2055, San Luis Potosi 78216, Spl, MexicoInst Potosino Invest Cient & Tecnol, Col Lomas Secc 4ta, AC Div Geociencias Aplicadas, Camino Presa San Jose 2055, San Luis Potosi 78216, Spl, Mexico
Tuxpan-Vargas, Jose
Ramos-Leal, Jose Alfredo
论文数: 0引用数: 0
h-index: 0
机构:
Inst Potosino Invest Cient & Tecnol, Col Lomas Secc 4ta, AC Div Geociencias Aplicadas, Camino Presa San Jose 2055, San Luis Potosi 78216, Spl, MexicoInst Potosino Invest Cient & Tecnol, Col Lomas Secc 4ta, AC Div Geociencias Aplicadas, Camino Presa San Jose 2055, San Luis Potosi 78216, Spl, Mexico
机构:
Chongqing Med Univ, Affiliated Hosp 2, Dept Crit Care Med, Chongqing 400010, Peoples R ChinaChongqing Med Univ, Affiliated Hosp 2, Dept Crit Care Med, Chongqing 400010, Peoples R China
Zhang, Yan
Xu, Weiwei
论文数: 0引用数: 0
h-index: 0
机构:
Chongqing Med Univ, Affiliated Hosp 2, Dept Endocrine & Metab Dis, Chongqing 400010, Peoples R ChinaChongqing Med Univ, Affiliated Hosp 2, Dept Crit Care Med, Chongqing 400010, Peoples R China
Xu, Weiwei
Yang, Ping
论文数: 0引用数: 0
h-index: 0
机构:
Chongqing Med Univ, Affiliated Hosp 2, Dept Crit Care Med, Chongqing 400010, Peoples R ChinaChongqing Med Univ, Affiliated Hosp 2, Dept Crit Care Med, Chongqing 400010, Peoples R China
Yang, Ping
Zhang, An
论文数: 0引用数: 0
h-index: 0
机构:
Chongqing Med Univ, Affiliated Hosp 2, Dept Crit Care Med, Chongqing 400010, Peoples R ChinaChongqing Med Univ, Affiliated Hosp 2, Dept Crit Care Med, Chongqing 400010, Peoples R China
机构:
Chinese Acad Sci, Inst Software, Lab Internet Software Technol, Beijing 100190, Peoples R China
Univ Chinese Acad Sci, Beijing 100049, Peoples R ChinaChinese Acad Sci, Inst Software, Lab Internet Software Technol, Beijing 100190, Peoples R China
Azeem, Muhammad Ilyas
Palomba, Fabio
论文数: 0引用数: 0
h-index: 0
机构:
Univ Zurich, Zurich, SwitzerlandChinese Acad Sci, Inst Software, Lab Internet Software Technol, Beijing 100190, Peoples R China
Palomba, Fabio
Shi, Lin
论文数: 0引用数: 0
h-index: 0
机构:
Chinese Acad Sci, Inst Software, Lab Internet Software Technol, Beijing 100190, Peoples R China
Univ Chinese Acad Sci, Beijing 100049, Peoples R ChinaChinese Acad Sci, Inst Software, Lab Internet Software Technol, Beijing 100190, Peoples R China
Shi, Lin
Wang, Qing
论文数: 0引用数: 0
h-index: 0
机构:
Chinese Acad Sci, Inst Software, Lab Internet Software Technol, Beijing 100190, Peoples R China
Univ Chinese Acad Sci, Beijing 100049, Peoples R China
Chinese Acad Sci, Inst Software, State Key Lab Comp Sci, Beijing 100190, Peoples R ChinaChinese Acad Sci, Inst Software, Lab Internet Software Technol, Beijing 100190, Peoples R China
机构:
Zhang Jiakou First Hosp, Dept Pharm, 6 Libaisi Lane,Xinhua Front St, Zhangjiakou 075000, Hebei, Peoples R ChinaZhang Jiakou First Hosp, Dept Pharm, 6 Libaisi Lane,Xinhua Front St, Zhangjiakou 075000, Hebei, Peoples R China
Zhang, Hongru
Wang, Chen
论文数: 0引用数: 0
h-index: 0
机构:
Zhang Jiakou First Hosp, Dept Pharm, 6 Libaisi Lane,Xinhua Front St, Zhangjiakou 075000, Hebei, Peoples R ChinaZhang Jiakou First Hosp, Dept Pharm, 6 Libaisi Lane,Xinhua Front St, Zhangjiakou 075000, Hebei, Peoples R China
Wang, Chen
Yang, Ning
论文数: 0引用数: 0
h-index: 0
机构:
Zhang Jiakou First Hosp, Dept Pharm, 6 Libaisi Lane,Xinhua Front St, Zhangjiakou 075000, Hebei, Peoples R ChinaZhang Jiakou First Hosp, Dept Pharm, 6 Libaisi Lane,Xinhua Front St, Zhangjiakou 075000, Hebei, Peoples R China