Interpretable and accurate prediction models for metagenomics data

被引:32
作者
Prifti, Edi [1 ,2 ]
Chevaleyre, Yann [3 ]
Hanczar, Blaise [4 ]
Belda, Eugeni [2 ]
Danchin, Antoine [5 ]
Clement, Karine [6 ,7 ]
Zucker, Jean-Daniel [1 ,2 ,6 ]
机构
[1] Sorbonne Univ, UMMISCO, IRD, 32 Ave Henri Varagnat, F-93143 Bondy, France
[2] ICAN, Inst Cardiometab & Nutr, Integrom, 91 Blvd Hop, F-75013 Paris, France
[3] PSL Res Univ, Paris Dauphine Univ, CNRS, LAMSADE,UMR 7243, Pl Mal Lattre Tassigny, F-75016 Paris, France
[4] Univ Evry, Univ Paris Saclay, IBISC, 23 Blvd France, F-91034 Evry, France
[5] Univ Paris 05, CNRS, INSERM, Inst Cochin,UMR8104,U1016, 24 Rue Faubourg St Jacques, F-75014 Paris, France
[6] Sorbonne Univ, INSERM, Nutr & Obesities Syst Approach Res Unit NutriOm, 91 Blvd Hop, F-75013 Paris, France
[7] Hop La Pitie Salpetriere, AP HP, Nutr Dept, CRNH Ile France, 91 Blvd Hop, F-75013 Paris, France
来源
GIGASCIENCE | 2020年 / 9卷 / 03期
关键词
prediction; interpretable models; metagenomics biomarkers; microbial ecosystems; HUMAN GUT MICROBIOME; ARTIFICIAL-INTELLIGENCE; MODULATION; SIGNATURES; IMMUNITY; FUTURE; NOV;
D O I
10.1093/gigascience/giaa010
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background: Microbiome biomarker discovery for patient diagnosis, prognosis, and risk evaluation is attracting broad interest. Selected groups of microbial features provide signatures that characterize host disease states such as cancer or cardio-metabolic diseases. Yet, the current predictive models stemming from machine learning still behave as black boxes and seldom generalize well. Their interpretation is challenging for physicians and biologists, which makes them difficult to trust and use routinely in the physician-patient decision-making process. Novel methods that provide interpretability and biological insight are needed. Here, we introduce "predomics", an original machine learning approach inspired by microbial ecosystem interactions that is tailored for metagenomics data. It discovers accurate predictive signatures and provides unprecedented interpretability. The decision provided by the predictive model is based on a simple, yet powerful score computed by adding, subtracting, or dividing cumulative abundance of microbiome measurements. Results: Tested on >100 datasets, we demonstrate that predomics models are simple and highly interpretable. Even with such simplicity, they are at least as accurate as state-of-the-art methods. The family of best models, discovered during the learning process, offers the ability to distil biological information and to decipher the predictability signatures of the studied condition. In a proof-of-concept experiment, we successfully predicted body corpulence and metabolic improvement after bariatric surgery using pre-surgery microbiome data. Conclusions: Predomics is a new algorithm that helps in providing reliable and trustworthy diagnostic decisions in the microbiome field. Predomics is in accord with societal and legal requirements that plead for an explainable artificial intelligence approach in the medical field.
引用
收藏
页数:11
相关论文
共 48 条
[1]   3off2: A network reconstruction algorithm based on 2-point and 3-point information statistics [J].
Affeldt, Severine ;
Verny, Louis ;
Isambert, Herve .
BMC BIOINFORMATICS, 2016, 17
[2]   Major microbiota dysbiosis in severe obesity: fate after bariatric surgery [J].
Aron-Wisnewsky, Judith ;
Prifti, Edi ;
Belda, Eugeni ;
Ichou, Farid ;
Kayser, Brandon D. ;
Dao, Maria Carlota ;
Verger, Eric O. ;
Hedjazi, Lyamine ;
Bouillot, Jean-Luc ;
Chevallier, Jean-Marc ;
Pons, Nicolas ;
Le Chatelier, Emmanuelle ;
Levenez, Florence ;
Ehrlich, Stanislav Dusko ;
Dore, Joel ;
Zucker, Jean-Daniel ;
Clement, Karine .
GUT, 2019, 68 (01) :70-82
[3]   Altered profile of human gut microbiome is associated with cirrhosis and its complications [J].
Bajaj, Jasmohan S. ;
Heuman, Douglas M. ;
Hylemon, Phillip B. ;
Sanyal, Arun J. ;
White, Melanie B. ;
Monteith, Pamela ;
Noble, Nicole A. ;
Unser, Ariel B. ;
Daita, Kalyani ;
Fisher, Andmorgan R. ;
Sikaroodi, Masoumeh ;
Gillevet, Patrick M. .
JOURNAL OF HEPATOLOGY, 2014, 60 (05) :940-947
[4]   Establishment and evaluation of prediction model for multiple disease classification based on gut microbial data [J].
Bang, Sohyun ;
Yoo, DongAhn ;
Kim, Soo-Jin ;
Jhang, Soyun ;
Cho, Seoae ;
Kim, Heebal .
SCIENTIFIC REPORTS, 2019, 9 (1)
[5]  
Begum D., 2019, bioRxiv, P816090, DOI DOI 10.1101/816090
[6]   Artificial intelligence in medicine: current trends and future possibilities [J].
Buch, Varun H. ;
Ahmed, Irfan ;
Maruthappu, Mahiben .
BRITISH JOURNAL OF GENERAL PRACTICE, 2018, 68 (668) :143-144
[7]   Exploring chemoselective S-to-N acyl transfer reactions in synthesis and chemical biology [J].
Burke, Helen M. ;
McSweeney, Lauren ;
Scanlan, Eoin M. .
NATURE COMMUNICATIONS, 2017, 8
[8]   THE ASSEMBLY OF SPECIES COMMUNITIES - CHANCE OR COMPETITION [J].
CONNOR, EF ;
SIMBERLOFF, D .
ECOLOGY, 1979, 60 (06) :1132-1140
[9]   Dietary intervention impact on gut microbial gene richness [J].
Cotillard, Aurelie ;
Kennedy, Sean P. ;
Kong, Ling Chun ;
Prifti, Edi ;
Pons, Nicolas ;
Le Chatelier, Emmanuelle ;
Almeida, Mathieu ;
Quinquis, Benoit ;
Levenez, Florence ;
Galleron, Nathalie ;
Gougis, Sophie ;
Rizkalla, Salwa ;
Batto, Jean-Michel ;
Renault, Pierre ;
Dore, Joel ;
Zucker, Jean-Daniel ;
Clement, Karine ;
Ehrlich, Stanislav Dusko .
NATURE, 2013, 500 (7464) :585-+
[10]  
Elson CO, 2014, GUT MICROBES, V3, P332