Biomarker-based classification of bacterial and fungal whole-blood infections in a genome-wide expression study

被引:30
作者
Dix, Andreas [1 ]
Huenniger, Kerstin [2 ,3 ]
Weber, Michael [2 ,3 ]
Guthke, Reinhard [1 ]
Kurzai, Oliver [2 ,3 ]
Linde, Joerg [1 ]
机构
[1] Hans Knoell Inst, Leibniz Inst Nat Prod Res & Infect Biol, Syst Biol Bioinformat, D-07745 Jena, Germany
[2] Univ Jena, Sept Res Ctr, Jena, Germany
[3] Hans Knoell Inst, Leibniz Inst Nat Prod Res & Infect Biol, D-07745 Jena, Germany
关键词
immune response; microarray; feature selection; systems biology; decision tree based methods; fungal pathogens; REAL-TIME PCR; GENE-EXPRESSION; STREAM PATHOGENS; RANDOM FOREST; SEPSIS; SELECTION; IDENTIFICATION; EPIDEMIOLOGY; DISCOVERY; CULTURE;
D O I
10.3389/fmicb.2015.00171
中图分类号
Q93 [微生物学];
学科分类号
071005 ; 100705 ;
摘要
Sepsis is a clinical syndrome that can be caused by bacteria or fungi. Early knowledge on the nature of the causative agent is a prerequisite for targeted anti-microbial therapy. Besides currently used detection methods like blood culture and PCR-based assays, the analysis of the transcriptional response of the host to infecting organisms holds great promise. In this study, we aim to examine the transcriptional footprint of infections caused by the bacterial pathogens Staphylococcus aureus and Escherichia coli and the fungal pathogens Candida albicans and Aspergillus fumigatus in a human whole-blood model. Moreover, we use the expression information to build a random forest classifier to classify if a sample contains a bacterial, fungal, or mock-infection. After normalizing the transcription intensities using stably expressed reference genes, we filtered the gene set for biomarkers of bacterial or fungal blood infections. This selection is based on differential expression and an additional gene relevance measure. In this way, we identified 38 biomarker genes, including IL6, SOCS3, and IRG1 which were already associated to sepsis by other studies. Using these genes, we trained the classifier and assessed its performance. It yielded a 96% accuracy (sensitivities >93%, specificities >97%) for a 10-fold stratified cross-validation and a 92% accuracy (sensitivities and specificities >83%) for an additional test dataset comprising Cryptococcus neoformans infections. Furthermore, the classifier is robust to Gaussian noise, indicating correct class predictions on datasets of new species. In conclusion, this genome-wide approach demonstrates an effective feature selection process in combination with the construction of a well-performing classification model. Further analyses of genes with pathogen-dependent expression patterns can provide insights into the systemic host responses, which may lead to new anti-microbial therapeutic advances.
引用
收藏
页数:11
相关论文
共 49 条
  • [1] Selection bias in gene extraction on the basis of microarray gene-expression data
    Ambroise, C
    McLachlan, GJ
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2002, 99 (10) : 6562 - 6566
  • [2] Gene Ontology: tool for the unification of biology
    Ashburner, M
    Ball, CA
    Blake, JA
    Botstein, D
    Butler, H
    Cherry, JM
    Davis, AP
    Dolinski, K
    Dwight, SS
    Eppig, JT
    Harris, MA
    Hill, DP
    Issel-Tarver, L
    Kasarskis, A
    Lewis, S
    Matese, JC
    Richardson, JE
    Ringwald, M
    Rubin, GM
    Sherlock, G
    [J]. NATURE GENETICS, 2000, 25 (01) : 25 - 29
  • [3] CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING
    BENJAMINI, Y
    HOCHBERG, Y
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) : 289 - 300
  • [4] A multicenter trial to compare blood culture with polymerase chain reaction in severe human sepsis
    Bloos, Frank
    Hinder, Frank
    Becker, Karsten
    Sachse, Svea
    Dessap, Armand Mekontso
    Straube, Eberhard
    Cattoir, Vincent
    Brun-Buisson, Christian
    Reinhart, Konrad
    Peters, Georg
    Bauer, Michael
    [J]. INTENSIVE CARE MEDICINE, 2010, 36 (02) : 241 - 247
  • [5] Random forests
    Breiman, L
    [J]. MACHINE LEARNING, 2001, 45 (01) : 5 - 32
  • [6] Aspergillus fumigatus induces microRNA-132 in human monocytes and dendritic cells
    Das Gupta, Mithun
    Fliesser, Mirjam
    Springer, Jan
    Breitschopf, Tanja
    Schlossnagel, Hannes
    Schmitt, Anna-Lena
    Kurzai, Oliver
    Huenniger, Kerstin
    Einsele, Hermann
    Loeffler, Juergen
    [J]. INTERNATIONAL JOURNAL OF MEDICAL MICROBIOLOGY, 2014, 304 (5-6) : 592 - 596
  • [7] Gene selection and classification of microarray data using random forest -: art. no. 3
    Díaz-Uriarte, R
    de Andrés, SA
    [J]. BMC BIOINFORMATICS, 2006, 7 (1)
  • [8] lumi:: a pipeline for processing Illumina microarray
    Du, Pan
    Kibbe, Warren A.
    Lin, Simon M.
    [J]. BIOINFORMATICS, 2008, 24 (13) : 1547 - 1548
  • [9] Transcriptome Analysis of Neisseria meningitidis in Human Whole Blood and Mutagenesis Studies Identify Virulence Factors Involved in Blood Survival
    Echenique-Rivera, Hebert
    Muzzi, Alessandro
    Del Tordello, Elena
    Seib, Kate L.
    Francois, Patrice
    Rappuoli, Rino
    Pizza, Mariagrazia
    Serruto, Davide
    [J]. PLOS PATHOGENS, 2011, 7 (05)
  • [10] GOrilla: a tool for discovery and visualization of enriched GO terms in ranked gene lists
    Eden, Eran
    Navon, Roy
    Steinfeld, Israel
    Lipson, Doron
    Yakhini, Zohar
    [J]. BMC BIOINFORMATICS, 2009, 10