The landscape of microbial phenotypic traits and associated genes

Cited by: 93
Authors
Brbic, Maria [1]
Piskorec, Matija [1]
Vidulin, Vedrana [1]
Krisko, Anita [2]
Smuc, Tomislav [1]
Supek, Fran [1,3,4]
Affiliations
[1] Rudjer Boskovic Inst, Div Elect, Zagreb 10000, Croatia
[2] Mediterranean Inst Life Sci, Split 21000, Croatia
[3] Barcelona Inst Sci & Technol, EMBL CRG Syst Biol Res Unit, Ctr Genom Regulat CRG, Barcelona 08003, Spain
[4] UPF, Barcelona 08002, Spain
Keywords
PROTEIN FAMILIES; GENOME; BACTERIAL; SPORULATION; SIGNATURES; DATABASE; YEAST; CONSERVATION; RESISTANCE; EXPRESSION
DOI
10.1093/nar/gkw964
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification code
071010; 081704
Abstract
Bacteria and Archaea display a variety of phenotypic traits and can adapt to diverse ecological niches. However, systematic annotation of prokaryotic phenotypes is lacking. We have therefore developed ProTraits, a resource containing ~545 000 novel phenotype inferences, spanning 424 traits assigned to 3046 bacterial and archaeal species. These annotations were assigned by a computational pipeline that associates microbes with phenotypes by text-mining the scientific literature and the broader World Wide Web, while also being able to define novel concepts from unstructured text. Moreover, the ProTraits pipeline assigns phenotypes by drawing extensively on comparative genomics, capturing patterns in gene repertoires, codon usage biases, proteome composition and co-occurrence in metagenomes. Notably, we find that gene synteny is highly predictive of many phenotypes, and highlight examples of gene neighborhoods associated with spore-forming ability. A global analysis of trait interrelatedness outlined clusters in the microbial phenotype network, suggesting common genetic underpinnings. Our extended set of phenotype annotations allows detection of 57 088 high-confidence gene-trait links, which recover many known associations involving sporulation, flagella, catalase activity, aerobicity, photosynthesis and other traits. Over 99% of the commonly occurring gene families are involved in genetic interactions conditional on at least one phenotype, suggesting that epistasis has a major role in shaping microbial gene content.
Pages: 10074-10090
Page count: 17
Related papers
93 in total
[1]   Genome sequences of rare, uncultured bacteria obtained by differential coverage binning of multiple metagenomes [J].
Albertsen, Mads ;
Hugenholtz, Philip ;
Skarshewski, Adam ;
Nielsen, Kare L. ;
Tyson, Gene W. ;
Nielsen, Per H. .
NATURE BIOTECHNOLOGY, 2013, 31 (06) :533-+
[2]   Event-based text mining for biology and functional genomics [J].
Ananiadou, Sophia ;
Thompson, Paul ;
Nawaz, Raheel ;
McNaught, John ;
Kell, Douglas B. .
BRIEFINGS IN FUNCTIONAL GENOMICS, 2015, 14 (03) :213-230
[3]   Learning Topic Models - Going beyond SVD [J].
Arora, Sanjeev ;
Ge, Rong ;
Moitra, Ankur .
2012 IEEE 53RD ANNUAL SYMPOSIUM ON FOUNDATIONS OF COMPUTER SCIENCE (FOCS), 2012, :1-10
[4]   An experimentally supported model of the Bacillus subtilis global transcriptional regulatory network [J].
Arrieta-Ortiz, Mario L. ;
Hafemeister, Christoph ;
Bate, Ashley Rose ;
Chu, Timothy ;
Greenfield, Alex ;
Shuster, Bentley ;
Barry, Samantha N. ;
Gallitto, Matthew ;
Liu, Brian ;
Kacmarczyk, Thadeous ;
Santoriello, Francis ;
Chen, Jie ;
Rodrigues, Christopher D. A. ;
Sato, Tsutomu ;
Rudner, David Z. ;
Driks, Adam ;
Bonneau, Richard ;
Eichenberger, Patrick .
MOLECULAR SYSTEMS BIOLOGY, 2015, 11 (11)
[5]   A Tutorial and Case Study in Propensity Score Analysis: An Application to Estimating the Effect of In-Hospital Smoking Cessation Counseling on Mortality [J].
Austin, Peter C. .
MULTIVARIATE BEHAVIORAL RESEARCH, 2011, 46 (01) :119-151
[6]   Convergent Adaptation in the Dominant Global Hospital Clone ST239 of Methicillin-Resistant Staphylococcus aureus [J].
Baines, Sarah L. ;
Holt, Kathryn E. ;
Schultz, Mark B. ;
Seemann, Torsten ;
Howden, Brian O. ;
Jensen, Slade O. ;
van Hal, Sebastiaan J. ;
Coombs, Geoffrey W. ;
Firth, Neville ;
Powell, David R. ;
Stinear, Timothy P. ;
Howden, Benjamin P. .
MBIO, 2015, 6 (02)
[7]   An Automated Phenotype-Driven Approach (GeneForce) for Refining Metabolic and Regulatory Models [J].
Barua, Dipak ;
Kim, Joonhoon ;
Reed, Jennifer L. .
PLOS COMPUTATIONAL BIOLOGY, 2010, 6 (10)
[8]   PhenoLink - a web-tool for linking phenotype to ∼omics data for bacteria: application to gene-trait matching for Lactobacillus plantarum strains [J].
Bayjanov, Jumamurat R. ;
Molenaar, Douwe ;
Tzeneva, Vesela ;
Siezen, Roland J. ;
van Hijum, Sacha A. F. T. .
BMC GENOMICS, 2012, 13
[9]   Overview of the gene regulation network and the bacteria biotope tasks in BioNLP'13 shared task [J].
Bossy, Robert ;
Golik, Wiktoria ;
Ratkovic, Zorana ;
Valsamou, Dialekti ;
Bessieres, Philippe ;
Nedellec, Claire .
BMC BIOINFORMATICS, 2015, 16
[10]   Global Shifts in Genome and Proteome Composition Are Very Tightly Coupled [J].
Brbic, Maria ;
Warnecke, Tobias ;
Krisko, Anita ;
Supek, Fran .
GENOME BIOLOGY AND EVOLUTION, 2015, 7 (06) :1519-1532