Modules of co-occurrence in the cyanobacterial pan-genome reveal functional associations between groups of ortholog genes

Cited by: 13
Authors
Beck, Christian [1]
Knoop, Henning [1]
Steuer, Ralf [1]
Affiliations
[1] Humboldt Univ, ITB, Berlin, Germany
Source
PLOS GENETICS | 2018 / Vol. 14 / Issue 03
Keywords
PROTEIN; EVOLUTION; CORE; BIOSYNTHESIS; METABOLISM; DIVERSITY; DISCOVERY; COVERAGE; INSIGHTS;
DOI
10.1371/journal.pgen.1007239
Chinese Library Classification number
Q3 [Genetics];
Discipline classification code
071007 ; 090102 ;
Abstract
Cyanobacteria are a monophyletic phylogenetic group of global importance and have received considerable attention as potential host organisms for the renewable synthesis of chemical bulk products from atmospheric CO2. The cyanobacterial phylum exhibits enormous metabolic diversity with respect to morphology, lifestyle and habitat. As yet, however, research has mostly focused on a few model strains, and cyanobacterial diversity is insufficiently understood. In this respect, the increasing availability of fully sequenced bacterial genomes opens new and unprecedented opportunities to investigate the genetic inventory of organisms in the context of their pan-genome. Here, we seek to understand cyanobacterial diversity using a comparative genome analysis of 77 fully sequenced and assembled cyanobacterial genomes. We use phylogenetic profiling to analyze the co-occurrence of clusters of likely ortholog genes (CLOGs) and reveal novel functional associations between CLOGs that are not captured by co-localization of genes. Going beyond pair-wise co-occurrences, we propose a network approach that allows us to identify modules of co-occurring CLOGs. The extracted modules exhibit a high degree of functional coherence and reveal known as well as previously unknown functional associations. We argue that the high functional coherence observed for the modules is a consequence of the similar-yet-diverse nature of cyanobacteria. Our approach highlights the importance of a multi-strain analysis to understand gene functions and environmental adaptations, with implications beyond the cyanobacterial phylum. The analysis is augmented with a simple toolbox that facilitates further analysis to investigate the co-occurrence neighborhood of specific CLOGs of interest.
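The general idea of phylogenetic profiling followed by a network step can be illustrated with a minimal sketch. It is not the authors' method or toolbox: the Jaccard index as the co-occurrence score, the 0.75 threshold, the use of connected components as "modules", and all CLOG and genome names are assumptions chosen only for illustration.

```python
from itertools import combinations


def jaccard(a, b):
    """Jaccard similarity of two genome sets (assumed co-occurrence score)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def cooccurrence_modules(profiles, threshold=0.75):
    """Group CLOGs whose presence profiles are similar.

    profiles: dict mapping a CLOG name to the set of genomes containing it.
    Returns sorted lists of CLOG names; singletons are discarded.
    """
    # Build an undirected graph: an edge links two CLOGs whose profiles
    # reach the (hypothetical) similarity threshold.
    adjacency = {clog: set() for clog in profiles}
    for x, y in combinations(profiles, 2):
        if jaccard(profiles[x], profiles[y]) >= threshold:
            adjacency[x].add(y)
            adjacency[y].add(x)

    # Treat each connected component with more than one CLOG as a module.
    modules, seen = [], set()
    for start in profiles:
        if start in seen:
            continue
        component, stack = set(), [start]
        while stack:
            node = stack.pop()
            if node in component:
                continue
            component.add(node)
            stack.extend(adjacency[node] - component)
        seen |= component
        if len(component) > 1:
            modules.append(sorted(component))
    return sorted(modules)


# Toy presence/absence data (invented, not taken from the paper).
profiles = {
    "clog_A": {"g1", "g2", "g3"},
    "clog_B": {"g1", "g2", "g3"},
    "clog_C": {"g1", "g2", "g3", "g4"},
    "clog_D": {"g5", "g6"},
    "clog_E": {"g5", "g6"},
    "clog_F": {"g1", "g5"},
}

if __name__ == "__main__":
    for module in cooccurrence_modules(profiles):
        print(module)
```

In this toy example, clog_F shares only one genome with either group and therefore joins no module. A real analysis would need a score that accounts for phylogenetic relatedness between strains, which this sketch ignores.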
Pages: 20