Large-Scale Genomics Reveals the Genetic Characteristics of Seven Species and Importance of Phylogenetic Distance for Estimating Pan-Genome Size

Cited by: 55
Authors
Park, Sang-Cheol [1]
Lee, Kihyun [2]
Kim, Yeong Ouk [3]
Won, Sungho [1,3,4]
Chun, Jongsik [3,5,6]
Affiliations
[1] Seoul Natl Univ, Inst Hlth & Environm, Seoul, South Korea
[2] Chung Ang Univ, Dept Syst Biotechnol, Anseong, South Korea
[3] Seoul Natl Univ, Interdisciplinary Program Bioinformat, Seoul, South Korea
[4] Seoul Natl Univ, Dept Publ Hlth Sci, Seoul, South Korea
[5] Seoul Natl Univ, Dept Biol Sci, Seoul, South Korea
[6] Seoul Natl Univ, Inst Mol Biol & Genet, Seoul, South Korea
Source
FRONTIERS IN MICROBIOLOGY | 2019 / Volume 10
Funding
National Research Foundation of Singapore;
Keywords
pan-genome; core-genome; Heaps' law; gene pool; large-scale genomics; seven species; estimation model; ANTIBIOTIC-RESISTANCE; SEQUENCE SIMILARITY; ESCHERICHIA-COLI; STRAINS;
DOI
10.3389/fmicb.2019.00834
Chinese Library Classification (CLC) number
Q93 [Microbiology];
Discipline classification code
071005; 100705;
Abstract
For more than a decade, pan-genome analysis has been applied as an effective method for explaining variation in the genetic content of prokaryotic species. However, the genomic characteristics and detailed structures of gene pools have not been fully clarified, because most studies have used a small number of genomes. Here, we constructed pan-genomes of seven species in order to elucidate variations in the genetic contents of >27,000 genomes belonging to Streptococcus pneumoniae, Staphylococcus aureus subsp. aureus, Salmonella enterica subsp. enterica, Escherichia coli and Shigella spp., Mycobacterium tuberculosis complex, Pseudomonas aeruginosa, and Acinetobacter baumannii. This work showed that the pan-genomes of all seven species are open. Additionally, systematic evaluation of the characteristics of their pan-genomes revealed that phylogenetic distance provided valuable information for estimating the parameters for pan-genome size among several models, including Heaps' law. Our results provide a better understanding of the species and a solution to minimize sampling biases associated with genome-sequencing preferences for pathogenic strains.
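As an illustration of the Heaps' law model named in the abstract, the sketch below fits the commonly used power-law form n = kappa * N^gamma, where N is the number of genomes sampled and n is the pan-genome size, by ordinary least squares on log-transformed values. Under this form, a positive exponent gamma is usually read as an open pan-genome. The function name `fit_heaps_law`, the parameter names, and the synthetic data are assumptions made for illustration; this record does not give the paper's exact parameterization, its data, or its phylogenetic-distance-based model, so none of that is reproduced here.

```python
import math

def fit_heaps_law(genome_counts, pan_genome_sizes):
    """Fit n = kappa * N**gamma by least squares in log-log space.

    Hypothetical helper for illustration; not the paper's estimation code.
    """
    xs = [math.log(n) for n in genome_counts]
    ys = [math.log(s) for s in pan_genome_sizes]
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    # Slope of the regression line log(n) = log(kappa) + gamma * log(N)
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    gamma = sxy / sxx
    kappa = math.exp(mean_y - gamma * mean_x)
    return kappa, gamma

# Synthetic data generated from an exact power law (assumed values,
# not measurements from the paper).
genome_counts = [1, 2, 5, 10, 50, 100]
pan_genome_sizes = [3000.0 * n ** 0.3 for n in genome_counts]

kappa, gamma = fit_heaps_law(genome_counts, pan_genome_sizes)
# Under this power-law form, gamma > 0 means the gene pool keeps growing.
is_open = gamma > 0
print(f"kappa={kappa:.1f}, gamma={gamma:.3f}, open={is_open}")
```

With real data, the pan-genome size at each N would typically be averaged over many random orderings of the genomes before fitting, since a single ordering is sensitive to which strains happen to be added first.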
Pages: 12