BPGA- an ultra-fast pan-genome analysis pipeline

Cited by: 706
Authors
Chaudhari, Narendrakumar M. [1]
Gupta, Vinod Kumar [1]
Dutta, Chitra [1]
Affiliations
[1] Indian Inst Chem Biol, CSIR, Struct Biol & Bioinformat Div, 4 Raja SC Mullick Rd, Kolkata 700032, India
Source
SCIENTIFIC REPORTS | 2016 / Vol. 6
Keywords
STREPTOCOCCUS-PNEUMONIAE; SEQUENCE; IDENTIFICATION; REVEALS; STRAINS; CORE; PANGENOME; EVOLUTION; INSIGHTS; VACCINE;
DOI
10.1038/srep24373
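Chinese Library Classification (CLC) number
O [Mathematical Sciences and Chemistry]; P [Astronomy and Earth Sciences]; Q [Biological Sciences]; N [General Natural Sciences];
Subject classification code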
07 ; 0710 ; 09 ;
Abstract
Recent advances in ultra-high-throughput sequencing technology and metagenomics have led to a paradigm shift in microbial genomics from few genome comparisons to large-scale pan-genome studies at different scales of phylogenetic resolution. Pan-genome studies provide a framework for estimating the genomic diversity of the dataset, determining core (conserved), accessory (dispensable) and unique (strain-specific) gene pool of a species, tracing horizontal gene-flux across strains and providing insight into species evolution. The existing pan genome software tools suffer from various limitations like limited datasets, difficult installation/requirements, inadequate functional features etc. Here we present an ultra-fast computational pipeline BPGA (Bacterial Pan Genome Analysis tool) with seven functional modules. In addition to the routine pan genome analyses, BPGA introduces a number of novel features for downstream analyses like core/pan/MLST (Multi Locus Sequence Typing) phylogeny, exclusive presence/absence of genes in specific strains, subset analysis, atypical G + C content analysis and KEGG & COG mapping of core, accessory and unique genes. Other notable features include minimum running prerequisites, freedom to select the gene clustering method, ultra-fast execution, user friendly command line interface and high-quality graphics outputs. The performance of BPGA has been evaluated using a dataset of complete genome sequences of 28 Streptococcus pyogenes strains.
Pages: 10
Related papers
50 in total
  • [21] Pan-Genome Analysis and Secondary Metabolic Pathway Mining of Biocontrol Bacterium Brevibacillus brevis
    Du, Jie
    Huang, Binbin
    Huang, Jun
    Long, Qingshan
    Zhang, Cuiyang
    Guo, Zhaohui
    Wang, Yunsheng
    Chen, Wu
    Tan, Shiyong
    Liu, Qingshu
    AGRONOMY-BASEL, 2024, 14 (05):
  • [22] Analysis of pan-genome to identify the core genes and essential genes of Brucella spp.
    Yang, Xiaowen
    Li, Yajie
    Zang, Juan
    Li, Yexia
    Bie, Pengfei
    Lu, Yanli
    Wu, Qingmin
    MOLECULAR GENETICS AND GENOMICS, 2016, 291 (02) : 905 - 912
  • [23] Analysis of genetic recombination and the pan-genome of a highly recombinogenic bacteriophage species
    Yahara, Koji
    Lehours, Philippe
    Vale, Filipa F.
    MICROBIAL GENOMICS, 2019, 5 (08):
  • [24] Pan-genome analysis reveals novel chromosomal markers for multiplex PCR-based specific detection of Bacillus anthracis
    Zorigt, Tuvshinzaya
    Furuta, Yoshikazu
    Paudel, Atmika
    Kamboyi, Harvey Kakoma
    Shawa, Misheck
    Chuluun, Mungunsar
    Sugawara, Misa
    Enkhtsetseg, Nyamdorj
    Enkhtuya, Jargalsaikhan
    Battsetseg, Badgar
    Munyeme, Musso
    Hang'ombe, Bernard M.
    Higashi, Hideaki
    BMC INFECTIOUS DISEASES, 2024, 24 (01)
  • [25] Pan-genome analysis of invasive Streptococcus mutans strains
    Sujitha, Srinivasan
    Gunasekaran, Paramasamy
    Rajendhran, Jeyaprakash
    CURRENT SCIENCE, 2024, 127 (07) : 849 - 855
  • [26] Pangloss: A Tool for Pan-Genome Analysis of Microbial Eukaryotes
    McCarthy, Charley G. P.
    Fitzpatrick, David A.
    GENES, 2019, 10 (07):
  • [27] Determining the Genetic Characteristics of Resistance and Virulence of the "Epidermidis Cluster Group" Through Pan-Genome Analysis
    Sun, Zhewei
    Zhou, Danying
    Zhang, Xueya
    Li, Qiaoling
    Lin, Hailong
    Lu, Wei
    Liu, Hongmao
    Lu, Junwan
    Lin, Xi
    Li, Kewei
    Xu, Teng
    Bao, Qiyu
    Zhang, Hailin
    FRONTIERS IN CELLULAR AND INFECTION MICROBIOLOGY, 2020, 10
  • [28] Pan-genome sequence analysis using Panseq: an online tool for the rapid analysis of core and accessory genomic regions
    Laing, Chad
    Buchanan, Cody
    Taboada, Eduardo N.
    Zhang, Yongxiang
    Kropinski, Andrew
    Villegas, Andre
    Thomas, James E.
    Gannon, Victor P. J.
    BMC BIOINFORMATICS, 2010, 11
  • [29] Pan-genome analysis and ancestral state reconstruction of class halobacteria: probability of a new super-order
    Gaba, Sonam
    Kumari, Abha
    Medema, Marnix
    Kaushik, Rajeev
    SCIENTIFIC REPORTS, 2020, 10 (01)
  • [30] Comparative Genomics of Mycoplasma: Analysis of Conserved Essential Genes and Diversity of the Pan-Genome
    Liu, Wei
    Fang, Liurong
    Li, Mao
    Li, Sha
    Guo, Shaohua
    Luo, Rui
    Feng, Zhixin
    Li, Bin
    Zhou, Zhemin
    Shao, Guoqing
    Chen, Huanchun
    Xiao, Shaobo
    PLOS ONE, 2012, 7 (04):