BPGA- an ultra-fast pan-genome analysis pipeline

Cited by: 706
Authors
Chaudhari, Narendrakumar M. [1]
Gupta, Vinod Kumar [1]
Dutta, Chitra [1]
Affiliations
[1] Indian Inst Chem Biol, CSIR, Struct Biol & Bioinformat Div, 4 Raja SC Mullick Rd, Kolkata 700032, India
Source
SCIENTIFIC REPORTS | 2016 / Vol. 6
Keywords
STREPTOCOCCUS-PNEUMONIAE; SEQUENCE; IDENTIFICATION; REVEALS; STRAINS; CORE; PANGENOME; EVOLUTION; INSIGHTS; VACCINE;
DOI
10.1038/srep24373
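Chinese Library Classification (CLC) number
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences];
Discipline classification code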
07 ; 0710 ; 09 ;
Abstract
Recent advances in ultra-high-throughput sequencing technology and metagenomics have led to a paradigm shift in microbial genomics, from comparisons of a few genomes to large-scale pan-genome studies at different scales of phylogenetic resolution. Pan-genome studies provide a framework for estimating the genomic diversity of a dataset, determining the core (conserved), accessory (dispensable) and unique (strain-specific) gene pools of a species, tracing horizontal gene flux across strains and providing insight into species evolution. Existing pan-genome software tools suffer from various limitations, such as support for only limited datasets, difficult installation or demanding requirements, and inadequate functional features. Here we present BPGA (Bacterial Pan Genome Analysis tool), an ultra-fast computational pipeline with seven functional modules. In addition to routine pan-genome analyses, BPGA introduces a number of novel features for downstream analyses, such as core/pan/MLST (Multi Locus Sequence Typing) phylogeny, exclusive presence/absence of genes in specific strains, subset analysis, atypical G+C content analysis, and KEGG and COG mapping of core, accessory and unique genes. Other notable features include minimal running prerequisites, freedom to select the gene clustering method, ultra-fast execution, a user-friendly command-line interface and high-quality graphics outputs. The performance of BPGA has been evaluated using a dataset of complete genome sequences of 28 Streptococcus pyogenes strains.
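The core/accessory/unique partition mentioned in the abstract can be illustrated with a minimal sketch. This is not BPGA's code: the function name `partition_pan_genome`, the toy clusters and the strict rule that a core cluster must occur in every strain are assumptions made here for illustration only, and the abstract does not specify BPGA's clustering methods or thresholds.

```python
from typing import Dict, List, Set


def partition_pan_genome(
    clusters: Dict[str, Set[str]], n_strains: int
) -> Dict[str, List[str]]:
    """Split gene clusters into core, accessory and unique pools.

    clusters maps a cluster ID to the set of strains carrying that gene.
    Assumption: "core" means present in all n_strains strains.
    """
    pools: Dict[str, List[str]] = {"core": [], "accessory": [], "unique": []}
    for cluster_id, strains in sorted(clusters.items()):
        count = len(strains)
        if count == 0:
            continue  # ignore empty clusters
        if count == n_strains:
            pools["core"].append(cluster_id)       # conserved in every strain
        elif count == 1:
            pools["unique"].append(cluster_id)     # strain-specific
        else:
            pools["accessory"].append(cluster_id)  # dispensable
    return pools


# Hypothetical toy input: four gene clusters across three strains A, B, C.
toy_clusters = {
    "c1": {"A", "B", "C"},
    "c2": {"A", "B"},
    "c3": {"C"},
    "c4": {"A", "B", "C"},
}
partition = partition_pan_genome(toy_clusters, n_strains=3)
print(partition)
```

In this toy case the pan-genome is the union of all four clusters, and the core genome is the subset shared by all three strains.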
Pages: 10