Comparative Whole-Genome Analysis of Clinical Isolates Reveals Characteristic Architecture of Mycobacterium tuberculosis Pangenome

被引:34
作者
Periwal, Vinita [1 ,5 ]
Patowary, Ashok [2 ]
Vellarikkal, Shamsudheen Karuthedath [2 ,5 ]
Gupta, Anju [4 ]
Singh, Meghna [2 ,5 ]
Mittal, Ashish [2 ]
Jeyapaul, Shamini [2 ]
Chauhan, Rajendra Kumar [2 ]
Singh, Ajay Vir [3 ]
Singh, Pravin Kumar [3 ]
Garg, Parul [3 ]
Katoch, Viswa Mohan [3 ]
Katoch, Kiran [3 ]
Chauhan, Devendra Singh [3 ]
Sivasubbu, Sridhar [2 ]
Scaria, Vinod [1 ]
机构
[1] CSIR, IGIB, GN Ramachandran Knowledge Ctr Genome Informat, Delhi 110007, India
[2] CSIR, IGIB, Genom & Mol Med, Delhi 110007, India
[3] Natl JALMA Inst Leprosy & Other Mycobacterial Dis, Tajganj 282001, Agra, India
[4] CSIR, Open Source Drug Discovery Unit, New Delhi 110001, India
[5] AcSIR, New Delhi 110001, India
来源
PLOS ONE | 2015年 / 10卷 / 04期
关键词
ESCHERICHIA-COLI; RESISTANT STRAINS; SEQUENCE; TOOL; EVOLUTION; ALIGNMENT; COMPLEX; GENES; IDENTIFICATION; ANNOTATION;
D O I
10.1371/journal.pone.0122979
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
The tubercle complex consists of closely related mycobacterium species which appear to be variants of a single species. Comparative genome analysis of different strains could provide useful clues and insights into the genetic diversity of the species. We integrated genome assemblies of 96 strains from Mycobacterium tuberculosis complex (MTBC), which included 8 Indian clinical isolates sequenced and assembled in this study, to understand its pangenome architecture. We predicted genes for all the 96 strains and clustered their respective CDSs into homologous gene clusters (HGCs) to reveal a hard-core, soft-core and accessory genome component of MTBC. The hard-core (HGCs shared amongst 100% of the strains) was comprised of 2,066 gene clusters whereas the soft-core (HGCs shared amongst at least 95% of the strains) comprised of 3,374 gene clusters. The change in the core and accessory genome components when observed as a function of their size revealed that MTBC has an open pangenome. We identified 74 HGCs that were absent from reference strains H37Rv and H37Ra but were present in most of clinical isolates. We report PCR validation on 9 candidate genes depicting 7 genes completely absent from H37Rv and H37Ra whereas 2 genes shared partial homology with them accounting to probable insertion and deletion events. The pangenome approach is a promising tool for studying strain specific genetic differences occurring within species. We also suggest that since selecting appropriate target genes for typing purposes requires the expected target gene be present in all isolates being typed, therefore estimating the core-component of the species becomes a subject of prime importance.
引用
收藏
页数:26
相关论文
共 62 条
  • [1] BASIC LOCAL ALIGNMENT SEARCH TOOL
    ALTSCHUL, SF
    GISH, W
    MILLER, W
    MYERS, EW
    LIPMAN, DJ
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 1990, 215 (03) : 403 - 410
  • [2] [Anonymous], 2005, PHYLIP (phylogeny inference package) version 3.6
  • [3] Dataset of potential targets for Mycobacterium tuberculosis H37Rv through comparative genome analysis
    Asif, Siddiqui M.
    Asad, Amir
    Faizan, Ahmad
    Anjali, Malik S.
    Arvind, Arya
    Neelesh, Kapoor
    Hirdesh, Kumar
    Sanjay, Kumar
    [J]. BIOINFORMATION, 2009, 4 (06) : 245 - 248
  • [4] A genome-wide sequence-independent comparative analysis of insertion-deletion polymorphisms in multiple Mycobacterium tuberculosis strains
    Azhikina, T
    Gvozdevsky, N
    Botvinnik, A
    Fushan, A
    Shemyakin, I
    Stepanshina, V
    Lipin, M
    Barry, CB
    Sverdlov, E
    [J]. RESEARCH IN MICROBIOLOGY, 2006, 157 (03) : 282 - 290
  • [5] Mycobacterium tuberculosis spoligotypes and drug susceptibility pattern of isolates from tuberculosis patients in South-Western Uganda
    Bazira, Joel
    Asiimwe, Benon B.
    Joloba, Moses L.
    Bwanga, Freddie
    Matee, Mecky I.
    [J]. BMC INFECTIOUS DISEASES, 2011, 11
  • [6] Evolution of Mycobacterium tuberculosis
    Behr, Marcel A.
    [J]. NEW PARADIGM OF IMMUNITY TO TUBERCULOSIS, 2013, 783 : 81 - 91
  • [7] The Genome of Mycobacterium Africanum West African 2 Reveals a Lineage-Specific Locus and Genome Erosion Common to the M. tuberculosis Complex
    Bentley, Stephen D.
    Comas, Inaki
    Bryant, Josephine M.
    Walker, Danielle
    Smith, Noel H.
    Harris, Simon R.
    Thurston, Scott
    Gagneux, Sebastien
    Wood, Jonathan
    Antonio, Martin
    Quail, Michael A.
    Gehre, Florian
    Adegbola, Richard A.
    Parkhill, Julian
    de Jong, Bouke C.
    [J]. PLOS NEGLECTED TROPICAL DISEASES, 2012, 6 (02):
  • [8] A new evolutionary scenario for the Mycobacterium tuberculosis complex
    Brosch, R
    Gordon, SV
    Marmiesse, M
    Brodin, P
    Buchrieser, C
    Eiglmeier, K
    Garnier, T
    Gutierrez, C
    Hewinson, G
    Kremer, K
    Parsons, LM
    Pym, AS
    Samper, S
    van Soolingen, D
    Cole, ST
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2002, 99 (06) : 3684 - 3689
  • [9] Deciphering the biology of Mycobacterium tuberculosis from the complete genome sequence
    Cole, ST
    Brosch, R
    Parkhill, J
    Garnier, T
    Churcher, C
    Harris, D
    Gordon, SV
    Eiglmeier, K
    Gas, S
    Barry, CE
    Tekaia, F
    Badcock, K
    Basham, D
    Brown, D
    Chillingworth, T
    Connor, R
    Davies, R
    Devlin, K
    Feltwell, T
    Gentles, S
    Hamlin, N
    Holroyd, S
    Hornby, T
    Jagels, K
    Krogh, A
    McLean, J
    Moule, S
    Murphy, L
    Oliver, K
    Osborne, J
    Quail, MA
    Rajandream, MA
    Rogers, J
    Rutter, S
    Seeger, K
    Skelton, J
    Squares, R
    Squares, S
    Sulston, JE
    Taylor, K
    Whitehead, S
    Barrell, BG
    [J]. NATURE, 1998, 393 (6685) : 537 - +
  • [10] Blast2GO:: a universal tool for annotation, visualization and analysis in functional genomics research
    Conesa, A
    Götz, S
    García-Gómez, JM
    Terol, J
    Talón, M
    Robles, M
    [J]. BIOINFORMATICS, 2005, 21 (18) : 3674 - 3676