MTGIpick allows robust identification of genomic islands from a single genome

Cited by: 48
Authors
Dai, Qi [1, 2]
Bao, Chaohui [1 ]
Hai, Yabing [1 ]
Ma, Sheng [1 ]
Zhou, Tao [1 ]
Wang, Cong [1 ]
Wang, Yunfei [2 ]
Huo, Wenwen [2 ]
Liu, Xiaoqing [3 ]
Yao, Yuhua [1 ]
Xuan, Zhenyu [2 ]
Chen, Min [4 ]
Zhang, Michael Q. [2, 5]
Affiliations
[1] Zhejiang Sci Tech Univ, Coll Life Sci, Hangzhou 310018, Zhejiang, Peoples R China
[2] Univ Texas Dallas, Ctr Syst Biol, Richardson, TX 75080 USA
[3] Hangzhou Dianzi Univ, Coll Sci, Hangzhou, Zhejiang, Peoples R China
[4] Univ Texas Dallas, Dept Math Sci, Richardson, TX 75083 USA
[5] Tsinghua Univ, Ctr Syst Biol, Beijing, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
genomic island detection; genomic signature; feature selection; multiscale statistical test; boundary detection; HORIZONTAL GENE-TRANSFER; PATHOGENICITY ISLANDS; PROKARYOTIC GENOMES; BACTERIAL GENOMES; COMMON THEMES; TRANSFER-RNA; DATABASE; VISUALIZATION; ISLANDVIEWER; EVOLUTION;
DOI
10.1093/bib/bbw118
Chinese Library Classification number
Q5 [Biochemistry];
Discipline classification code
071010 ; 081704 ;
Abstract
Genomic islands (GIs), which are associated with microbial adaptations and carry sequence patterns different from those of the host, are sporadically distributed among closely related species. This bias can dominate the signal of interest in GI detection. However, variations still exist among the segments of the host, and no uniform standard exists regarding the best method of discriminating GIs from the rest of the genome in terms of compositional bias. In the present work, we propose a robust software tool, MTGIpick, which uses regions with pattern bias showing multiscale difference levels to identify GIs from the host. MTGIpick can identify GIs from a single genome without genome annotation or prior knowledge from other data sets. When real biological data were used, MTGIpick demonstrated better performance than existing methods and revealed potential GIs with accurate sizes that existing methods missed because of a uniform standard. Software and supplementary material are freely available at http://bioinfo.zstu.edu.cn/MTGI or https://github.com/bioinfo0706/MTGIpick.
Pages: 361-373
Page count: 13
Related papers
46 in total
  • [1] Gapped BLAST and PSI-BLAST: a new generation of protein database search programs
    Altschul, SF
    Madden, TL
    Schaffer, AA
    Zhang, JH
    Zhang, Z
    Miller, W
    Lipman, DJ
    [J]. NUCLEIC ACIDS RESEARCH, 1997, 25 (17) : 3389 - 3402
  • [2] Detection of genomic islands via segmental genome heterogeneity
    Arvey, Aaron J.
    Azad, Rajeev K.
    Raval, Alpan
    Lawrence, Jeffrey G.
    [J]. NUCLEIC ACIDS RESEARCH, 2009, 37 (16) : 5255 - 5266
  • [3] Towards more robust methods of alien gene detection
    Azad, Rajeev K.
    Lawrence, Jeffrey G.
    [J]. NUCLEIC ACIDS RESEARCH, 2011, 39 (09) : e56
  • [4] Systematic determination of the mosaic structure of bacterial genomes: species backbone versus strain-specific loops
    Chiapello, H
    Bourgait, I
    Sourivong, F
    Heuclin, G
    Gendrault-Jacquemard, A
    Petit, MA
    El Karoui, M
    [J]. BMC BIOINFORMATICS, 2005, 6 (1)
  • [5] Mauve: Multiple alignment of conserved genomic sequence with rearrangements
    Darling, ACE
    Mau, B
    Blattner, FR
    Perna, NT
    [J]. GENOME RESEARCH, 2004, 14 (07) : 1394 - 1403
  • [6] IslandViewer update: improved genomic island discovery and visualization
    Dhillon, Bhavjinder K.
    Chiu, Terry A.
    Laird, Matthew R.
    Langille, Morgan G. I.
    Brinkman, Fiona S. L.
    [J]. NUCLEIC ACIDS RESEARCH, 2013, 41 (W1) : W129 - W132
  • [7] Genomic islands in pathogenic and environmental microorganisms
    Dobrindt, U
    Hochhut, B
    Hentschel, U
    Hacker, J
    [J]. NATURE REVIEWS MICROBIOLOGY, 2004, 2 (05) : 414 - 424
  • [8] Common themes in microbial pathogenicity revisited
    Finlay, BB
    Falkow, S
    [J]. MICROBIOLOGY AND MOLECULAR BIOLOGY REVIEWS, 1997, 61 (02) : 136 - +
  • [9] The Pfam protein families database
    Finn, Robert D.
    Tate, John
    Mistry, Jaina
    Coggill, Penny C.
    Sammut, Stephen John
    Hotz, Hans-Rudolf
    Ceric, Goran
    Forslund, Kristoffer
    Eddy, Sean R.
    Sonnhammer, Erik L. L.
    Bateman, Alex
    [J]. NUCLEIC ACIDS RESEARCH, 2008, 36 : D281 - D288
  • [10] Pathogenicity islands: a molecular toolbox for bacterial virulence
    Gal-Mor, Ohad
    Finlay, B. Brett
    [J]. CELLULAR MICROBIOLOGY, 2006, 8 (11) : 1707 - 1719