SigHunt: horizontal gene transfer finder optimized for eukaryotic genomes

Cited by: 17
Authors
Jaron, Kamil S. [1]
Moravec, Jiri C. [1]
Martinkova, Natalia [1, 2]
Affiliations
[1] Masaryk Univ, Inst Biostat & Anal, Brno, Czech Republic
[2] Acad Sci Czech Republic, Inst Vertebrate Biol, Brno, Czech Republic
Keywords
FUNGUS ASPERGILLUS-FUMIGATUS; CRYPTOSPORIDIUM-PARVUM; SEQUENCE; EVOLUTION; IDENTIFICATION; ISLANDS; ECOLOGY;
DOI
10.1093/bioinformatics/btt727
Chinese Library Classification number
Q5 [Biochemistry];
Discipline classification code
071010 ; 081704 ;
Abstract
Motivation: Genomic islands (GIs) are DNA fragments incorporated into a genome through horizontal gene transfer (also called lateral gene transfer), often with functions novel for a given organism. While methods for their detection are well researched in prokaryotes, the complexity of eukaryotic genomes makes direct utilization of these methods unreliable, and so labour-intensive phylogenetic searches are used instead.
Results: We present a surrogate method that investigates nucleotide base composition of the DNA sequence in a eukaryotic genome and identifies putative GIs. We calculate a genomic signature as a vector of tetranucleotide (4-mer) frequencies using a sliding window approach. Extending the neighbourhood of the sliding window, we establish a local kernel density estimate of the 4-mer frequency. We score the number of 4-mer frequencies in the sliding window that deviate from the credibility interval of their local genomic density using a newly developed discrete interval accumulative score (DIAS). To further improve the effectiveness of DIAS, we select informative 4-mers in a range of organisms using the tetranucleotide quality score developed herein. We show that the SigHunt method is computationally efficient and able to detect GIs in eukaryotic genomes that represent nonameliorated integration. Thus, it is suited to scanning for change in organisms with different DNA composition.
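The windowed scoring idea described in the abstract can be illustrated with a minimal Python sketch. It is not the SigHunt implementation and it does not reproduce the published DIAS formula, which the abstract does not give. As a loudly simplified stand-in, the kernel density estimate and its credibility interval are replaced by an empirical central interval taken from the neighbouring windows, and the score is simply the number of 4-mers falling outside that interval. All function names, the window size, the neighbourhood size and `alpha` are assumptions made for illustration.

```python
from itertools import product

# All 256 tetranucleotides and their positions in the signature vector.
KMERS = ["".join(p) for p in product("ACGT", repeat=4)]
INDEX = {kmer: i for i, kmer in enumerate(KMERS)}


def kmer_frequencies(seq):
    """Relative frequency of every overlapping 4-mer in seq.

    4-mers containing characters other than A, C, G, T are skipped.
    """
    counts = [0] * len(KMERS)
    total = 0
    for i in range(len(seq) - 3):
        idx = INDEX.get(seq[i:i + 4])
        if idx is not None:
            counts[idx] += 1
            total += 1
    return [c / total if total else 0.0 for c in counts]


def window_signatures(seq, window, step):
    """List of (start, signature) pairs for sliding windows over seq."""
    starts = range(0, len(seq) - window + 1, step)
    return [(s, kmer_frequencies(seq[s:s + window])) for s in starts]


def empirical_interval(values, alpha):
    """Central (1 - alpha) interval of values.

    Assumption: used here instead of a kernel-density-based interval.
    """
    ordered = sorted(values)
    offset = int((alpha / 2) * (len(ordered) - 1))
    return ordered[offset], ordered[len(ordered) - 1 - offset]


def outlier_counts(signatures, neighbours, alpha=0.05):
    """For each window, count 4-mers whose frequency lies outside the
    interval spanned by up to `neighbours` windows on either side."""
    scores = []
    for i, (start, sig) in enumerate(signatures):
        first = max(0, i - neighbours)
        last = min(len(signatures), i + neighbours + 1)
        local = [signatures[j][1] for j in range(first, last) if j != i]
        if not local:  # a single window has no neighbourhood to compare with
            scores.append((start, 0))
            continue
        count = 0
        for k in range(len(KMERS)):
            low, high = empirical_interval([v[k] for v in local], alpha)
            if sig[k] < low or sig[k] > high:
                count += 1
        scores.append((start, count))
    return scores
```

Calling `outlier_counts(window_signatures(sequence, 5000, 5000), neighbours=5)` returns one `(start, count)` pair per window; windows whose composition differs sharply from their surroundings receive high counts and would be the candidates to inspect further.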
Pages: 1081-1086
Number of pages: 6