A sensitive, support-vector-machine method for the detection of horizontal gene transfers in viral, archaeal and bacterial genomes

被引:41
作者
Tsirigos, A
Rigoutsos, I
机构
[1] IBM Corp, Thomas J Watson Res Ctr, Bioinformat & Pattern Discovery Grp, Computat Biol Ctr, Yorktown Hts, NY 10598 USA
[2] CUNY, New York, NY 10021 USA
[3] MIT, Dept Chem Engn, Cambridge, MA 02139 USA
关键词
D O I
10.1093/nar/gki660
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
In earlier work, we introduced and discussed a generalized computational framework for identifying horizontal transfers. This framework relied on a gene's nucleotide composition, obviated the need for knowledge of codon boundaries and database searches, and was shown to perform very well across a wide range of archaeal and bacterial genomes when compared with previously published approaches, such as Codon Adaptation Index and C + G content. Nonetheless, two considerations remained outstanding: we wanted to further increase the sensitivity of detecting horizontal transfers and also to be able to apply the method to increasingly smaller genomes. In the discussion that follows, we present such a method, Wn-SVM, and show that it exhibits a very significant improvement in sensitivity compared with earlier approaches. Wn-SVM uses a one-class support-vector machine and can learn using rather small training sets. This property makes Wn-SVM particularly suitable for studying small-size genomes, similar to those of viruses, as well as the typically larger archaeal and bacterial genomes. We show experimentally that the new method results in a superior performance across a wide range of organisms and that it improves even upon our own earlier method by an average of 10% across all examined genomes. As a small-genome case study, we analyze the genome of the human cytomegalovirus and demonstrate that Wn-SVM correctly identifies regions that are known to be conserved and prototypical of all beta-herpesvirinae, regions that are known to have been acquired horizontally from the human host and, finally, regions that had not up to now been suspected to be horizontally transferred. Atypical region predictions for many eukaryotic viruses, including the alpha-, beta- and gamma-herpesvirinae, and 123 archaeal and bacterial genomes, have been made available online at http://cbcsrv.watson.ibm.com/HGT_SVM/.
引用
收藏
页码:3699 / 3707
页数:9
相关论文
共 24 条
  • [1] CODON PREFERENCES IN FREE-LIVING MICROORGANISMS
    ANDERSSON, SGE
    KURLAND, CG
    [J]. MICROBIOLOGICAL REVIEWS, 1990, 54 (02) : 198 - 210
  • [2] [Anonymous], 2002, HORIZONTAL GENE TRAN
  • [3] STUDIES ON THE CHEMICAL NATURE OF THE SUBSTANCE INDUCING TRANSFORMATION OF PNEUMOCOCCAL TYPES INDUCTION OF TRANSFORMATION BY A DESOXYRIBONUCLEIC ACID FRACTION ISOLATED FROM PNEUMOCOCCUS TYPE III
    Avery, Oswald T.
    MacLeod, Colin M.
    McCarty, Maclyn
    [J]. JOURNAL OF EXPERIMENTAL MEDICINE, 1944, 79 (02) : 137 - 158
  • [4] CHEE MS, 1990, CURR TOP MICROBIOL, V154, P125
  • [5] CRISTIANINI N., 2000, INTRO SUPPORT VECTOR, P5
  • [6] HGT-DB: a database of putative horizontally transferred genes in prokaryotic complete genomes
    Garcia-Vallve, S
    Guzman, E
    Montero, MA
    Romeu, A
    [J]. NUCLEIC ACIDS RESEARCH, 2003, 31 (01) : 187 - 189
  • [7] Horizontal gene transfer in bacterial and archaeal complete genomes
    Garcia-Vallvé, S
    Romeu, A
    Palau, J
    [J]. GENOME RESEARCH, 2000, 10 (11) : 1719 - 1725
  • [8] Detection of genes with atypical nucleotide sequence in microbial genomes
    Hooper, SD
    Berg, OG
    [J]. JOURNAL OF MOLECULAR EVOLUTION, 2002, 54 (03) : 365 - 375
  • [9] IKEMURA T, 1985, MOL BIOL EVOL, V2, P13
  • [10] Codon usages in different gene classes of the Escherichia coli genome
    Karlin, S
    Mrázek, J
    Campbell, AM
    [J]. MOLECULAR MICROBIOLOGY, 1998, 29 (06) : 1341 - 1355