Genome Diversity of Epstein-Barr Virus from Multiple Tumor Types and Normal Infection

被引:187
作者
Palser, Anne L. [1 ]
Grayson, Nicholas E. [1 ]
White, Robert E. [2 ]
Corton, Craig [1 ]
Correia, Samantha [2 ]
Abdullah, Mohammed M. Ba [2 ]
Watson, Simon J. [1 ]
Cotten, Matthew [1 ]
Arrand, John R. [3 ]
Murray, Paul G. [3 ]
Allday, Martin J. [2 ]
Rickinson, Alan B. [3 ]
Young, Lawrence S. [4 ]
Farrell, Paul J. [2 ]
Kellam, Paul [1 ,5 ]
机构
[1] Wellcome Trust Sanger Inst, Cambridge, England
[2] Univ London Imperial Coll Sci Technol & Med, Fac Med, Virol Sect, London, England
[3] Univ Birmingham, Sch Canc Sci, Birmingham, W Midlands, England
[4] Univ Warwick, Coventry CV4 7AL, W Midlands, England
[5] UCL, Div Infect & Immun, London, England
基金
英国惠康基金;
关键词
INTERTYPIC RECOMBINANTS; AMINO-ACID; SEQUENCE; LYMPHOMAGENESIS; ALGORITHMS; STRAINS; BIOLOGY; TOOLS;
D O I
10.1128/JVI.03614-14
中图分类号
Q93 [微生物学];
学科分类号
071005 ; 100705 ;
摘要
Epstein-Barr virus (EBV) infects most of the world's population and is causally associated with several human cancers, but little is known about how EBV genetic variation might influence infection or EBV-associated disease. There are currently no published wild-type EBV genome sequences from a healthy individual and very few genomes from EBV-associated diseases. We have sequenced 71 geographically distinct EBV strains from cell lines, multiple types of primary tumor, and blood samples and the first EBV genome from the saliva of a healthy carrier. We show that the established genome map of EBV accurately represents all strains sequenced, but novel deletions are present in a few isolates. We have increased the number of type 2 EBV genomes sequenced from one to 12 and establish that the type 1/type 2 classification is a major feature of EBV genome variation, defined almost exclusively by variation of EBNA2 and EBNA3 genes, but geographic variation is also present. Single nucleotide polymorphism (SNP) density varies substantially across all known open reading frames and is highest in latency-associated genes. Some T-cell epitope sequences in EBNA3 genes show extensive variation across strains, and we identify codons under positive selection, both important considerations for the development of vaccines and T-cell therapy. We also provide new evidence for recombination between strains, which provides a further mechanism for the generation of diversity. Our results provide the first global view of EBV sequence variation and demonstrate an effective method for sequencing large numbers of genomes to further understand the genetics of EBV infection. IMPORTANCE Most people in the world are infected by Epstein-Barr virus (EBV), and it causes several human diseases, which occur at very different rates in different parts of the world and are linked to host immune system variation. Natural variation in EBV DNA sequence may be important for normal infection and for causing disease. Here we used rapid, cost-effective sequencing to determine 71 new EBV sequences from different sample types and locations worldwide. We showed geographic variation in EBV genomes and identified the most variable parts of the genome. We identified protein sequences that seem to have been selected by the host immune system and detected variability in known immune epitopes. This gives the first overview of EBV genome variation, important for designing vaccines and immune therapy for EBV, and provides techniques to investigate relationships between viral sequence variation and EBV-associated diseases.
引用
收藏
页码:5222 / 5237
页数:16
相关论文
共 52 条
  • [1] Allday Martin J., 2013, Frontiers in Genetics, V4, P212, DOI 10.3389/fgene.2013.00212
  • [2] ABACAS: algorithm-based automatic contiguation of assembled sequences
    Assefa, Samuel
    Keane, Thomas M.
    Otto, Thomas D.
    Newbold, Chris
    Berriman, Matthew
    [J]. BIOINFORMATICS, 2009, 25 (15) : 1968 - 1969
  • [3] DNA-SEQUENCE AND EXPRESSION OF THE B95-8 EPSTEIN-BARR VIRUS GENOME
    BAER, R
    BANKIER, AT
    BIGGIN, MD
    DEININGER, PL
    FARRELL, PJ
    GIBSON, TJ
    HATFULL, G
    HUDSON, GS
    SATCHWELL, SC
    SEGUIN, C
    TUFFNELL, PS
    BARRELL, BG
    [J]. NATURE, 1984, 310 (5974) : 207 - 211
  • [4] Toward almost closed genomes with GapFiller
    Boetzer, Marten
    Pirovano, Walter
    [J]. GENOME BIOLOGY, 2012, 13 (06):
  • [5] Gap5-editing the billion fragment sequence assembly
    Bonfield, James K.
    Whitwham, Andrew
    [J]. BIOINFORMATICS, 2010, 26 (14) : 1699 - 1703
  • [6] Biopython']python: freely available Python']Python tools for computational molecular biology and bioinformatics
    Cock, Peter J. A.
    Antao, Tiago
    Chang, Jeffrey T.
    Chapman, Brad A.
    Cox, Cymon J.
    Dalke, Andrew
    Friedberg, Iddo
    Hamelryck, Thomas
    Kauff, Frank
    Wilczynski, Bartek
    de Hoon, Michiel J. L.
    [J]. BIOINFORMATICS, 2009, 25 (11) : 1422 - 1423
  • [7] Epstein-Barr Virus: An Important Vaccine Target for Cancer Prevention
    Cohen, Jeffrey I.
    Fauci, Anthony S.
    Varmus, Harold
    Nabel, Gary J.
    [J]. SCIENCE TRANSLATIONAL MEDICINE, 2011, 3 (107)
  • [8] jModelTest 2: more models, new heuristics and parallel computing
    Darriba, Diego
    Taboada, Guillermo L.
    Doallo, Ramon
    Posada, David
    [J]. NATURE METHODS, 2012, 9 (08) : 772 - 772
  • [9] Datamonkey 2010: a suite of phylogenetic analysis tools for evolutionary biology
    Delport, Wayne
    Poon, Art F. Y.
    Frost, Simon D. W.
    Pond, Sergei L. Kosakovsky
    [J]. BIOINFORMATICS, 2010, 26 (19) : 2455 - 2457
  • [10] Specific Capture and Whole-Genome Sequencing of Viruses from Clinical Samples
    Depledge, Daniel P.
    Palser, Anne L.
    Watson, Simon J.
    Lai, Imogen Yi-Chun
    Gray, Eleanor R.
    Grant, Paul
    Kanda, Ravinder K.
    Leproust, Emily
    Kellam, Paul
    Breuer, Judith
    [J]. PLOS ONE, 2011, 6 (11):