Indels in SARS-CoV-2 occur at template-switching hotspots

被引:26
作者
Chrisman, Brianna Sierra [1 ]
Paskov, Kelley [2 ]
Stockham, Nate. [3 ]
Tabatabaei, Kevin [4 ]
Jung, Jae-Yoon [2 ]
Washington, Peter [1 ]
Varma, Maya [5 ]
Sun, Min Woo [2 ]
Maleki, Sepideh [6 ]
Wall, Dennis P. [2 ,7 ]
机构
[1] Stanford Univ, Dept Bioengn, Stanford, CA 94305 USA
[2] Stanford Univ, Dept Biomed Data Sci, Stanford, CA 94305 USA
[3] Stanford Univ, Dept Neurosci, Stanford, CA 94305 USA
[4] McMaster Univ, Fac Hlth Sci, Hamilton, ON, Canada
[5] Stanford Univ, Dept Comp Sci, Stanford, CA 94305 USA
[6] Univ Texas Austin, Dept Comp Sci, Austin, TX 78712 USA
[7] Stanford Univ, Dept Pediat Syst Med, Stanford, CA 94305 USA
关键词
RNA virus; Recombination; SARS-CoV-2; Genomics; CORONAVIRUS; RECOMBINATION; ORIGIN; RNA;
D O I
10.1186/s13040-021-00251-0
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
The evolutionary dynamics of SARS-CoV-2 have been carefully monitored since the COVID-19 pandemic began in December 2019. However, analysis has focused primarily on single nucleotide polymorphisms and largely ignored the role of insertions and deletions (indels) as well as recombination in SARS-CoV-2 evolution. Using sequences from the GISAID database, we catalogue over 100 insertions and deletions in the SARS-CoV-2 consensus sequences. We hypothesize that these indels are artifacts of recombination events between SARS-CoV-2 replicates whereby RNA-dependent RNA polymerase (RdRp) re-associates with a homologous template at a different loci ("imperfect homologous recombination"). We provide several independent pieces of evidence that suggest this. (1) The indels from the GISAID consensus sequences are clustered at specific regions of the genome. (2) These regions are also enriched for 5' and 3' breakpoints in the transcription regulatory site (TRS) independent transcriptome, presumably sites of RNA-dependent RNA polymerase (RdRp) template-switching. (3) Within raw reads, these indel hotspots have cases of both high intra-host heterogeneity and intra-host homogeneity, suggesting that these indels are both consequences of de novo recombination events within a host and artifacts of previous recombination. We briefly analyze the indels in the context of RNA secondary structure, noting that indels preferentially occur in "arms" and loop structures of the predicted folded RNA, suggesting that secondary structure may be a mechanism for TRS-independent template-switching in SARS-CoV-2 or other coronaviruses. These insights into the relationship between structural variation and recombination in SARS-CoV-2 can improve our reconstructions of the SARS-CoV-2 evolutionary history as well as our understanding of the process of RdRp template-switching in RNA viruses.
引用
收藏
页数:16
相关论文
共 53 条
  • [1] A max-margin training of RNA secondary structure prediction integrated with the thermodynamic model
    Akiyama, Manato
    Sato, Kengo
    Sakakibara, Yasubumi
    [J]. JOURNAL OF BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, 2018, 16 (06)
  • [2] The proximal origin of SARS-CoV-2
    Andersen, Kristian G.
    Rambaut, Andrew
    Lipkin, W. Ian
    Holmes, Edward C.
    Garry, Robert F.
    [J]. NATURE MEDICINE, 2020, 26 (04) : 450 - 452
  • [3] [Anonymous], 2006, Randomization, Bootstrap and Monte Carlo Methods in Biology
  • [4] RNApdbee-a webserver to derive secondary structures from pdb files of knotted and unknotted RNAs
    Antczak, Maciej
    Zok, Tomasz
    Popenda, Mariusz
    Lukasiak, Piotr
    Adamiak, Ryszard W.
    Blazewicz, Jacek
    Szachniuk, Marta
    [J]. NUCLEIC ACIDS RESEARCH, 2014, 42 (W1) : W368 - W372
  • [5] RANDOM NATURE OF CORONAVIRUS RNA RECOMBINATION IN THE ABSENCE OF SELECTION PRESSURE
    BANNER, LR
    LAI, MMC
    [J]. VIROLOGY, 1991, 185 (01) : 441 - 445
  • [6] Evolutionary origins of the SARS-CoV-2 sarbecovirus lineage responsible for the COVID-19 pandemic
    Boni, Maciej F.
    Lemey, Philippe
    Jiang, Xiaowei
    Lam, Tommy Tsan-Yuk
    Perry, Blair W.
    Castoe, Todd A.
    Rambaut, Andrew
    Robertson, David L.
    [J]. NATURE MICROBIOLOGY, 2020, 5 (11) : 1408 - +
  • [7] Analyses of a whole-genome inter-clade recombination map of hepatitis delta virus suggest a host polymerase-driven and viral RNA structure-promoted template-switching mechanism for viral RNA recombination
    Chao, Mei
    Wang, Tzu-Chi
    Lin, Chia-Chi
    Wang, Robert Yung-Liang
    Lin, Wen-Bin
    Lee, Shang-En
    Cheng, Ying-Yu
    Yeh, Chau-Ting
    Iang, Shan-Bei
    [J]. ONCOTARGET, 2017, 8 (37) : 60841 - 60859
  • [8] fastp: an ultra-fast all-in-one FASTQ preprocessor
    Chen, Shifu
    Zhou, Yanqing
    Chen, Yaru
    Gu, Jia
    [J]. BIOINFORMATICS, 2018, 34 (17) : 884 - 890
  • [9] Mechanism of RNA recombination in carmo- and tombusviruses: Evidence for template switching by the RNA-Dependent RNA polymerase in vitro
    Cheng, CP
    Nagy, PD
    [J]. JOURNAL OF VIROLOGY, 2003, 77 (22) : 12033 - 12047
  • [10] bpRNA: large-scale automated annotation and analysis of RNA secondary structure
    Danaee, Padideh
    Rouches, Mason
    Wiley, Michelle
    Deng, Dezhong
    Huang, Liang
    Hendrix, David
    [J]. NUCLEIC ACIDS RESEARCH, 2018, 46 (11) : 5381 - 5394