Motifs in SARS-CoV-2 evolution

被引:0
作者
Barrett, Christopher [1 ,2 ]
Bura, Andrei C. [1 ]
He, Qijun [1 ]
Huang, Fenix W. [1 ]
Li, Thomas J. X. [1 ]
Reidys, Christian M. [1 ,3 ]
机构
[1] Univ Virginia, Biocomplex Inst & Initiat, Charlottesville, VA 22904 USA
[2] Univ Virginia, Dept Comp Sci, Charlottesville, VA 22904 USA
[3] Univ Virginia, Dept Math, Charlottesville, VA 22904 USA
关键词
site motif; relational structure; coevolution; SARS-CoV-2; genomic surveillance; GENOMIC SURVEILLANCE; FITNESS; COEVOLUTION; INFORMATION; PHYLOGENY; IMPROVES; STATES; RNA;
D O I
10.1261/rna.079557.122
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
We present a novel framework enhancing the prediction of whether novel lineage poses the threat of eventually dominating the viral population. The framework is based purely on genomic sequence data, without requiring prior established biological analysis. Its building blocks are sets of coevolving sites in the alignment (motifs), identified via coevolutionary signals. The collection of such motifs forms a relational structure over the polymorphic sites. Motifs are constructed using distances quantifying the coevolutionary coupling of pairs and manifest as coevolving clusters of sites. We present an approach to genomic surveillance based on this notion of relational structure. Our system will issue an alert regarding a lineage, based on its contribution to drastic changes in the relational structure. We then conduct a comprehensive retrospective analysis of the COVID-19 pandemic based on SARS-CoV-2 genomic sequence data in GISAID from October 2020 to September 2022, across 21 lineages and 27 countries with weekly resolution. We investigate the performance of this surveillance system in terms of its accuracy, timeliness, and robustness. Lastly, we study how well each lineage is classified by such a system.
引用
收藏
页码:1 / 15
页数:15
相关论文
共 56 条
  • [1] Mapping the Mutual Information Network of Enzymatic Families in the Protein Structure to Unveil Functional Features
    Aguilar, Daniel
    Oliva, Baldo
    Marino Buslje, Cristina
    [J]. PLOS ONE, 2012, 7 (07):
  • [2] Bacterial and Viral Bioinformatics Resource Center, 2022, Bacterial and viral bioinformatics resource center
  • [3] Split Decomposition: A New and Useful Approach to Phylogenetic Analysis of Distance Data
    Bandelt, Hans-Juergen
    Dress, Andreas W. M.
    [J]. MOLECULAR PHYLOGENETICS AND EVOLUTION, 1992, 1 (03) : 242 - 252
  • [4] Barrett CL, 2022, medRxiv, DOI [10.1101/2022.08.05.22278480, 10.1101/2022.08.05.22278480, DOI 10.1101/2022.08.05.22278480]
  • [5] Multiscale Feedback Loops in SARS-CoV-2 Viral Evolution
    Barrett, Christopher
    Bura, Andrei C.
    He, Qijun
    Huang, Fenix W.
    Li, Thomas J. X.
    Waterman, Michael S.
    Reidys, Christian M.
    [J]. JOURNAL OF COMPUTATIONAL BIOLOGY, 2021, 28 (03) : 248 - 256
  • [6] Bedford T., 2021, Nextstrain
  • [7] Centers for Disease Control and Prevention, 2021, Monitoring variant proportions in United States
  • [8] Global landscape of SARS-CoV-2 genomic surveillance and data sharing
    Chen, Zhiyuan
    Azman, Andrew S.
    Chen, Xinhua
    Zou, Junyi
    Tian, Yuyang
    Sun, Ruijia
    Xu, Xiangyanyu
    Wu, Yani
    Lu, Wanying
    Ge, Shijia
    Zhao, Zeyao
    Yang, Juan
    Leung, Daniel T.
    Domman, Daryl B.
    Yu, Hongjie
    [J]. NATURE GENETICS, 2022, 54 (04) : 499 - +
  • [9] Choudhary MC, 2021, medRxiv, DOI [10.1101/2021.03.02.21252750, 10.1101/2021.03.02.21252750, DOI 10.1101/2021.03.02.21252750]
  • [10] Cover T. M., 2006, Elements of Information Theory, V2nd