Optimized quantification of intra-host viral diversity in SARS-CoV-2 and influenza virus sequence data

被引:9
|
作者
Roder, A. E. [1 ]
Johnson, K. E. E. [1 ,2 ]
Knoll, M. [2 ]
Khalfan, M. [2 ]
Wang, B. [2 ]
Schultz-Cherry, S. [3 ]
Banakis, S. [1 ]
Kreitman, A. [1 ]
Mederos, C. [1 ]
Youn, J. -H. [4 ]
Mercado, R. [4 ]
Wang, W. [1 ]
Chung, M. [1 ]
Ruchnewitz, D. [5 ]
Samanovic, M. I. [6 ]
Mulligan, M. J. [6 ]
Laessig, M. [5 ]
Luksza, M. [7 ]
Das, S. [4 ]
Gresham, D. [2 ]
Ghedin, E. [1 ,2 ]
机构
[1] NIAID, Syst Genom Sect, Lab Parasit Dis, DIR,NIH, Bethesda, MD 20892 USA
[2] NYU, Ctr Genom & Syst Biol, Dept Biol, New York, NY 10012 USA
[3] St Jude Childrens Res Hosp, Dept Infect Dis, Memphis, TN USA
[4] NIH, Dept Lab Med, Bethesda, MD USA
[5] Univ Cologne, Inst Biol Phys, Cologne, Germany
[6] NYU, Langone Vaccine Ctr, Dept Med, New York, NY USA
[7] Icahn Sch Med Mt Sinai, Dept Oncol Sci, New York, NY USA
来源
MBIO | 2023年 / 14卷 / 04期
关键词
SARS-CoV-2; influenza; genomics; bioinformatics; RNA; SELECTION; EVOLUTION; MUTATION; CANCER;
D O I
10.1128/mbio.01046-23
中图分类号
Q93 [微生物学];
学科分类号
071005 ; 100705 ;
摘要
High error rates of viral RNA-dependent RNA polymerases lead to diverse intra-host viral populations during infection. Errors made during replication that are not strongly deleterious to the virus can lead to the generation of minority variants. However, accurate detection of minority variants in viral sequence data is complicated by errors introduced during sample preparation and data analysis. We used synthetic RNA controls and simulated data to test seven variant-calling tools across a range of allele frequencies and simulated coverages. We show that choice of variant caller and use of replicate sequencing have the most significant impact on single-nucleotide variant (SNV) discovery and demonstrate how both allele frequency and coverage thresholds impact both false discovery and false-negative rates. When replicates are not available, using a combination of multiple callers with more stringent cutoffs is recommended. We use these parameters to find minority variants in sequencing data from SARS-CoV-2 clinical specimens and provide guidance for studies of intra-host viral diversity using either single replicate data or data from technical replicates. Our study provides a framework for rigorous assessment of technical factors that impact SNV identification in viral samples and establishes heuristics that will inform and improve future studies of intra-host variation, viral diversity, and viral evolution. IMPORTANCEWhen viruses replicate inside a host cell, the virus replication machinery makes mistakes. Over time, these mistakes create mutations that result in a diverse population of viruses inside the host. Mutations that are neither lethal to the virus nor strongly beneficial can lead to minority variants that are minor members of the virus population. However, preparing samples for sequencing can also introduce errors that resemble minority variants, resulting in the inclusion of false-positive data if not filtered correctly. In this study, we aimed to determine the best methods for identification and quantification of these minority variants by testing the performance of seven commonly used variant-calling tools. We used simulated and synthetic data to test their performance against a true set of variants and then used these studies to inform variant identification in data from SARS-CoV-2 clinical specimens. Together, analyses of our data provide extensive guidance for future studies of viral diversity and evolution. When viruses replicate inside a host cell, the virus replication machinery makes mistakes. Over time, these mistakes create mutations that result in a diverse population of viruses inside the host. Mutations that are neither lethal to the virus nor strongly beneficial can lead to minority variants that are minor members of the virus population. However, preparing samples for sequencing can also introduce errors that resemble minority variants, resulting in the inclusion of false-positive data if not filtered correctly. In this study, we aimed to determine the best methods for identification and quantification of these minority variants by testing the performance of seven commonly used variant-calling tools. We used simulated and synthetic data to test their performance against a true set of variants and then used these studies to inform variant identification in data from SARS-CoV-2 clinical specimens. Together, analyses of our data provide extensive guidance for future studies of viral diversity and evolution.
引用
收藏
页数:17
相关论文
共 50 条
  • [11] Intra-host variation and evolutionary dynamics of SARS-CoV-2 populations in COVID-19 patients
    Wang, Yanqun
    Wang, Daxi
    Zhang, Lu
    Sun, Wanying
    Zhang, Zhaoyong
    Chen, Weijun
    Zhu, Airu
    Huang, Yongbo
    Xiao, Fei
    Yao, Jinxiu
    Gan, Mian
    Li, Fang
    Luo, Ling
    Huang, Xiaofang
    Zhang, Yanjun
    Sook-san Wong
    Cheng, Xinyi
    Ji, Jingkai
    Ou, Zhihua
    Xiao, Minfeng
    Li, Min
    Li, Jiandong
    Ren, Peidi
    Deng, Ziqing
    Zhong, Huanzi
    Xu, Xun
    Song, Tie
    Mok, Chris Ka Pun
    Peiris, Malik
    Zhong, Nanshan
    Zhao, Jingxian
    Li, Yimin
    Li, Junhua
    Zhao, Jincun
    GENOME MEDICINE, 2021, 13 (01)
  • [12] SARS-CoV-2 and Influenza Virus Coinfection
    Kaptan, Figen
    FLORA INFEKSIYON HASTALIKLARI VE KLINIK MIKROBIYOLOJI DERGISI, 2020, 25 (04): : 457 - 463
  • [13] Influenza Virus and SARS-CoV-2 Vaccines
    Sandor, Adam M.
    Sturdivant, Michael S.
    Ting, Jenny P. Y.
    JOURNAL OF IMMUNOLOGY, 2021, 206 (11) : 2509 - 2520
  • [14] Inter- and intra-host sequence diversity reveal the emergence of viral variants during an overwintering epidemic caused by dengue virus serotype 2 in southern Taiwan
    Ko, Hui-Ying
    Li, Yao-Tsun
    Chao, Day-Yu
    Chang, Yun-Cheng
    Li, Zheng-Rong T.
    Wang, Melody
    Kao, Chuan-Liang
    Wen, Tzai-Hung
    Shu, Pei-Yun
    Chang, Gwong-Jen J.
    King, Chwan-Chuen
    PLOS NEGLECTED TROPICAL DISEASES, 2018, 12 (10):
  • [15] Inter- and Intra-Host Viral Diversity in a Large Seasonal DENV2 Outbreak
    Romano, Camila Malta
    Lauck, Michael
    Salvador, Felipe S.
    Lima, Celia Rodrigues
    Villas-Boas, Lucy S.
    Araujo, Evaldo Stanislau A.
    Levi, Jose Eduardo
    Pannuti, Claudio Sergio
    O'Connor, David
    Kallas, Esper Georges
    PLOS ONE, 2013, 8 (08):
  • [16] SARS-CoV-2 and Influenza A Virus Coinfections in Ferrets
    Huang, Ying
    Skarlupka, Amanda L.
    Jang, Hyesun
    Blas-Machado, Uriel
    Holladay, Nathan
    Hogan, R. Jeffrey
    Ross, Ted M.
    JOURNAL OF VIROLOGY, 2022, 96 (05)
  • [17] Detection of SARS-CoV-2 intra-host recombination during superinfection with Alpha and Epsilon variants in New York City
    Wertheim, Joel O.
    Wang, Jade C.
    Leelawong, Mindy
    Martin, Darren P.
    Havens, Jennifer L.
    Chowdhury, Moinuddin A.
    Pekar, Jonathan E.
    Amin, Helly
    Arroyo, Anthony
    Awandare, Gordon A.
    Chow, Hoi Yan
    Gonzalez, Edimarlyn
    Luoma, Elizabeth
    Morang'a, Collins M.
    Nekrutenko, Anton
    Shank, Stephen D.
    Silver, Stefan
    Quashie, Peter K.
    Rakeman, Jennifer L.
    Ruiz, Victoria
    Torian, Lucia, V
    Vasylyeva, Tetyana, I
    Pond, Sergei L. Kosakovsky
    Hughes, Scott
    NATURE COMMUNICATIONS, 2022, 13 (01)
  • [18] Intra-Host SARS-CoV-2 Evolution in the Gut of Mucosally-Infected Chlorocebus aethiops (African Green Monkeys)
    Rowe, Lori A.
    Beddingfield, Brandon J.
    Goff, Kelly
    Killeen, Stephanie Z.
    Chirichella, Nicole R.
    Melton, Alexandra
    Roy, Chad J.
    Maness, Nicholas J.
    VIRUSES-BASEL, 2022, 14 (01):
  • [19] Systematic detection of co-infection and intra-host recombination in more than 2 million global SARS-CoV-2 samples
    Pipek, Orsolya Anna
    Medgyes-Horvath, Anna
    Steger, Jozsef
    Papp, Krisztian
    Visontai, David
    Koopmans, Marion
    Nieuwenhuijse, David
    Munnink, Bas B. Oude
    Csabai, Istvan
    NATURE COMMUNICATIONS, 2024, 15 (01)
  • [20] Co-infection with SARS-CoV-2 and influenza A virus
    Azekawa, Shuhei
    Namkoong, Ho
    Mitamura, Keiko
    Kawaoka, Yoshihiro
    Saito, Fumitake
    IDCASES, 2020, 20