Characterizations of SARS-CoV-2 mutational profile, spike protein stability and viral transmission

Cited by: 140
Authors
Laha, Sayantan [1]
Chakraborty, Joyeeta [1]
Das, Shantanab [1]
Manna, Soumen Kanti [4]
Biswas, Sampa [2, 3]
Chatterjee, Raghunath [1]
Affiliations
[1] Indian Stat Inst, Human Genet Unit, 203 B T Rd, Kolkata 700108, India
[2] Saha Inst Nucl Phys, Crystallog & Mol Biol Div, 1-AF Bidhannagar, Kolkata 700064, India
[3] Homi Bhabha Natl Inst, Mumbai 400094, Maharashtra, India
[4] Saha Inst Nucl Phys HBNI, Biophys & Struct Genom Div, 1-AF Bidhannagar, Kolkata 700064, India
Keywords
SARS-CoV-2; Hot-spot mutations; Spike glycoprotein; Frequent mutation; Protein stability; HYDROGEN-BONDS; RNA-SYNTHESIS; CORONAVIRUS; IDENTIFICATION; ENERGETICS;
DOI
10.1016/j.meegid.2020.104445
Chinese Library Classification number
R51 [Infectious diseases];
Discipline classification code
100401;
Abstract
The recent pandemic of SARS-CoV-2 infection has affected more than 3.0 million people worldwide, with more than 200 thousand reported deaths. The SARS-CoV-2 genome is capable of accumulating mutations rapidly as the virus spreads. Whole-genome sequencing data offer a wide range of opportunities to study mutation dynamics. The growing amount of whole-genome sequence data for SARS-CoV-2 prompted us to explore the mutation profile across the genome, to assess genome diversity, and to investigate the implications of those mutations for protein stability and viral transmission. We identified frequently mutated residues by aligning ~660 SARS-CoV-2 genomes and validated them in 10,000 datasets available in GISAID Nextstrain. We further evaluated the potential effect of these frequently mutated residues on the structural stability of the spike glycoprotein and their possible functional consequences in other proteins. Among the 11 genes, surface glycoprotein, nucleocapsid, ORF1ab, and ORF8 showed frequent mutations, while envelope, membrane, ORF6, ORF7a and ORF7b were conserved in terms of amino acid substitutions. Combined analysis of the frequently mutated residues identified 20 viral variants, among which 12 specific combinations comprised more than 97% of the isolates considered for the analysis. Some of the mutations across different proteins co-occurred, suggesting structural and/or functional interactions among different SARS-CoV-2 proteins and their involvement in adaptability and viral transmission. Analysis of the structural stability of surface glycoprotein mutants indicated that specific variants are viable and more likely to be temporally and spatially distributed across the globe. A similar empirical analysis of other proteins indicated important functional implications of several variants.
Identification of frequently mutated variants among COVID-19 patients might be useful for better clinical management, contact tracing, and containment of the disease.
Pages: 11