DVA: predicting the functional impact of single nucleotide missense variants

被引:0
作者
Wang, Dong [1 ]
Li, Jie [1 ]
Wang, Edwin [2 ]
Wang, Yadong [1 ]
机构
[1] Harbin Inst Technol Harbin, Sch Comp Sci & Technol, Harbin, Heilongjiang, Peoples R China
[2] Univ Calgary, Cumming Sch Med, Calgary, AB, Canada
关键词
Missense variants; Functional impact; Variant annotation; Disease-related; BENCHMARK DATABASE; PATHOGENICITY; MUTATIONS; FREQUENCY; EXOMES;
D O I
10.1186/s12859-024-05709-6
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
BackgroundIn the past decade, single nucleotide variants (SNVs) have been identified as having a significant relationship with the development and treatment of diseases. Among them, prioritizing missense variants for further functional impact investigation is an essential challenge in the study of common disease and cancer. Although several computational methods have been developed to predict the functional impacts of variants, the predictive ability of these methods is still insufficient in the Mendelian and cancer missense variants.ResultsWe present a novel prediction method called the disease-related variant annotation (DVA) method that predicts the effect of missense variants based on a comprehensive feature set of variants, notably, the allele frequency and protein-protein interaction network feature based on graph embedding. Benchmarked against datasets of single nucleotide missense variants, the DVA method outperforms the state-of-the-art methods by up to 0.473 in the area under receiver operating characteristic curve. The results demonstrate that the proposed method can accurately predict the functional impact of single nucleotide missense variants and substantially outperforms existing methods.ConclusionsDVA is an effective framework for identifying the functional impact of disease missense variants based on a comprehensive feature set. Based on different datasets, DVA shows its generalization ability and robustness, and it also provides innovative ideas for the study of the functional mechanism and impact of SNVs.
引用
收藏
页数:15
相关论文
共 47 条
[1]   A method and server for predicting damaging missense mutations [J].
Adzhubei, Ivan A. ;
Schmidt, Steffen ;
Peshkin, Leonid ;
Ramensky, Vasily E. ;
Gerasimova, Anna ;
Bork, Peer ;
Kondrashov, Alexey S. ;
Sunyaev, Shamil R. .
NATURE METHODS, 2010, 7 (04) :248-249
[2]   ClinPred: Prediction Tool to Identify Disease-Relevant Nonsynonymous Single-Nucleotide Variants [J].
Alirezaie, Najmeh ;
Kernohan, Kristin D. ;
Hartley, Taila ;
Majewski, Jacek ;
Hocking, Toby Dylan .
AMERICAN JOURNAL OF HUMAN GENETICS, 2018, 103 (04) :474-483
[3]   Emergence of diverse biochemical activities in evolutionarily conserved structural scaffolds of proteins [J].
Anantharaman, V ;
Aravind, L ;
Koonin, EV .
CURRENT OPINION IN CHEMICAL BIOLOGY, 2003, 7 (01) :12-20
[4]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[5]  
Burley SK, 2017, METHODS MOL BIOL, V1606, P627, DOI 10.1007/978-1-4939-7000-1_26
[6]   MISTIC: A prediction tool to reveal disease-relevant deleterious missense variants [J].
Chennen, Kirsley ;
Weber, Thomas ;
Lornage, Xaviere ;
Kress, Arnaud ;
Bohm, Johann ;
Thompson, Julie ;
Laporte, Jocelyn ;
Poch, Olivier .
PLOS ONE, 2020, 15 (07)
[7]   Predicting the Functional Effect of Amino Acid Substitutions and Indels [J].
Choi, Yongwook ;
Sims, Gregory E. ;
Murphy, Sean ;
Miller, Jason R. ;
Chan, Agnes P. .
PLOS ONE, 2012, 7 (10)
[8]   The international Genome sample resource (IGSR): A worldwide collection of genome variation incorporating the 1000 Genomes Project data [J].
Clarke, Laura ;
Fairley, Susan ;
Zheng-Bradley, Xiangqun ;
Streeter, Ian ;
Perry, Emily ;
Lowy, Ernesto ;
Tasse, Anne-Marie ;
Flicek, Paul .
NUCLEIC ACIDS RESEARCH, 2017, 45 (D1) :D854-D859
[9]  
CYGLER M, 1993, PROTEIN SCI, V2, P366
[10]   Parkes Weber syndrome associated with two somatic pathogenic variants in RASA1 [J].
Daboub, Josue A. Flores ;
Grimmer, Johanes Fred ;
Frigerio, Alice ;
Wooderchak-Donahue, Whitney ;
Arnold, Ryan ;
Szymanski, Jeff ;
Longo, Nicola ;
Bayrak-Toydemir, Pinar .
COLD SPRING HARBOR MOLECULAR CASE STUDIES, 2020, 6 (04)