Evaluation of in silico pathogenicity prediction tools for the classification of small in-frame indels

被引:5
|
作者
Cannon, S. [1 ]
Williams, M. [1 ]
Gunning, A. C. [1 ]
Wright, C. F. [1 ]
机构
[1] Univ Exeter, Royal Devon & Exeter Hosp, Fac Hlth & Life Sci, Dept Clin & Biomed Sci,Med Sch, Res Innovat Learning & Dev Bldg,Barrack Rd, Exeter EX2 5DW, England
基金
英国科研创新办公室; 英国惠康基金; 英国医学研究理事会;
关键词
Pathogenicity; In-frame indels; Variant interpretation; Pathogenicity prediction; VARIANTS; DELETION; DATABASE;
D O I
10.1186/s12920-023-01454-6
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
BackgroundThe use of in silico pathogenicity predictions as evidence when interpreting genetic variants is widely accepted as part of standard variant classification guidelines. Although numerous algorithms have been developed and evaluated for classifying missense variants, in-frame insertions/deletions (indels) have been much less well studied.MethodsWe created a dataset of 3964 small (< 100 bp) indels predicted to result in in-frame amino acid insertions or deletions using data from gnomAD v3.1 (minor allele frequency of 1-5%), ClinVar and the Deciphering Developmental Disorders (DDD) study. We used this dataset to evaluate the performance of nine pathogenicity predictor tools: CADD, CAPICE, FATHMM-indel, MutPred-Indel, MutationTaster2021, PROVEAN, SIFT-indel, VEST-indel and VVP.ResultsOur dataset consisted of 2224 benign/likely benign and 1740 pathogenic/likely pathogenic variants from gnomAD (n = 809), ClinVar (n = 2882) and, DDD (n = 273). We were able to generate scores across all tools for 91% of the variants, with areas under the ROC curve (AUC) of 0.81-0.96 based on the published recommended thresholds. To avoid biases caused by inclusion of our dataset in the tools' training data, we also evaluated just DDD variants not present in either gnomAD or ClinVar (70 pathogenic and 81 benign). Using this subset, the AUC of all tools decreased substantially to 0.64-0.87. Several of the tools performed similarly however, VEST-indel had the highest AUCs of 0.93 (full dataset) and 0.87 (DDD subset).ConclusionsAlgorithms designed for predicting the pathogenicity of in-frame indels perform well enough to aid clinical variant classification in a similar manner to missense prediction tools.
引用
收藏
页数:9
相关论文
共 6 条
  • [1] Evaluation of in silico pathogenicity prediction tools for the classification of small in-frame indels
    S. Cannon
    M. Williams
    A. C. Gunning
    C. F. Wright
    BMC Medical Genomics, 16
  • [2] Insights on variant analysis in silico tools for pathogenicity prediction
    Garcia, Felipe Antonio de Oliveira
    de Andrade, Edilene Santos
    Palmero, Edenir Inez
    FRONTIERS IN GENETICS, 2022, 13
  • [3] Functional Characterization and In Silico Prediction Tools Improve the Pathogenicity Prediction of Novel Bile Acid Transporter Variants
    Peng, Ziyue
    Wang, Xin
    Li, Ying
    Ren, Yaqiong
    Meng, Yuhuan
    Sun, Liwei
    Zhang, Zitong
    Song, Yue
    Xia, Yang
    Shi, Lei
    Yu, Shihui
    Cheng, Liang
    Zhang, Xue
    CLINICAL GENETICS, 2025,
  • [4] Evaluation of in silico tools for the prediction of protein and peptide aggregation on diverse datasets
    Prabakaran, R.
    Rawat, Puneet
    Kumar, Sandeep
    Gromiha, M. Michael
    BRIEFINGS IN BIOINFORMATICS, 2021, 22 (06)
  • [5] Performance of in silico prediction tools for the classification of rare BRCA1/2 missense variants in clinical diagnostics
    Ernst, Corinna
    Hahnen, Eric
    Engel, Christoph
    Nothnagel, Michael
    Weber, Jonas
    Schmutzler, Rita K.
    Hauke, Jan
    BMC MEDICAL GENOMICS, 2018, 11
  • [6] An ensemble machine learning-based performance evaluation identifies top In-Silico pathogenicity prediction methods that best classify driver mutations in cancer
    Das, Subrata
    Patel, Vatsal
    Chakravarty, Shouvik
    Ghosh, Arnab
    Mukhopadhyay, Anirban
    Biswas, Nidhan K.
    BIODATA MINING, 2025, 18 (01):