Evaluation of in silico pathogenicity prediction tools for the classification of small in-frame indels

被引：5

作者：

Cannon, S. ^{[1
]}

Williams, M. ^{[1
]}

Gunning, A. C. ^{[1
]}

Wright, C. F. ^{[1
]}

机构：

[1] Univ Exeter, Royal Devon & Exeter Hosp, Fac Hlth & Life Sci, Dept Clin & Biomed Sci,Med Sch, Res Innovat Learning & Dev Bldg,Barrack Rd, Exeter EX2 5DW, England

来源：

BMC MEDICAL GENOMICS | 2023年 / 16卷 / 01期

基金：

英国科研创新办公室; 英国惠康基金; 英国医学研究理事会;

关键词：

Pathogenicity; In-frame indels; Variant interpretation; Pathogenicity prediction; VARIANTS; DELETION; DATABASE;

D O I：

10.1186/s12920-023-01454-6

中图分类号：

Q3 [遗传学];

学科分类号：

071007 ; 090102 ;

摘要：

BackgroundThe use of in silico pathogenicity predictions as evidence when interpreting genetic variants is widely accepted as part of standard variant classification guidelines. Although numerous algorithms have been developed and evaluated for classifying missense variants, in-frame insertions/deletions (indels) have been much less well studied.MethodsWe created a dataset of 3964 small (< 100 bp) indels predicted to result in in-frame amino acid insertions or deletions using data from gnomAD v3.1 (minor allele frequency of 1-5%), ClinVar and the Deciphering Developmental Disorders (DDD) study. We used this dataset to evaluate the performance of nine pathogenicity predictor tools: CADD, CAPICE, FATHMM-indel, MutPred-Indel, MutationTaster2021, PROVEAN, SIFT-indel, VEST-indel and VVP.ResultsOur dataset consisted of 2224 benign/likely benign and 1740 pathogenic/likely pathogenic variants from gnomAD (n = 809), ClinVar (n = 2882) and, DDD (n = 273). We were able to generate scores across all tools for 91% of the variants, with areas under the ROC curve (AUC) of 0.81-0.96 based on the published recommended thresholds. To avoid biases caused by inclusion of our dataset in the tools' training data, we also evaluated just DDD variants not present in either gnomAD or ClinVar (70 pathogenic and 81 benign). Using this subset, the AUC of all tools decreased substantially to 0.64-0.87. Several of the tools performed similarly however, VEST-indel had the highest AUCs of 0.93 (full dataset) and 0.87 (DDD subset).ConclusionsAlgorithms designed for predicting the pathogenicity of in-frame indels perform well enough to aid clinical variant classification in a similar manner to missense prediction tools.

引用

页数：9

共 6 条

[1] Evaluation of in silico pathogenicity prediction tools for the classification of small in-frame indels
S. Cannon
M. Williams
A. C. Gunning
C. F. Wright
BMC Medical Genomics, 16
[2] Insights on variant analysis in silico tools for pathogenicity prediction
Garcia, Felipe Antonio de Oliveira
de Andrade, Edilene Santos
Palmero, Edenir Inez
FRONTIERS IN GENETICS, 2022, 13
[3] Functional Characterization and In Silico Prediction Tools Improve the Pathogenicity Prediction of Novel Bile Acid Transporter Variants
Peng, Ziyue
Wang, Xin
Li, Ying
Ren, Yaqiong
Meng, Yuhuan
Sun, Liwei
Zhang, Zitong
Song, Yue
Xia, Yang
Shi, Lei
Yu, Shihui
Cheng, Liang
Zhang, Xue
CLINICAL GENETICS, 2025,
[4] Evaluation of in silico tools for the prediction of protein and peptide aggregation on diverse datasets
Prabakaran, R.
Rawat, Puneet
Kumar, Sandeep
Gromiha, M. Michael
BRIEFINGS IN BIOINFORMATICS, 2021, 22 (06)
[5] Performance of in silico prediction tools for the classification of rare BRCA1/2 missense variants in clinical diagnostics
Ernst, Corinna
Hahnen, Eric
Engel, Christoph
Nothnagel, Michael
Weber, Jonas
Schmutzler, Rita K.
Hauke, Jan
BMC MEDICAL GENOMICS, 2018, 11
[6] An ensemble machine learning-based performance evaluation identifies top In-Silico pathogenicity prediction methods that best classify driver mutations in cancer
Das, Subrata
Patel, Vatsal
Chakravarty, Shouvik
Ghosh, Arnab
Mukhopadhyay, Anirban
Biswas, Nidhan K.
BIODATA MINING, 2025, 18 (01):

← 1 →