Advanced variant classification framework reduces the false positive rate of predicted loss-of-function variants in population sequencing data

被引:11
作者
Singer-Berk, Moriel [1 ,2 ,3 ]
Gudmundsson, Sanna [1 ,2 ,3 ,4 ,5 ]
Baxter, Samantha [1 ,2 ,3 ]
Seaby, Eleanor G. [1 ,2 ,3 ,4 ,6 ]
England, Eleina [1 ,2 ,3 ,4 ]
Wood, Jordan C. [1 ,2 ,3 ]
Son, Rachel G. [1 ]
Watts, Nicholas A. [1 ]
Karczewski, Konrad J. [1 ,2 ,3 ]
Harrison, Steven M. [1 ,7 ]
Macarthur, Daniel G. [1 ,8 ,9 ,10 ]
Rehm, Heidi L. [1 ,2 ,3 ]
O'Donnell-Luria, Anne [1 ,2 ,3 ,4 ]
机构
[1] Broad Inst MIT & Harvard, Program Med & Populat Genet, Cambridge, MA 02142 USA
[2] Massachusetts Gen Hosp, Ctr Genom Med, Boston, MA 02114 USA
[3] Massachusetts Gen Hosp, Analyt & Translat Genet Unit, Boston, MA 02114 USA
[4] Harvard Med Sch, Boston Childrens Hosp, Div Genet & Genom, Boston, MA 02115 USA
[5] KTH Royal Inst Technol, Dept Gene Technol, Sci Life Lab, Stockholm, Sweden
[6] Univ Hosp Southampton, Genom Informat Grp, Southampton, England
[7] Ambry Genet, Aliso Viejo, CA USA
[8] Garvan Inst Med Res, Ctr Populat Genom, Sydney, NSW, Australia
[9] UNSW Sydney, Sydney, NSW, Australia
[10] Murdoch Childrens Res Inst, Ctr Populat Genom, Melbourne, Vic, Australia
关键词
SPLICE-SITE ACTIVATION; HUMAN GENOME; ANNOTATION; GENOTYPE; DECAY;
D O I
10.1016/j.ajhg.2023.08.005
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Predicted loss of function (pLoF) variants are often highly deleterious and play an important role in disease biology, but many pLoF variants may not result in loss of function (LoF). Here we present a framework that advances interpretation of pLoF variants in research and clinical settings by considering three categories of LoF evasion: (1) predicted rescue by secondary sequence properties, (2) uncertain biological relevance, and (3) potential technical artifacts. We also provide recommendations on adjustments to ACMG/AMP guidelines' PVS1 criterion. Applying this framework to all high-confidence pLoF variants in 22 genes associated with autosomal-recessive disease from the Genome Aggregation Database (gnomAD v.2.1.1) revealed predicted LoF evasion or potential artifacts in 27.3% (304/1,113) of variants. The major reasons were location in the last exon, in a homopolymer repeat, in a low proportion expressed across transcripts (pext) scored region, or the presence of cryptic in-frame splice rescues. Variants predicted to evade LoF or to be potential artifacts were enriched for ClinVar benign variants. PVS1 was downgraded in 99.4% (162/163) of pLoF variants predicted as likely not LoF/not LoF, with 17.2% (28/163) downgraded as a result of our framework, adding to previous guidelines. Variant pathogenicity was affected (mostly from likely pathogenic to VUS) in 20 (71.4%) of these 28 variants. This framework guides assessment of pLoF variants beyond standard annotation pipelines and substantially reduces false positive rates, which is key to ensure accurate LoF variant prediction in both a research and clinical setting.
引用
收藏
页码:1496 / 1508
页数:14
相关论文
共 58 条
[11]   The origin of a novel gene through overprinting in Escherichia coli [J].
Delaye, Luis ;
DeLuna, Alexander ;
Lazcano, Antonio ;
Becerra, Arturo .
BMC EVOLUTIONARY BIOLOGY, 2008, 8 (1)
[12]   Curating Clinically Relevant Transcripts for the Interpretation of Sequence Variants [J].
DiStefano, Marina T. ;
Hemphill, Sarah E. ;
Cushman, Brandon J. ;
Bowser, Mark J. ;
Hynes, Elizabeth ;
Grant, Andrew R. ;
Siegert, Rebecca K. ;
Oza, Andrea M. ;
Gonzalez, Michael A. ;
Amr, Sami S. ;
Rehm, Heidi L. ;
Abou Tayoun, Ahmad N. .
JOURNAL OF MOLECULAR DIAGNOSTICS, 2018, 20 (06) :789-801
[13]   Ab initio prediction of mutation-induced cryptic splice-site activation and exon skipping [J].
Divina, Petr ;
Kvitkovicova, Andrea ;
Buratti, Emanuele ;
Vorechovsky, Igor .
EUROPEAN JOURNAL OF HUMAN GENETICS, 2009, 17 (06) :759-765
[14]   How to get away with nonsense: Mechanisms and consequences of escape from nonsense-mediated RNA decay [J].
Dyle, Michael C. ;
Kolakada, Divya ;
Cortazar, Michael A. ;
Jagannathan, Sujatha .
WILEY INTERDISCIPLINARY REVIEWS-RNA, 2020, 11 (01)
[15]   Reducing INDEL calling errors in whole genome and exome sequencing data [J].
Fang, Han ;
Wu, Yiyang ;
Narzisi, Giuseppe ;
O'Rawe, Jason A. ;
Jimenez Barron, Laura T. ;
Rosenbaum, Julie ;
Ronemus, Michael ;
Iossifov, Ivan ;
Schatz, Michael C. ;
Lyon, Gholson J. .
GENOME MEDICINE, 2014, 6
[16]   Improving alignment accuracy on homopolymer regions for semiconductor-based sequencing technologies [J].
Feng, Weixing ;
Zhao, Sen ;
Xue, Dingkai ;
Song, Fengfei ;
Li, Ziwei ;
Chen, Duojiao ;
He, Bo ;
Hao, Yangyang ;
Wang, Yadong ;
Liu, Yunlong .
BMC GENOMICS, 2016, 17
[17]   Comparison of GENCODE and RefSeq gene annotation and the impact of reference geneset on variant effect prediction [J].
Frankish, Adam ;
Uszczynska, Barbara ;
Ritchie, Graham R. S. ;
Gonzalez, Jose M. ;
Pervouchine, Dmitri ;
Petryszak, Robert ;
Mudge, Jonathan M. ;
Fonseca, Nuno ;
Brazma, Alvis ;
Guigo, Roderic ;
Harrow, Jennifer .
BMC GENOMICS, 2015, 16
[18]   PRE-MESSENGER-RNA SPLICING [J].
GREEN, MR .
ANNUAL REVIEW OF GENETICS, 1986, 20 :671-708
[19]   Interpreting variants in genes affected by clonal hematopoiesis in population data [J].
Gudmundsson, Sanna ;
Carlston, Colleen M. ;
O'Donnell-Luria, Anne .
HUMAN GENETICS, 2024, 143 (04) :545-549
[20]   Variant interpretation using population databases: Lessons from gnomAD [J].
Gudmundsson, Sanna ;
Singer-Berk, Moriel ;
Watts, Nicholas A. ;
Phu, William ;
Goodrich, Julia K. ;
Solomonson, Matthew ;
Rehm, Heidi L. ;
MacArthur, Daniel G. ;
O'Donnell-Luria, Anne .
HUMAN MUTATION, 2022, 43 (08) :1012-1030