High-throughput discovery of rare insertions and deletions in large cohorts

被引:41
|
作者
Vallania, Francesco L. M. [1 ]
Druley, Todd E. [1 ]
Ramos, Enrique [1 ]
Wang, Jue [1 ]
Borecki, Ingrid [1 ]
Province, Michael [1 ]
Mitra, Robi D. [1 ]
机构
[1] Washington Univ, Sch Med, Dept Genet, Ctr Genome Sci & Syst Biol, St Louis, MO 63108 USA
关键词
COLORECTAL ADENOMAS; VARIANTS; DNA; SUSCEPTIBILITY; HYPOTHESIS; CONTRIBUTE; MUTATIONS; EVOLUTION; DISEASES; GENOME;
D O I
10.1101/gr.109157.110
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Pooled-DNA sequencing strategies enable fast, accurate, and cost-effect detection of rare variants, but current approaches are not able to accurately identify short insertions and deletions (indels), despite their pivotal role in genetic disease. Furthermore, the sensitivity and specificity of these methods depend on arbitrary, user-selected significance thresholds, whose optimal values change from experiment to experiment. Here, we present a combined experimental and computational strategy that combines a synthetically engineered DNA library inserted in each run and a new computational approach named SPLINTER that detects and quantifies short indels and substitutions in large pools. SPLINTER integrates information from the synthetic library to select the optimal significance thresholds for every experiment. We show that SPLINTER detects indels (up to 4 bp) and substitutions in large pools with high sensitivity and specificity, accurately quantifies variant frequency (r = 0.999), and compares favorably with existing algorithms for the analysis of pooled sequencing data. We applied our approach to analyze a cohort of 1152 individuals, identifying 48 variants and validating 14 of 14 (100%) predictions by individual genotyping. Thus, our strategy provides a novel and sensitive method that will speed the discovery of novel disease-causing rare variants.
引用
收藏
页码:1711 / 1718
页数:8
相关论文
共 50 条
  • [21] A probabilistic approach for SNP discovery in high-throughput human resequencing data
    Hoberman, Rose
    Dias, Joana
    Ge, Bing
    Harmsen, Eef
    Mayhew, Michael
    Verlaan, Dominique J.
    Kwan, Tony
    Dewar, Ken
    Blanchette, Mathieu
    Pastinen, Tomi
    GENOME RESEARCH, 2009, 19 (09) : 1542 - 1552
  • [22] Discovery of Two Novel Oxidases Using a High-Throughput Activity Screen
    Rembeza, Elzbieta
    Boverio, Alessandro
    Fraaije, Marco W.
    Engqvist, Martin K. M.
    CHEMBIOCHEM, 2022, 23 (02)
  • [23] High-Throughput Aluminum Alloy Discovery Using Laser Additive Manufacturing
    Pan, Qingyu
    Kapoor, Monica
    Mileski, Sean
    Carsley, John
    Lou, Xiaoyuan
    LIGHT METALS 2021, 50TH EDITION, 2021, : 140 - 146
  • [24] High-throughput genetic characterization of a cohort of Brugada syndrome patients
    Di Resta, Chiara
    Pietrelli, Alessandro
    Sala, Simone
    Della Bella, Paolo
    De Bellis, Gianluca
    Ferrari, Maurizio
    Bordoni, Roberta
    Benedetti, Sara
    HUMAN MOLECULAR GENETICS, 2015, 24 (20) : 5828 - 5835
  • [25] Genetics of neurodegenerative diseases: insights from high-throughput resequencing
    Tsuji, Shoji
    HUMAN MOLECULAR GENETICS, 2010, 19 : R65 - R70
  • [26] High-Throughput Bubble Screening Method for Combinatorial Discovery of Electrocatalysts for Water Splitting
    Xiang, Chengxiang
    Suram, Santosh K.
    Haber, Joel A.
    Guevarra, Dan W.
    Soedarmadji, Ed
    Jin, Jian
    Gregoire, John M.
    ACS COMBINATORIAL SCIENCE, 2014, 16 (02) : 47 - 52
  • [27] High-throughput base editing: a promising technology for precision medicine and drug discovery
    Sachinidis, Agapios
    SIGNAL TRANSDUCTION AND TARGETED THERAPY, 2021, 6 (01)
  • [28] DiNAMO: highly sensitive DNA motif discovery in high-throughput sequencing data
    Chadi Saad
    Laurent Noé
    Hugues Richard
    Julie Leclerc
    Marie-Pierre Buisine
    Hélène Touzet
    Martin Figeac
    BMC Bioinformatics, 19
  • [29] DiNAMO: highly sensitive DNA motif discovery in high-throughput sequencing data
    Saad, Chadi
    Noe, Laurent
    Richard, Hugues
    Leclerc, Julie
    Buisine, Marie-Pierre
    Touzet, Helene
    Figeac, Martin
    BMC BIOINFORMATICS, 2018, 19
  • [30] Odintifier - A computational method for identifying insertions of organellar origin from modern and ancient high-throughput sequencing data based on haplotype phasing
    Castruita, Jose Alfredo Samaniego
    Mendoza, Marie Lisandra Zepeda
    Barnett, Ross
    Wales, Nathan
    Gilbert, M. Thomas P.
    BMC BIOINFORMATICS, 2015, 16