A clinically validated whole genome pipeline for structural variant detection and analysis

被引:22
作者
Neerman, Nir [1 ]
Faust, Gregory [1 ]
Meeks, Naomi [1 ]
Modai, Shira [1 ]
Kalfon, Limor [2 ]
Falik-Zaccai, Tzipora [2 ,3 ]
Kaplun, Alexander [1 ]
机构
[1] Variantyx Inc, Framingham, MA 01867 USA
[2] Galilee Med Ctr, Inst Human Genet, Nahariyya, Israel
[3] Azrieli Fac Med, Bar Ilan, Safed, Israel
关键词
Whole genome sequencing; Structural variants; Clinical validation; Pipeline; Diagnostic console; WGS; CNV; Deletion; Duplication; Break point; COPY-NUMBER; DATABASE;
D O I
10.1186/s12864-019-5866-z
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
BackgroundWith the continuing decrease in cost of whole genome sequencing (WGS), we have already approached the point of inflection where WGS testing has become economically feasible, facilitating broader access to the benefits that are helping to define WGS as the new diagnostic standard. WGS provides unique opportunities for detection of structural variants; however, such analyses, despite being recognized by the research community, have not previously made their way into routine clinical practice.ResultsWe have developed a clinically validated pipeline for highly specific and sensitive detection of structural variants basing on 30X PCR-free WGS. Using a combination of breakpoint analysis of split and discordant reads, and read depth analysis, the pipeline identifies structural variants down to single base pair resolution. False positives are minimized using calculations for loss of heterozygosity and bi-modal heterozygous variant allele frequencies to enhance heterozygous deletion and duplication detection respectively. Compound and potential compound combinations of structural variants and small sequence changes are automatically detected. To facilitate clinical interpretation, identified variants are annotated with phenotype information derived from HGMD Professional and population allele frequencies derived from public and Variantyx allele frequency databases. Single base pair resolution enables easy visual inspection of potentially causal variants using the IGV genome browser as well as easy biochemical validation via PCR. Analytical and clinical sensitivity and specificity of the pipeline has been validated using analysis of Genome in a Bottle reference genomes and known positive samples confirmed by orthogonal sequencing technologies.ConclusionConsistent read depth of PCR-free WGS enables reliable detection of structural variants of any size. Annotation both on gene and variant level allows clinicians to match reported patient phenotype with detected variants and confidently report causative finding in all clinical cases used for validation.
引用
收藏
页数:8
相关论文
共 25 条
[1]   A global reference for human genetic variation [J].
Altshuler, David M. ;
Durbin, Richard M. ;
Abecasis, Goncalo R. ;
Bentley, David R. ;
Chakravarti, Aravinda ;
Clark, Andrew G. ;
Donnelly, Peter ;
Eichler, Evan E. ;
Flicek, Paul ;
Gabriel, Stacey B. ;
Gibbs, Richard A. ;
Green, Eric D. ;
Hurles, Matthew E. ;
Knoppers, Bartha M. ;
Korbel, Jan O. ;
Lander, Eric S. ;
Lee, Charles ;
Lehrach, Hans ;
Mardis, Elaine R. ;
Marth, Gabor T. ;
McVean, Gil A. ;
Nickerson, Deborah A. ;
Wang, Jun ;
Wilson, Richard K. ;
Boerwinkle, Eric ;
Doddapaneni, Harsha ;
Han, Yi ;
Korchina, Viktoriya ;
Kovar, Christie ;
Lee, Sandra ;
Muzny, Donna ;
Reid, Jeffrey G. ;
Zhu, Yiming ;
Chang, Yuqi ;
Feng, Qiang ;
Fang, Xiaodong ;
Guo, Xiaosen ;
Jian, Min ;
Jiang, Hui ;
Jin, Xin ;
Lan, Tianming ;
Li, Guoqing ;
Li, Jingxiang ;
Li, Yingrui ;
Liu, Shengmao ;
Liu, Xiao ;
Lu, Yao ;
Ma, Xuedi ;
Tang, Meifang ;
Wang, Bo .
NATURE, 2015, 526 (7571) :68-+
[2]   Control-FREEC: a tool for assessing copy number and allelic content using next-generation sequencing data [J].
Boeva, Valentina ;
Popova, Tatiana ;
Bleakley, Kevin ;
Chiche, Pierre ;
Cappo, Julie ;
Schleiermacher, Gudrun ;
Janoueix-Lerosey, Isabelle ;
Delattre, Olivier ;
Barillot, Emmanuel .
BIOINFORMATICS, 2012, 28 (03) :423-425
[3]  
Chaisson M.J. P., 2017, Multi-platform discovery of haplotype-resolved structural variation in human genomes, DOI DOI 10.1101/193144
[4]   A validation study of copy number variant (CNVs) detection to replace constitutional microarray from low resolution whole genome sequencing data [J].
Chaubey, Alka ;
Valencia, C. Alexander ;
Zeqiang, Ma ;
Irzyk, Gerard ;
Szekeres, Edward, Jr. ;
Markovic, Zdenek ;
Wang, Yang ;
Tanner, Alice ;
Collins, Christin ;
Hegde, Madhuri .
CANCER GENETICS, 2018, 224 :56-56
[5]   Manta: rapid detection of structural variants and indels for germline and cancer sequencing applications [J].
Chen, Xiaoyu ;
Schulz-Trieglaff, Ole ;
Shaw, Richard ;
Barnes, Bret ;
Schlesinger, Felix ;
Kallberg, Morten ;
Cox, Anthony J. ;
Kruglyakl, Semyon ;
Saunders, Christopher T. .
BIOINFORMATICS, 2016, 32 (08) :1220-1222
[6]  
Chiang C, 2015, NAT METHODS, V12, P966, DOI [10.1038/NMETH.3505, 10.1038/nmeth.3505]
[7]  
Falik Zaccai TC, NOVEL STRUCTUR UNPUB
[8]   Structural variation detection using next-generation sequencing data A comparative technical review [J].
Guan, Peiyong ;
Sung, Wing-Kin .
METHODS, 2016, 102 :36-49
[9]  
INSERM, 1999, ORPHANET ONLINE RARE
[10]  
Isakov O, 2013, METHODS MOL BIOL, V1038, P137, DOI 10.1007/978-1-62703-514-9_8