CODEX2: full-spectrum copy number variation detection by high-throughput DNA sequencing

被引:41
|
作者
Jiang, Yuchao [1 ,2 ,3 ]
Wang, Rujin [1 ]
Urrutia, Eugene [1 ]
Anastopoulos, Ioannis N. [4 ]
Nathanson, Katherine L. [4 ,5 ]
Zhang, Nancy R. [6 ]
机构
[1] Univ N Carolina, Dept Biostat, Gillings Sch Global Publ Hlth, Chapel Hill, NC 27599 USA
[2] Univ N Carolina, Dept Genet, Sch Med, Chapel Hill, NC 27599 USA
[3] Univ N Carolina, Lineberger Comprehens Canc Ctr, Chapel Hill, NC 27599 USA
[4] Univ Penn, Dept Med, Div Translat Med & Human Genet, Perelman Sch Med, Philadelphia, PA 19104 USA
[5] Univ Penn, Perelman Sch Med, Abramson Canc Ctr, Philadelphia, PA 19104 USA
[6] Univ Penn, Wharton Sch, Dept Stat, Philadelphia, PA 19104 USA
来源
GENOME BIOLOGY | 2018年 / 19卷
基金
美国国家卫生研究院;
关键词
Copy number variation; Normalization; Next-generation sequencing; Latent factor; Negative control; DISCOVERY; VARIANTS; COMMON; GENES;
D O I
10.1186/s13059-018-1578-y
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
High-throughput DNA sequencing enables detection of copy number variations (CNVs) on the genome-wide scale with finer resolution compared to array-based methods but suffers from biases and artifacts that lead to false discoveries and low sensitivity. We describe CODEX2, as a statistical framework for full-spectrum CNV profiling that is sensitive for variants with both common and rare population frequencies and that is applicable to study designs with and without negative control samples. We demonstrate and evaluate CODEX2 on whole-exome and targeted sequencing data, where biases are the most prominent. CODEX2 outperforms existing methods and, in particular, significantly improves sensitivity for common CNVs.
引用
收藏
页数:13
相关论文
共 50 条
  • [21] reconCNV: interactive visualization of copy number data from high-throughput sequencing
    Chandramohan, Raghu
    Kakkar, Nipun
    Roy, Angshumoy
    Parsons, D. Williams
    BIOINFORMATICS, 2021, 37 (08) : 1164 - 1167
  • [22] Inferring Variation in Copy Number Using High Throughput Sequencing Data in R
    Knaus, Brian J.
    Gruenwald, Niklaus J.
    FRONTIERS IN GENETICS, 2018, 9
  • [23] FACETS: allele-specific copy number and clonal heterogeneity analysis tool for high-throughput DNA sequencing
    Shen, Ronglai
    Seshan, Venkatraman E.
    NUCLEIC ACIDS RESEARCH, 2016, 44 (16)
  • [24] BIC-seq: a fast algorithm for detection of copy number alterations based on high-throughput sequencing data
    Xi, Ruibin
    Luquette, Joe
    Hadjipanayis, Angela
    Kim, Tae-Min
    Park, Peter J.
    GENOME BIOLOGY, 2010, 11
  • [25] BIC-seq: a fast algorithm for detection of copy number alterations based on high-throughput sequencing data
    Ruibin Xi
    Joe Luquette
    Angela Hadjipanayis
    Tae-Min Kim
    Peter J Park
    Genome Biology, 11 (Suppl 1)
  • [26] High-throughput sequencing of high copy number plasmids from bacterial cultures by heat lysis
    Ganguly, T
    Chen, PQ
    Teetsel, R
    Zhang, LP
    Papaioannou, E
    Cianciarulo, J
    BIOTECHNIQUES, 2005, 39 (03) : 304 - +
  • [27] Detection of copy number variation for chromosomal sliding windows using high throughput sequencing data in the R environment
    Knaus, B. J.
    Grunwald, N. J.
    PHYTOPATHOLOGY, 2018, 108 (10) : 108 - 108
  • [28] Toward accurate high-throughput SNP genotyping in the presence of inherited copy number variation
    Laura E MacConaill
    Micheala A Aldred
    Xincheng Lu
    Thomas LaFramboise
    BMC Genomics, 8
  • [29] Toward accurate high-throughput SNP genotyping in the presence of inherited copy number variation
    MacConaill, Laura E.
    Aldred, Micheala A.
    Lu, Xincheng
    LaFramboise, Thomas
    BMC GENOMICS, 2007, 8 (1) : 211
  • [30] The CNVrd2 package: measurement of copy number at complex loci using high-throughput sequencing data
    Nguyen, Hoang T.
    Merriman, Tony R.
    Black, Michael A.
    FRONTIERS IN GENETICS, 2014, 5