DNA copy number variant;
Next generation sequencing data;
Optimality;
Robust segment detector;
Robust segment identifier;
COPY-NUMBER VARIATION;
STRUCTURAL VARIATION;
RESOLUTION;
SEQ;
ALGORITHM;
D O I:
10.1111/j.1467-9868.2012.01028.x
中图分类号:
O21 [概率论与数理统计];
C8 [统计学];
学科分类号:
020208 ;
070103 ;
0714 ;
摘要:
. Copy number variants (CNVs) are alternations of DNA of a genome that result in the cell having less or more than two copies of segments of the DNA. CNVs correspond to relatively large regions of the genome, ranging from about one kilobase to several megabases, that are deleted or duplicated. Motivated by CNV analysis based on next generation sequencing data, we consider the problem of detecting and identifying sparse short segments hidden in a long linear sequence of data with an unspecified noise distribution. We propose a computationally efficient method that provides a robust and near optimal solution for segment identification over a wide range of noise distributions. We theoretically quantify the conditions for detecting the segment signals and show that the method near optimally estimates the signal segments whenever it is possible to detect their existence. Simulation studies are carried out to demonstrate the efficiency of the method under various noise distributions. We present results from a CNV analysis of a HapMap Yoruban sample to illustrate the theory and the methods further.
机构:
Univ Washington, Sch Med, Dept Genome Sci, Seattle, WA 98195 USA
Univ Washington, Howard Hughes Med Inst, Seattle, WA 98195 USAUniv Washington, Sch Med, Dept Genome Sci, Seattle, WA 98195 USA
Alkan, Can
Coe, Bradley P.
论文数: 0引用数: 0
h-index: 0
机构:
Univ Washington, Sch Med, Dept Genome Sci, Seattle, WA 98195 USAUniv Washington, Sch Med, Dept Genome Sci, Seattle, WA 98195 USA
Coe, Bradley P.
Eichler, Evan E.
论文数: 0引用数: 0
h-index: 0
机构:
Univ Washington, Sch Med, Dept Genome Sci, Seattle, WA 98195 USA
Univ Washington, Howard Hughes Med Inst, Seattle, WA 98195 USAUniv Washington, Sch Med, Dept Genome Sci, Seattle, WA 98195 USA
机构:
Broad Inst MIT & Harvard, Cambridge, MA 02142 USA
Dana Farber Canc Inst, Dept Med Oncol, Boston, MA 02115 USA
Dana Farber Canc Inst, Ctr Canc Genome Discovery, Boston, MA 02115 USABroad Inst MIT & Harvard, Cambridge, MA 02142 USA
Chiang, Derek Y.
Getz, Gad
论文数: 0引用数: 0
h-index: 0
机构:
Broad Inst MIT & Harvard, Cambridge, MA 02142 USABroad Inst MIT & Harvard, Cambridge, MA 02142 USA
Getz, Gad
Jaffe, David B.
论文数: 0引用数: 0
h-index: 0
机构:
Broad Inst MIT & Harvard, Cambridge, MA 02142 USABroad Inst MIT & Harvard, Cambridge, MA 02142 USA
Jaffe, David B.
O'Kelly, Michael J. T.
论文数: 0引用数: 0
h-index: 0
机构:
Broad Inst MIT & Harvard, Cambridge, MA 02142 USABroad Inst MIT & Harvard, Cambridge, MA 02142 USA
O'Kelly, Michael J. T.
Zhao, Xiaojun
论文数: 0引用数: 0
h-index: 0
机构:
Novartis Inst Biomed Res, Cambridge, MA 02139 USABroad Inst MIT & Harvard, Cambridge, MA 02142 USA
Zhao, Xiaojun
Carter, Scott L.
论文数: 0引用数: 0
h-index: 0
机构:
Broad Inst MIT & Harvard, Cambridge, MA 02142 USA
Harvard Mit Div Hlth Sci & Technol, Cambridge, MA 02139 USABroad Inst MIT & Harvard, Cambridge, MA 02142 USA
Carter, Scott L.
Russ, Carsten
论文数: 0引用数: 0
h-index: 0
机构:
Broad Inst MIT & Harvard, Cambridge, MA 02142 USABroad Inst MIT & Harvard, Cambridge, MA 02142 USA
Russ, Carsten
Nusbaum, Chad
论文数: 0引用数: 0
h-index: 0
机构:
Broad Inst MIT & Harvard, Cambridge, MA 02142 USABroad Inst MIT & Harvard, Cambridge, MA 02142 USA
Nusbaum, Chad
Meyerson, Matthew
论文数: 0引用数: 0
h-index: 0
机构:
Broad Inst MIT & Harvard, Cambridge, MA 02142 USA
Dana Farber Canc Inst, Dept Med Oncol, Boston, MA 02115 USA
Dana Farber Canc Inst, Ctr Canc Genome Discovery, Boston, MA 02115 USABroad Inst MIT & Harvard, Cambridge, MA 02142 USA
Meyerson, Matthew
Lander, Eric S.
论文数: 0引用数: 0
h-index: 0
机构:Broad Inst MIT & Harvard, Cambridge, MA 02142 USA
机构:
Univ Washington, Sch Med, Dept Genome Sci, Seattle, WA 98195 USA
Univ Washington, Howard Hughes Med Inst, Seattle, WA 98195 USAUniv Washington, Sch Med, Dept Genome Sci, Seattle, WA 98195 USA
Alkan, Can
Coe, Bradley P.
论文数: 0引用数: 0
h-index: 0
机构:
Univ Washington, Sch Med, Dept Genome Sci, Seattle, WA 98195 USAUniv Washington, Sch Med, Dept Genome Sci, Seattle, WA 98195 USA
Coe, Bradley P.
Eichler, Evan E.
论文数: 0引用数: 0
h-index: 0
机构:
Univ Washington, Sch Med, Dept Genome Sci, Seattle, WA 98195 USA
Univ Washington, Howard Hughes Med Inst, Seattle, WA 98195 USAUniv Washington, Sch Med, Dept Genome Sci, Seattle, WA 98195 USA
机构:
Broad Inst MIT & Harvard, Cambridge, MA 02142 USA
Dana Farber Canc Inst, Dept Med Oncol, Boston, MA 02115 USA
Dana Farber Canc Inst, Ctr Canc Genome Discovery, Boston, MA 02115 USABroad Inst MIT & Harvard, Cambridge, MA 02142 USA
Chiang, Derek Y.
Getz, Gad
论文数: 0引用数: 0
h-index: 0
机构:
Broad Inst MIT & Harvard, Cambridge, MA 02142 USABroad Inst MIT & Harvard, Cambridge, MA 02142 USA
Getz, Gad
Jaffe, David B.
论文数: 0引用数: 0
h-index: 0
机构:
Broad Inst MIT & Harvard, Cambridge, MA 02142 USABroad Inst MIT & Harvard, Cambridge, MA 02142 USA
Jaffe, David B.
O'Kelly, Michael J. T.
论文数: 0引用数: 0
h-index: 0
机构:
Broad Inst MIT & Harvard, Cambridge, MA 02142 USABroad Inst MIT & Harvard, Cambridge, MA 02142 USA
O'Kelly, Michael J. T.
Zhao, Xiaojun
论文数: 0引用数: 0
h-index: 0
机构:
Novartis Inst Biomed Res, Cambridge, MA 02139 USABroad Inst MIT & Harvard, Cambridge, MA 02142 USA
Zhao, Xiaojun
Carter, Scott L.
论文数: 0引用数: 0
h-index: 0
机构:
Broad Inst MIT & Harvard, Cambridge, MA 02142 USA
Harvard Mit Div Hlth Sci & Technol, Cambridge, MA 02139 USABroad Inst MIT & Harvard, Cambridge, MA 02142 USA
Carter, Scott L.
Russ, Carsten
论文数: 0引用数: 0
h-index: 0
机构:
Broad Inst MIT & Harvard, Cambridge, MA 02142 USABroad Inst MIT & Harvard, Cambridge, MA 02142 USA
Russ, Carsten
Nusbaum, Chad
论文数: 0引用数: 0
h-index: 0
机构:
Broad Inst MIT & Harvard, Cambridge, MA 02142 USABroad Inst MIT & Harvard, Cambridge, MA 02142 USA
Nusbaum, Chad
Meyerson, Matthew
论文数: 0引用数: 0
h-index: 0
机构:
Broad Inst MIT & Harvard, Cambridge, MA 02142 USA
Dana Farber Canc Inst, Dept Med Oncol, Boston, MA 02115 USA
Dana Farber Canc Inst, Ctr Canc Genome Discovery, Boston, MA 02115 USABroad Inst MIT & Harvard, Cambridge, MA 02142 USA
Meyerson, Matthew
Lander, Eric S.
论文数: 0引用数: 0
h-index: 0
机构:Broad Inst MIT & Harvard, Cambridge, MA 02142 USA