Estimating genome-wide copy number using allele-specific mixture models

Cited by: 13
Authors
Wang, Wenyi [1 ]
Carvalho, Benilton [1 ]
Miller, Nathaniel D. [2 ]
Pevsner, Jonathan [2 ]
Chakravarti, Aravinda [3 ]
Irizarry, Rafael A. [1 ]
Affiliations
[1] Johns Hopkins Bloomberg Sch Publ Hlth, Dept Biostat, Baltimore, MD 21205 USA
[2] Kennedy Krieger Inst, Dept Neurol, Baltimore, MD USA
[3] Johns Hopkins Sch Med, McKusick Nathans Inst Genet Med, Baltimore, MD USA
Keywords
algorithms; computational molecular biology; DNA arrays
DOI
10.1089/cmb.2007.0148
Chinese Library Classification number
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
Genomic changes such as copy number alterations are among the major underlying causes of human phenotypic variation in normal and disease subjects. Array comparative genomic hybridization (CGH) technology was developed to detect copy number changes in a high-throughput fashion. However, this technology provides only >30-kb resolution, which limits the ability to detect copy number alterations spanning small regions. Higher-resolution technologies such as single nucleotide polymorphism (SNP) microarrays allow detection of copy number alterations as small as several thousand base pairs. Unfortunately, strong probe effects and variation introduced by sample preparation procedures have made single-point copy number estimates too imprecise to be useful. Various groups have proposed statistical procedures that pool data from neighboring locations to improve precision. However, these procedures need to average across relatively large regions to work effectively, which greatly reduces resolution. Recently, regression-type models that account for probe effects have been proposed and appear to improve accuracy as well as precision. In this paper, we propose a mixture model solution, specifically designed for single-point estimation, that provides various advantages over the existing methodology. We use a 314-sample database to motivate and fit models for the conditional distribution of the observed intensities given allele-specific copy number. We can then compute posterior probabilities that provide a useful prediction rule as well as a confidence measure for each call. Software to implement this procedure will be available in the Bioconductor oligo package (www.bioconductor.org).
Pages: 857-866
Number of pages: 10
Related papers
50 in total
  • [1] Estimating genome-wide copy number using allele specific mixture models
    Wang, Wenyi
    Carvalho, Benilton
    Miller, Nate
    Pevsner, Jonathan
    Chakravarti, Aravinda
    Irizarry, Rafael A.
    RESEARCH IN COMPUTATIONAL MOLECULAR BIOLOGY, PROCEEDINGS, 2007, 4453 : 137 - +
  • [2] Genome-wide survey of allele-specific splicing in humans
    Victoria Nembaware
    Bukiwe Lupindo
    Katherine Schouest
    Charles Spillane
    Konrad Scheffler
    Cathal Seoighe
    BMC Genomics, 9
  • [3] Genome-wide survey of allele-specific splicing in humans
    Nembaware, Victoria
    Lupindo, Bukiwe
    Schouest, Katherine
    Spillane, Charles
    Scheffler, Konrad
    Seoighe, Cathal
    BMC GENOMICS, 2008, 9 (1)
  • [4] Genome-wide allele-specific analysis: insights into regulatory variation
    Tomi Pastinen
    Nature Reviews Genetics, 2010, 11 : 533 - 538
  • [5] A Genome-Wide Study of Allele-Specific Expression in Colorectal Cancer
    Liu, Zhi
    Dong, Xiao
    Li, Yixue
    FRONTIERS IN GENETICS, 2018, 9
  • [7] Allele-specific copy number analysis of tumors
    Van Loo, Peter
    Nordgard, Silje H.
    Lingjaerde, Ole Christian
    Russnes, Hege G.
    Rye, Inga H.
    Sun, Wei
    Weigman, Victor J.
    Marynen, Peter
    Zetterberg, Anders
    Naume, Bjorn
    Perou, Charles M.
    Borresen-Dale, Anne-Lise
    Kristensen, Vessela N.
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2010, 107 (39) : 16910 - 16915
  • [8] Allele-specific copy number analysis of breast carcinomas
    Van Loo, P.
    Nordgard, S.
    Lingjaerde, O. C.
    Russnes, H. G.
    Rye, I. H.
    Sun, W.
    Naume, B.
    Perou, C. M.
    Borresen-Dale, A. L.
    Kristensen, V. N.
    EJC SUPPLEMENTS, 2010, 8 (05) : 200 - 200
  • [9] Patchwork: allele-specific copy number analysis of whole-genome sequenced tumor tissue
    Markus Mayrhofer
    Sebastian DiLorenzo
    Anders Isaksson
    Genome Biology, 14
  • [10] Patchwork: allele-specific copy number analysis of whole-genome sequenced tumor tissue
    Mayrhofer, Markus
    DiLorenzo, Sebastian
    Isaksson, Anders
    GENOME BIOLOGY, 2013, 14 (03):