Reconstructing DNA copy number by joint segmentation of multiple sequences

被引:11
|
作者
Zhang, Zhongyang [1 ]
Lange, Kenneth [2 ]
Sabatti, Chiara [3 ]
机构
[1] Univ Calif Los Angeles, Dept Stat, Los Angeles, CA USA
[2] Univ Calif Los Angeles, Dept Human Genet Biomath & Stat, Los Angeles, CA USA
[3] Stanford Univ, Dept Hlth Res & Policy & Stat, Stanford, CA 94305 USA
来源
BMC BIOINFORMATICS | 2012年 / 13卷
关键词
Copy number variant; Copy number polymorphism; Fused lasso; Group fused lasso; MM algorithm; CIRCULAR BINARY SEGMENTATION; HIDDEN MARKOV-MODELS; GENOTYPE CALLS; LASSO; NORMALIZATION; ALGORITHMS; SELECTION; PACKAGE; PATH;
D O I
10.1186/1471-2105-13-205
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Variations in DNA copy number carry information on the modalities of genome evolution and mis-regulation of DNA replication in cancer cells. Their study can help localize tumor suppressor genes, distinguish different populations of cancerous cells, and identify genomic variations responsible for disease phenotypes. A number of different high throughput technologies can be used to identify copy number variable sites, and the literature documents multiple effective algorithms. We focus here on the specific problem of detecting regions where variation in copy number is relatively common in the sample at hand. This problem encompasses the cases of copy number polymorphisms, related samples, technical replicates, and cancerous sub-populations from the same individual. Results: We present a segmentation method named generalized fused lasso (GFL) to reconstruct copy number variant regions. GFL is based on penalized estimation and is capable of processing multiple signals jointly. Our approach is computationally very attractive and leads to sensitivity and specificity levels comparable to those of state-of-the-art specialized methodologies. We illustrate its applicability with simulated and real data sets. Conclusions: The flexibility of our framework makes it applicable to data obtained with a wide range of technology. Its versatility and speed make GFL particularly useful in the initial screening stages of large data sets.
引用
收藏
页数:15
相关论文
共 50 条
  • [21] VEGA: variational segmentation for copy number detection
    Morganella, Sandro
    Cerulo, Luigi
    Viglietto, Giuseppe
    Ceccarelli, Michele
    BIOINFORMATICS, 2010, 26 (24) : 3020 - 3027
  • [22] Late-onset mitochondrial DNA depletion:: DNA copy number, multiple deletions, and compensation
    Barthélémy, C
    de Baulny, HO
    Diaz, J
    Cheval, MA
    Frachon, P
    Romero, N
    Goutieres, F
    Fardeau, M
    Lombès, A
    ANNALS OF NEUROLOGY, 2001, 49 (05) : 607 - 617
  • [23] PLRS: a flexible tool for the joint analysis of DNA copy number and mRNA expression data
    Leday, Gwenael G. R.
    van de Wiel, Mark A.
    BIOINFORMATICS, 2013, 29 (08) : 1081 - 1082
  • [24] Framework for the analysis of genetic variations across multiple DNA copy number samples
    Alqallaf, Abdullah K.
    Tewfik, Ahmed H.
    Selleck, Scott B.
    Johnson, Rebecca
    2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 553 - +
  • [25] CGHweb: a tool for comparing DNA copy number segmentations from multiple algorithms
    Lai, Weil
    Choudhary, Vidhu
    Park, Peter J.
    BIOINFORMATICS, 2008, 24 (07) : 1014 - 1015
  • [26] Segmentation algorithm for DNA sequences
    Zhang, CT
    Gao, F
    Zhang, R
    PHYSICAL REVIEW E, 2005, 72 (04)
  • [27] Mitochondrial DNA copy number and disease
    Clyde D.
    Nature Reviews Genetics, 2022, 23 (3) : 136 - 136
  • [28] MITOCHONDRIAL DNA COPY NUMBER IN CVD
    Markina, Yuliya
    Kirichenko, Tatiana
    Tolstik, Taisiya
    Bogatyreva, Anastasia
    Markin, Alexander
    ATHEROSCLEROSIS, 2024, 395
  • [29] Bayesian DNA copy number analysis
    Paola MV Rancoita
    Marcus Hutter
    Francesco Bertoni
    Ivo Kwee
    BMC Bioinformatics, 10
  • [30] Bayesian DNA copy number analysis
    Rancoita, Paola M. V.
    Hutter, Marcus
    Bertoni, Francesco
    Kwee, Ivo
    BMC BIOINFORMATICS, 2009, 10