Reconstructing DNA copy number by joint segmentation of multiple sequences

被引:11
|
作者
Zhang, Zhongyang [1 ]
Lange, Kenneth [2 ]
Sabatti, Chiara [3 ]
机构
[1] Univ Calif Los Angeles, Dept Stat, Los Angeles, CA USA
[2] Univ Calif Los Angeles, Dept Human Genet Biomath & Stat, Los Angeles, CA USA
[3] Stanford Univ, Dept Hlth Res & Policy & Stat, Stanford, CA 94305 USA
来源
BMC BIOINFORMATICS | 2012年 / 13卷
关键词
Copy number variant; Copy number polymorphism; Fused lasso; Group fused lasso; MM algorithm; CIRCULAR BINARY SEGMENTATION; HIDDEN MARKOV-MODELS; GENOTYPE CALLS; LASSO; NORMALIZATION; ALGORITHMS; SELECTION; PACKAGE; PATH;
D O I
10.1186/1471-2105-13-205
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Variations in DNA copy number carry information on the modalities of genome evolution and mis-regulation of DNA replication in cancer cells. Their study can help localize tumor suppressor genes, distinguish different populations of cancerous cells, and identify genomic variations responsible for disease phenotypes. A number of different high throughput technologies can be used to identify copy number variable sites, and the literature documents multiple effective algorithms. We focus here on the specific problem of detecting regions where variation in copy number is relatively common in the sample at hand. This problem encompasses the cases of copy number polymorphisms, related samples, technical replicates, and cancerous sub-populations from the same individual. Results: We present a segmentation method named generalized fused lasso (GFL) to reconstruct copy number variant regions. GFL is based on penalized estimation and is capable of processing multiple signals jointly. Our approach is computationally very attractive and leads to sensitivity and specificity levels comparable to those of state-of-the-art specialized methodologies. We illustrate its applicability with simulated and real data sets. Conclusions: The flexibility of our framework makes it applicable to data obtained with a wide range of technology. Its versatility and speed make GFL particularly useful in the initial screening stages of large data sets.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Reconstructing DNA copy number by joint segmentation of multiple sequences
    Zhongyang Zhang
    Kenneth Lange
    Chiara Sabatti
    BMC Bioinformatics, 13
  • [2] Joint estimation of DNA copy number from multiple platforms
    Zhang, Nancy R.
    Senbabaoglu, Yasin
    Li, Jun Z.
    BIOINFORMATICS, 2010, 26 (02) : 153 - 160
  • [3] RECONSTRUCTING DNA COPY NUMBER BY PENALIZED ESTIMATION AND IMPUTATION
    Zhang, Zhongyang
    Lange, Kenneth
    Ophoff, Roel
    Sabatti, Chiara
    ANNALS OF APPLIED STATISTICS, 2010, 4 (04): : 1749 - 1773
  • [4] ToolSEG: A Tool for DNA Copy Number Segmentation
    Liu, Zhen
    Sun, Ming
    Ruan, Jun
    Yue, Junqiu
    2016 3RD INTERNATIONAL CONFERENCE ON SOFT COMPUTING & MACHINE INTELLIGENCE (ISCMI 2016), 2016, : 159 - 163
  • [5] VARIABLE COPY NUMBER DNA-SEQUENCES IN RICE
    KIKUCHI, S
    TAKAIWA, F
    OONO, K
    MOLECULAR & GENERAL GENETICS, 1987, 210 (03): : 373 - 380
  • [6] Performance evaluation of DNA copy number segmentation methods
    Pierre-Jean, Morgane
    Rigaill, Guillem
    Neuvial, Pierre
    BRIEFINGS IN BIOINFORMATICS, 2015, 16 (04) : 600 - 615
  • [7] A NOVEL APPROACH TO DNA COPY NUMBER DATA SEGMENTATION
    Wang, Siling
    Wang, Yuhang
    Xie, Yang
    Xiao, Guanghua
    JOURNAL OF BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, 2011, 9 (01) : 131 - 148
  • [8] Joint estimation of copy number variation and reference intensities on multiple DNA arrays using GADA
    Pique-Regi, Roger
    Ortega, Antonio
    Asgharzadeh, Shahab
    BIOINFORMATICS, 2009, 25 (10) : 1223 - 1230
  • [9] Multi-platform segmentation for joint detection of copy number variants
    Teo, Shu Mei
    Pawitan, Yudi
    Kumar, Vikrant
    Thalamuthu, Anbupalam
    Seielstad, Mark
    Chia, Kee Seng
    Salim, Agus
    BIOINFORMATICS, 2011, 27 (11) : 1555 - 1561
  • [10] Simple binary segmentation frameworks for identifying variation in DNA copy number
    Tae Young Yang
    BMC Bioinformatics, 13