A NOVEL APPROACH TO DNA COPY NUMBER DATA SEGMENTATION

被引:2
|
作者
Wang, Siling [1 ]
Wang, Yuhang [1 ]
Xie, Yang [2 ]
Xiao, Guanghua [2 ]
机构
[1] So Methodist Univ, Dept Comp Sci & Engn, Dallas, TX 75205 USA
[2] Univ Texas SW Med Ctr Dallas, Dept Clin Sci, Dallas, TX 75390 USA
基金
美国国家科学基金会;
关键词
MDL; model-based; DCN data; segmentation; COMPARATIVE GENOMIC HYBRIDIZATION; ARRAY; AMPLIFICATIONS; ALGORITHMS; PRINCIPLE; MODELS;
D O I
10.1142/S0219720011005343
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
DNA copy number (DCN) is the number of copies of DNA at a region of a genome. The alterations of DCN are highly associated with the development of different tumors. Recently, microarray technologies are being employed to detect DCN changes at many loci at the same time in tumor samples. The resulting DCN data are often very noisy, and the tumor sample is often contaminated by normal cells. The goal of computational analysis of array-based DCN data is to infer the underlying DCNs from raw DCN data. Previous methods for this task do not model the tumor/normal cell mixture ratio explicitly and they cannot output segments with DCN annotations. We developed a novel model-based method using the minimum description length (MDL) principle for DCN data segmentation. Our new method can output underlying DCN for each chromosomal segment, and at the same time, infer the underlying tumor proportion in the test samples. Empirical results show that our method achieves better accuracies on average as compared to three previous methods, namely Circular Binary Segmentation, Hidden Markov Model and Ultrasome.
引用
收藏
页码:131 / 148
页数:18
相关论文
共 50 条
  • [1] ToolSEG: A Tool for DNA Copy Number Segmentation
    Liu, Zhen
    Sun, Ming
    Ruan, Jun
    Yue, Junqiu
    2016 3RD INTERNATIONAL CONFERENCE ON SOFT COMPUTING & MACHINE INTELLIGENCE (ISCMI 2016), 2016, : 159 - 163
  • [2] Performance evaluation of DNA copy number segmentation methods
    Pierre-Jean, Morgane
    Rigaill, Guillem
    Neuvial, Pierre
    BRIEFINGS IN BIOINFORMATICS, 2015, 16 (04) : 600 - 615
  • [3] Circular binary segmentation for the analysis of array-based DNA copy number data
    Olshen, AB
    Venkatraman, ES
    Lucito, R
    Wigler, M
    BIOSTATISTICS, 2004, 5 (04) : 557 - 572
  • [4] NMF Based Approach for Finding Recurrent Aberrations in DNA Copy Number Data
    Elhenawy, Mohammed
    Yu, Guoqiang
    2012 IEEE INTERNATIONAL WORKSHOP ON GENOMIC SIGNAL PROCESSING AND STATISTICS (GENSIPS), 2012, : 82 - 85
  • [5] A penalized regression approach for DNA copy number study using the sequencing data
    Lee, Jaeeun
    Chen, Jie
    STATISTICAL APPLICATIONS IN GENETICS AND MOLECULAR BIOLOGY, 2019, 18 (04)
  • [6] Reconstructing DNA copy number by joint segmentation of multiple sequences
    Zhongyang Zhang
    Kenneth Lange
    Chiara Sabatti
    BMC Bioinformatics, 13
  • [7] Reconstructing DNA copy number by joint segmentation of multiple sequences
    Zhang, Zhongyang
    Lange, Kenneth
    Sabatti, Chiara
    BMC BIOINFORMATICS, 2012, 13
  • [8] Simple binary segmentation frameworks for identifying variation in DNA copy number
    Tae Young Yang
    BMC Bioinformatics, 13
  • [9] Classification techniques for recurrent DNA copy number data
    Alqallaf, Abdullah K.
    Tewfik, Ahmed H.
    2008 3RD INTERNATIONAL SYMPOSIUM ON COMMUNICATIONS, CONTROL AND SIGNAL PROCESSING, VOLS 1-3, 2008, : 1218 - 1221
  • [10] DBS: a fast and informative segmentation algorithm for DNA copy number analysis
    Jun Ruan
    Zhen Liu
    Ming Sun
    Yue Wang
    Junqiu Yue
    Guoqiang Yu
    BMC Bioinformatics, 20