Fast and Parallel Algorithm for Population-Based Segmentation of Copy-Number Profiles

被引：0

作者：

Rigaill, Guillem ^{[1
]}

Miele, Vincent ^{[2
]}

Picard, Franck ^{[2
]}

机构：

[1] Univ Evry Val dEssonne, Unit Rech Genom Vegetale URGV, INRA CNRS, F-91057 Evry, France

[2] Univ Lyon 1, Lab Biometrie & Biol Evolut, UMR CNRS 5558, F-69622 Villeurbanne, France

来源：

COMPUTATIONAL INTELLIGENCE METHODS FOR BIOINFORMATICS AND BIOSTATISTICS: 10TH INTERNATIONAL MEETING | 2014年 / 8452卷

关键词：

DNA copy number; Dynamic Programming; Segmentation; Joint segmentation; Parallel computing; ARRAY CGH DATA; MODEL;

D O I：

10.1007/978-3-319-09042-9_18

中图分类号：

Q5 [生物化学];

学科分类号：

071010 ; 081704 ;

摘要：

Dynamic Programming (DP) based change-point methods have shown very good statistical performance on DNA copy number analysis. However, the quadratic algorithmic complexity of DP has limited their use on high-density arrays or next generation sequencing data. This complexity issue is particularly critical for segmentation and calling of segments, and for the joint segmentation of many different profiles. Our contribution is two-fold. First we provide an at worst linear DP algorithm for segmentation and calling, which allows the use of DP-based segmentation on high-density arrays with a considerably reduced computational cost. For the joint segmentation issue we provide a parallel version of the cghseg package which now allows us to analyze more than 1,000 profiles of length 100,000 within a few hours. Therefore our method and software package are adapted to the next generation of computers (multi-cores) and experiments (very large profiles).

引用

页码：248 / 258

页数：11

共 20 条

[1]

Amdahl G.M., 1967, Validity of the single processor approach to achieving large scale computing capabilities. Proceedings of the Spring Joint Computer Conference, P483, DOI DOI 10.1145/1465482.1465560

[2] A high-resolution map of transcription in the yeast genome [J].