Jointly analyzing gene expression and copy number data in breast cancer using data reduction models

被引:39
作者
Berger, JA [1 ]
Hautaniemi, S
Mitra, SK
Astola, J
机构
[1] Univ Calif Santa Barbara, Dept Elect & Comp Engn, Santa Barbara, CA 93106 USA
[2] MIT, Dept Biol Engn, Cambridge, MA 02139 USA
[3] Tampere Univ Technol, Inst Signal Proc, FIN-33101 Tampere, Finland
基金
芬兰科学院;
关键词
generalized singular value decomposition; cDNA microarray data; CGH microarray data; gene expression; DNA copy numbers; breast cancer;
D O I
10.1109/TCBB.2006.10
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
With the growing surge of biological measurements, the problem of integrating and analyzing different types of genomic measurements has become an immediate challenge for elucidating events at the molecular level. In order to address the problem of integrating different data types, we present a framework that locates variation patterns in two biological inputs based on the generalized singular value decomposition (GSVD). In this work, we jointly examine gene expression and copy number data and iteratively project the data on different decomposition directions defined by the projection angle theta in the GSVD. With the proper choice of theta, we locate similar and dissimilar patterns of variation between both data types. We discuss the properties of our algorithm using simulated data and conduct a case study with biologically verified results. Ultimately, we demonstrate the efficacy of our method on two genome-wide breast cancer studies to identify genes with large variation in expression and copy number across numerous cell line and tumor samples. Our method identifies genes that are statistically significant in both input measurements. The proposed method is useful for a wide variety of joint copy number and expression-based studies. Supplementary information is available online, including software implementations and experimental data.
引用
收藏
页码:2 / 16
页数:15
相关论文
共 41 条
[1]   Profiling breast cancer by array CGH [J].
Albertson, DG .
BREAST CANCER RESEARCH AND TREATMENT, 2003, 78 (03) :289-298
[2]   Singular value decomposition for genome-wide expression data processing and modeling [J].
Alter, O ;
Brown, PO ;
Botstein, D .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2000, 97 (18) :10101-10106
[3]   Generalized singular value decomposition for comparative analysis of genome-scale expression data sets of two different organisms [J].
Alter, O ;
Brown, PO ;
Botstein, D .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2003, 100 (06) :3351-3356
[4]   Gene Ontology: tool for the unification of biology [J].
Ashburner, M ;
Ball, CA ;
Blake, JA ;
Botstein, D ;
Butler, H ;
Cherry, JM ;
Davis, AP ;
Dolinski, K ;
Dwight, SS ;
Eppig, JT ;
Harris, MA ;
Hill, DP ;
Issel-Tarver, L ;
Kasarskis, A ;
Lewis, S ;
Matese, JC ;
Richardson, JE ;
Ringwald, M ;
Rubin, GM ;
Sherlock, G .
NATURE GENETICS, 2000, 25 (01) :25-29
[5]   Which is better for cDNA-microarray-based classification: ratios or direct intensities [J].
Attoor, S ;
Dougherty, ER ;
Chen, Y ;
Bittner, ML ;
Trent, JM .
BIOINFORMATICS, 2004, 20 (16) :2513-2520
[6]  
BERGER JA, 2006, SUPPLEMENTARY WEBPAG
[7]   Finishing the euchromatic sequence of the human genome [J].
Collins, FS ;
Lander, ES ;
Rogers, J ;
Waterston, RH .
NATURE, 2004, 431 (7011) :931-945
[8]   Genome screening by comparative genomic hybridization [J].
Forozan, F ;
Karhu, R ;
Kononen, J ;
Kallioniemi, A ;
Kallioniemi, OP .
TRENDS IN GENETICS, 1997, 13 (10) :405-409
[9]  
Golub G. H., 1996, MATRIX COMPUTATIONS
[10]   The hallmarks of cancer [J].
Hanahan, D ;
Weinberg, RA .
CELL, 2000, 100 (01) :57-70