Making sense of human lung carcinomas gene expression data: Integration and analysis of two affymetrix platform experiments
被引:1
作者:
Lin, X
论文数: 0引用数: 0
h-index: 0
机构:
GlaxoSmithKline, Biomed Data Sci, Collegeville, PA 19426 USAGlaxoSmithKline, Biomed Data Sci, Collegeville, PA 19426 USA
Lin, X
[1
]
Park, D
论文数: 0引用数: 0
h-index: 0
机构:
GlaxoSmithKline, Biomed Data Sci, Collegeville, PA 19426 USAGlaxoSmithKline, Biomed Data Sci, Collegeville, PA 19426 USA
Park, D
[1
]
Eslava, S
论文数: 0引用数: 0
h-index: 0
机构:
GlaxoSmithKline, Biomed Data Sci, Collegeville, PA 19426 USAGlaxoSmithKline, Biomed Data Sci, Collegeville, PA 19426 USA
Eslava, S
[1
]
Lee, KR
论文数: 0引用数: 0
h-index: 0
机构:
GlaxoSmithKline, Biomed Data Sci, Collegeville, PA 19426 USAGlaxoSmithKline, Biomed Data Sci, Collegeville, PA 19426 USA
Lee, KR
[1
]
Lam, RLH
论文数: 0引用数: 0
h-index: 0
机构:
GlaxoSmithKline, Biomed Data Sci, Collegeville, PA 19426 USAGlaxoSmithKline, Biomed Data Sci, Collegeville, PA 19426 USA
Lam, RLH
[1
]
Zhu, LA
论文数: 0引用数: 0
h-index: 0
机构:
GlaxoSmithKline, Biomed Data Sci, Collegeville, PA 19426 USAGlaxoSmithKline, Biomed Data Sci, Collegeville, PA 19426 USA
Zhu, LA
[1
]
机构:
[1] GlaxoSmithKline, Biomed Data Sci, Collegeville, PA 19426 USA
来源:
METHODS OF MICROARRAY DATA ANALYSIS IV
|
2005年
关键词:
gene expression;
integration;
affymetrix MAS;
principal component analysis;
partial least squares;
survival tree;
D O I:
10.1007/0-387-23077-7_7
中图分类号:
Q5 [生物化学];
学科分类号:
071010 ;
081704 ;
摘要:
High throughput technologies such as microarray, mass spectrometry and nuclear magnetic resonance, have generated large volumes of valuable data for biology research. Researchers often face the challenges of integrating data from different sources and of identifying potential biomarkers that are highly associated with disease, drug safety, and efficacy. We present several solutions to these challenges through two Affymetrix miroarray studies aimed at providing new insights into lung cancer biology. The Harvard dataset and the Michigan dataset were integrated to identify genes that were predictive of cancer survival. Quantile normalization of expression measures was applied to make the two datasets comparable. Genes highly associated with survival were identified and survival tree analysis on the combined data was performed to predict mortality. The candidate genes could be useful for lung cancer disease prediction and cancer therapy. The methodologies for integration and analysis of multiple gene expression data have been shown to perform well and could be generalized to broader applications.