Pathway level analysis of gene expression using singular value decomposition

被引:270
作者
Tomfohr, J
Lu, J
Kepler, TB [1 ]
机构
[1] Duke Univ, Dept Biostat & Bioinformat, Durham, NC 27708 USA
[2] Duke Univ, Inst Genome Sci & Policy, Ctr Bioinformat & Computat Biol, Durham, NC 27708 USA
关键词
D O I
10.1186/1471-2105-6-225
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: A promising direction in the analysis of gene expression focuses on the changes in expression of specific predefined sets of genes that are known in advance to be related (e.g., genes coding for proteins involved in cellular pathways or complexes). Such an analysis can reveal features that are not easily visible from the variations in the individual genes and can lead to a picture of expression that is more biologically transparent and accessible to interpretation. In this article, we present a new method of this kind that operates by quantifying the level of 'activity' of each pathway in different samples. The activity levels, which are derived from singular value decompositions, form the basis for statistical comparisons and other applications. Results: We demonstrate our approach using expression data from a study of type 2 diabetes and another of the influence of cigarette smoke on gene expression in airway epithelia. A number of interesting pathways are identified in comparisons between smokers and non-smokers including ones related to nicotine metabolism, mucus production, and glutathione metabolism. A comparison with results from the related approach, 'gene-set enrichment analysis', is also provided. Conclusion: Our method offers a flexible basis for identifying differentially expressed pathways from gene expression data. The results of a pathway-based analysis can be complementary to those obtained from one more focused on individual genes. A web program PLAGE (Pathway Level Analysis of Gene Expression) for performing the kinds of analyses described here is accessible at http://dulci.biostat.duke.edu/pathways.
引用
收藏
页数:11
相关论文
共 25 条
  • [1] Alberts B., 2002, Molecular Biology of The Cell, V4th
  • [2] Singular value decomposition for genome-wide expression data processing and modeling
    Alter, O
    Brown, PO
    Botstein, D
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2000, 97 (18) : 10101 - 10106
  • [3] Ariyoshi N, 2002, CANCER EPIDEM BIOMAR, V11, P890
  • [4] Gene Ontology: tool for the unification of biology
    Ashburner, M
    Ball, CA
    Blake, JA
    Botstein, D
    Butler, H
    Cherry, JM
    Davis, AP
    Dolinski, K
    Dwight, SS
    Eppig, JT
    Harris, MA
    Hill, DP
    Issel-Tarver, L
    Kasarskis, A
    Lewis, S
    Matese, JC
    Richardson, JE
    Ringwald, M
    Rubin, GM
    Sherlock, G
    [J]. NATURE GENETICS, 2000, 25 (01) : 25 - 29
  • [5] A Bayesian framework for the analysis of microarray expression data: regularized t-test and statistical inferences of gene changes
    Baldi, P
    Long, AD
    [J]. BIOINFORMATICS, 2001, 17 (06) : 509 - 519
  • [6] Cowell LG, 2002, GENOME BIOL, V3
  • [7] MAPPFinder: using Gene Ontology and GenMAPP to create a global gene-expression profile from microarray data
    Doniger, SW
    Salomonis, N
    Dahlquist, KD
    Vranizan, K
    Lawlor, SC
    Conklin, BR
    [J]. GENOME BIOLOGY, 2003, 4 (01)
  • [8] Multiple hypothesis testing in microarray experiments
    Dudoit, S
    Shaffer, JP
    Boldrick, JC
    [J]. STATISTICAL SCIENCE, 2003, 18 (01) : 71 - 103
  • [9] Cigarette smoke-induced airway-inflammation as sampled by the expired breath condensate
    Dwyer, TM
    [J]. AMERICAN JOURNAL OF THE MEDICAL SCIENCES, 2003, 326 (04) : 174 - 178
  • [10] Empirical Bayes analysis of a microarray experiment
    Efron, B
    Tibshirani, R
    Storey, JD
    Tusher, V
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2001, 96 (456) : 1151 - 1160