Integrative analysis of genome-scale data by using pseudoinverse projection predicts novel correlation between DNA replication and RNA transcription

被引:55
作者
Alter, O [1 ]
Golub, GH
机构
[1] Univ Texas, Dept Biomed Engn, Austin, TX 78712 USA
[2] Univ Texas, Inst Cellular & Mol Biol, Austin, TX 78712 USA
[3] Stanford Univ, Sci Comp & Computat Math Program, Stanford, CA 94305 USA
[4] Stanford Univ, Dept Comp Sci, Stanford, CA 94305 USA
关键词
singular value decomposition; generalized singular value decomposition; DNA microarrays; yeast Saccharomyces cerevisiae cell cycle;
D O I
10.1073/pnas.0406767101
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
We describe an integrative data-driven mathematical framework that formulates any number of genome-scale molecular biological data sets in terms of one chosen set of data samples, or of profiles extracted mathematically from data samples, designated the "basis" set. By using pseudoinverse projection, the molecular biological profiles of the data samples are least-squares-approximated as superpositions of the basis profiles. Reconstruction of the data in the basis simulates experimental observation of only the cellular states manifest in the data that correspond to those of the basis. Classification of the data samples according to their reconstruction in the basis, rather than their overall measured profiles, maps the cellular states of the data onto those of the basis and gives a global picture of the correlations and possibly also causal coordination of these two sets of states. We illustrate this framework with an integration of yeast genome-scale proteins' DNA-binding data with cell cycle mRNA expression time course data. Novel correlation between DNA replication initiation and RNA transcription during the yeast cell cycle, which might be due to a previously unknown mechanism of regulation, is predicted.
引用
收藏
页码:16577 / 16582
页数:6
相关论文
共 19 条
  • [1] Alberts B., 1994, MOL BIOL CELL
  • [2] Singular value decomposition for genome-wide expression data processing and modeling
    Alter, O
    Brown, PO
    Botstein, D
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2000, 97 (18) : 10101 - 10106
  • [3] Generalized singular value decomposition for comparative analysis of genome-scale expression data sets of two different organisms
    Alter, O
    Brown, PO
    Botstein, D
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2003, 100 (06) : 3351 - 3356
  • [4] Processing and modeling genome-wide expression data using singular value decomposition
    Alter, O
    Brown, PO
    Botstein, D
    [J]. MICROARRAYS: OPTICAL TECHNOLOGIES AND INFORMATICS, 2001, 4266 : 171 - 186
  • [5] [Anonymous], 1996, MATRIX COMPUTATION
  • [6] Regulatory element detection using correlation with expression
    Bussemaker, HJ
    Li, H
    Siggia, ED
    [J]. NATURE GENETICS, 2001, 27 (02) : 167 - 171
  • [7] Mcm1 binds replication origins
    Chang, VK
    Fitch, MJ
    Donato, JJ
    Christensen, TW
    Merchant, AM
    Tye, BK
    [J]. JOURNAL OF BIOLOGICAL CHEMISTRY, 2003, 278 (08) : 6093 - 6100
  • [8] 2 STEPS IN THE ASSEMBLY OF COMPLEXES AT YEAST REPLICATION ORIGINS IN-VIVO
    DIFFLEY, JFX
    COCKER, JH
    DOWELL, SJ
    ROWLEY, A
    [J]. CELL, 1994, 78 (02) : 303 - 316
  • [9] Influences of the cell cycle on silencing
    Fox, CA
    Rine, J
    [J]. CURRENT OPINION IN CELL BIOLOGY, 1996, 8 (03) : 354 - 357
  • [10] Genomic binding sites of the yeast cell-cycle transcription factors SBF and MBF
    Iyer, VR
    Horak, CE
    Scafe, CS
    Botstein, D
    Snyder, M
    Brown, PO
    [J]. NATURE, 2001, 409 (6819) : 533 - 538