Surprisal Analysis of Transcripts Expression Levels in the Presence of Noise: A Reliable Determination of the Onset of a Tumor Phenotype

被引:15
作者
Gross, Ayelet [1 ]
Levine, Raphael D. [1 ,2 ,3 ]
机构
[1] Hebrew Univ Jerusalem, Fritz Haber Res Ctr, IL-91904 Jerusalem, Israel
[2] David Geffen Sch Med Calif, Crump Inst Mol Imaging, Dept Chem & Biochem, Los Angeles, CA USA
[3] David Geffen Sch Med Calif, Dept Mol & Med Pharmacol, Los Angeles, CA USA
关键词
INFORMATION; PROFILES; NETWORKS;
D O I
10.1371/journal.pone.0061554
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Towards a reliable identification of the onset in time of a cancer phenotype, changes in transcription levels in cell models were tested. Surprisal analysis, an information-theoretic approach grounded in thermodynamics, was used to characterize the expression level of mRNAs as time changed. Surprisal Analysis provides a very compact representation for the measured expression levels of many thousands of mRNAs in terms of very few - three, four - transcription patterns. The patterns, that are a collection of transcripts that respond together, can be assigned definite biological phenotypic role. We identify a transcription pattern that is a clear marker of eventual malignancy. The weight of each transcription pattern is determined by surprisal analysis. The weight of this pattern changes with time; it is never strictly zero but it is very low at early times and then rises rather suddenly. We suggest that the low weights at early time points are primarily due to experimental noise. We develop the necessary formalism to determine at what point in time the value of that pattern becomes reliable. Beyond the point in time when a pattern is deemed reliable the data shows that the pattern remain reliable. We suggest that this allows a determination of the presence of a cancer forewarning. We apply the same formalism to the weight of the transcription patterns that account for healthy cell pathways, such as apoptosis, that need to be switched off in cancer cells. We show that their weight eventually falls below the threshold. Lastly we discuss patient heterogeneity as an additional source of fluctuation and show how to incorporate it within the developed formalism.
引用
收藏
页数:6
相关论文
共 24 条
[1]   Information Theoretic Hierarchical Clustering [J].
Aghagolzadeh, Mehdi ;
Soltanian-Zadeh, Hamid ;
Araabi, Babak Nadjar .
ENTROPY, 2011, 13 (02) :450-465
[2]  
ALHASSID Y, 1980, CHEM PHYS LETT, V73, P16, DOI 10.1016/0009-2614(80)85192-X
[3]   Singular value decomposition for genome-wide expression data processing and modeling [J].
Alter, O ;
Brown, PO ;
Botstein, D .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2000, 97 (18) :10101-10106
[4]  
[Anonymous], 1948, Handbook of Mathematical Functions withFormulas, Graphs, and Mathematical Tables, DOI DOI 10.1119/1.15378
[5]  
[Anonymous], 2003, PRACTICAL APPROACH M, DOI [DOI 10.1007/0-306-47815-35, 10.1007/0-306-47815-35, DOI 10.1007/0-306-47815-3_5]
[6]  
Arfken G.B., 2012, Mathematical Methods for Physicists
[7]   Evidence Based Selection of Housekeeping Genes [J].
de Jonge, Hendrik J. M. ;
Fehrmann, Rudolf S. N. ;
de Bont, Eveline S. J. M. ;
Hofstra, Robert M. W. ;
Gerbens, Frans ;
Kamps, Willem A. ;
de Vries, Elisabeth G. E. ;
van der Zee, Ate G. J. ;
Meerman, Gerard J. te ;
ter Elst, Arja .
PLOS ONE, 2007, 2 (09)
[8]  
Golub G. H., 1996, MATRIX COMPUTATIONS
[9]  
Gross A, 2013, BIOCHEMISTRY
[10]   Fundamental patterns underlying gene expression profiles: Simplicity from complexity [J].
Holter, NS ;
Mitra, M ;
Maritan, A ;
Cieplak, M ;
Banavar, JR ;
Fedoroff, NV .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2000, 97 (15) :8409-8414