Shrinkage-based similarity metric for cluster analysis of microarray data

被引:24
|
作者
Cherepinsky, V
Feng, JW
Rejali, M
Mishra, B
机构
[1] NYU, Courant Inst Math Sci, New York, NY 10012 USA
[2] Cold Spring Harbor Lab, Cold Spring Harbor, NY 11724 USA
关键词
D O I
10.1073/pnas.1633770100
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
The current standard correlation coefficient used in the analysis of microarray data was introduced by M. B. Eisen, P. T. Spellman, P. O. Brown, and D. Botstein [(1998) Proc. Nati. Acad Sci. USA 95, 1486314868]. Its formulation is rather arbitrary. We give a mathematically rigorous correlation coefficient of two data vectors based on James-Stein shrinkage estimators. We use the assumptions described by Eisen et al., also using the fact that the data can be treated as transformed into normal distributions. While Eisen et A use zero as an estimator for the expression vector mean mu, we start with the assumption that for each gene, IL is itself a zero-mean normal random variable [with a priori distribution N(0, tau(2))], and use Bayesian analysis to obtain a posteriori distribution of mu in terms of the data. The shrunk estimator for mu differs from the mean of the data vectors and ultimately leads to a statistically robust estimator for correlation coefficients. To evaluate the effectiveness of shrinkage, we conducted in silico experiments and also compared similarity metrics on a biological example by using the data set from Eisen et A For the latter, we classified genes involved in the regulation of yeast cell-cycle functions by computing clusters based on various definitions of correlation coefficients and contrasting them against clusters based on the activators known in the literature. The estimated false positives and false negatives from this study indicate that using the shrinkage metric improves the accuracy of the analysis.
引用
收藏
页码:9668 / 9673
页数:6
相关论文
共 50 条
  • [1] A multi-metric similarity based analysis of microarray data
    Altiparmak, Fatih
    Erdal, Selnur
    Ozturk, Ozgur
    Ferhatosmanoglu, Hakan
    2007 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, PROCEEDINGS, 2007, : 317 - +
  • [2] Shrinkage-Based Tests of Predictability
    Pincheira Brown, Pablo Matias
    JOURNAL OF FORECASTING, 2013, 32 (04) : 307 - 332
  • [3] Shrinkage-based Diagonal Discriminant Analysis and Its Applications in High-Dimensional Data
    Pang, Herbert
    Tong, Tiejun
    Zhao, Hongyu
    BIOMETRICS, 2009, 65 (04) : 1021 - 1029
  • [4] Decoding with Shrinkage-Based Language Models
    Emami, Ahmad
    Chen, Stanley
    Ittycheriah, Abraham
    Soltau, Hagen
    Zhao, Bing
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 1033 - 1036
  • [5] Shrinkage-based regularization tests for high-dimensional data with application to gene set analysis
    Shen, Yanfeng
    Lin, Zhengyan
    Zhu, Jun
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2011, 55 (07) : 2221 - 2233
  • [6] Scaling Shrinkage-Based Language Models
    Chen, Stanley F.
    Mangu, Lidia
    Ramabhadran, Bhuvana
    Sarikaya, Ruhi
    Sethy, Abhinav
    2009 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION & UNDERSTANDING (ASRU 2009), 2009, : 299 - 304
  • [7] Wavelet shrinkage-based strain estimation for elastography
    Xu, LJ
    Barnber, JC
    MelodeLima, D
    2005 IEEE ULTRASONICS SYMPOSIUM, VOLS 1-4, 2005, : 36 - 39
  • [8] Shrinkage-Based Capon and APES for Spectral Estimation
    Yang, Jun
    Ma, Xiaochuan
    Hou, Chaohuan
    Liu, Yicong
    IEEE SIGNAL PROCESSING LETTERS, 2009, 16 (10) : 869 - 872
  • [9] Shrinkage-based die design for drying ceramic parts
    Stathatos, Emmanuel
    Vosniakos, George-Christopher
    Giannakakis, Titos
    Pantelis, Dimitrios
    PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART B-JOURNAL OF ENGINEERING MANUFACTURE, 2012, 226 (B5) : 919 - 929
  • [10] Stability-based cluster analysis applied to microarray data
    Giurcaneanu, CD
    Tabus, I
    Shmulevich, I
    Zhang, W
    SEVENTH INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND ITS APPLICATIONS, VOL 2, PROCEEDINGS, 2003, : 57 - 60