Shrinkage-based similarity metric for cluster analysis of microarray data

被引:24
|
作者
Cherepinsky, V
Feng, JW
Rejali, M
Mishra, B
机构
[1] NYU, Courant Inst Math Sci, New York, NY 10012 USA
[2] Cold Spring Harbor Lab, Cold Spring Harbor, NY 11724 USA
关键词
D O I
10.1073/pnas.1633770100
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
The current standard correlation coefficient used in the analysis of microarray data was introduced by M. B. Eisen, P. T. Spellman, P. O. Brown, and D. Botstein [(1998) Proc. Nati. Acad Sci. USA 95, 1486314868]. Its formulation is rather arbitrary. We give a mathematically rigorous correlation coefficient of two data vectors based on James-Stein shrinkage estimators. We use the assumptions described by Eisen et al., also using the fact that the data can be treated as transformed into normal distributions. While Eisen et A use zero as an estimator for the expression vector mean mu, we start with the assumption that for each gene, IL is itself a zero-mean normal random variable [with a priori distribution N(0, tau(2))], and use Bayesian analysis to obtain a posteriori distribution of mu in terms of the data. The shrunk estimator for mu differs from the mean of the data vectors and ultimately leads to a statistically robust estimator for correlation coefficients. To evaluate the effectiveness of shrinkage, we conducted in silico experiments and also compared similarity metrics on a biological example by using the data set from Eisen et A For the latter, we classified genes involved in the regulation of yeast cell-cycle functions by computing clusters based on various definitions of correlation coefficients and contrasting them against clusters based on the activators known in the literature. The estimated false positives and false negatives from this study indicate that using the shrinkage metric improves the accuracy of the analysis.
引用
收藏
页码:9668 / 9673
页数:6
相关论文
共 50 条
  • [21] On the Robustness of Covariance Matrix Shrinkage-Based Robust Adaptive Beamforming
    Xiao, Zhitao
    Wang, Jiahao
    Geng, Lei
    Zhang, Fang
    Tong, Jun
    PROCEEDINGS OF 2018 INTERNATIONAL CONFERENCE ON ELECTRONICS AND ELECTRICAL ENGINEERING TECHNOLOGY (EEET 2018), 2018, : 196 - 201
  • [22] Exact hypothesis testing for shrinkage-based Gaussian graphical models
    Bernal, Victor
    Bischoff, Rainer
    Guryev, Victor
    Grzegorczyk, Marco
    Horvatovich, Peter
    BIOINFORMATICS, 2019, 35 (23) : 5011 - 5017
  • [23] Cluster analysis of mixed data based on Feature Space Instance Cluster Closeness Metric
    Balaji, K.
    Lavanya, K.
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2021, 215
  • [24] A shrinkage-based criterion for evaluating resistance spot weldability of alloyed steels
    Li, Shuoshuo
    Wang, Yanjun
    Hu, Bin
    Tao, Wu
    Yang, Shanglu
    Luo, Haiwen
    PNAS NEXUS, 2022, 1 (04):
  • [25] On shrinkage-based edge detection in a PCA subspace for penumbral imaging
    Han, Xian-Hua
    Nakao, Zensho
    Chen, Yen-Wei
    INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2006, 2 (01): : 153 - 166
  • [26] Recent advances in shrinkage-based high-dimensional inference
    Bodnar, Olha
    Bodnar, Taras
    Parolya, Nestor
    JOURNAL OF MULTIVARIATE ANALYSIS, 2022, 188
  • [27] Cluster Analysis of Microarray data of Shewanella oneidensis
    Nilufar, Sharmin
    Ara, Mst. Jannatul Ferdousi
    2008 11TH INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION TECHNOLOGY: ICCIT 2008, VOLS 1 AND 2, 2008, : 764 - +
  • [28] Analyzing microarray data using cluster analysis
    Shannon, W
    Culverhouse, R
    Duncan, J
    PHARMACOGENOMICS, 2003, 4 (01) : 41 - 52
  • [29] Optimal shrinkage estimation of variances with applications to microarray data analysis
    Tong, Tiejun
    Wang, Yuedong
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2007, 102 (477) : 113 - 122
  • [30] Improved Deadzone Modeling for Bivariate Wavelet Shrinkage-Based Image Denoising
    DelMarco, Stephen
    MOBILE MULTIMEDIA/IMAGE PROCESSING, SECURITY, AND APPLICATIONS 2016, 2016, 9869