Inference for high-dimensional differential correlation matrices

被引:25
作者
Cai, T. Tony [1 ]
Zhang, Anru [1 ]
机构
[1] Univ Penn, Wharton Sch, Dept Stat, Philadelphia, PA 19104 USA
基金
美国国家科学基金会; 美国国家卫生研究院;
关键词
Adaptive thresholding; Covariance matrix; Differential co-expression analysis; Differential correlation matrix; Optimal rate of convergence; Sparse correlation matrix; Thresholding; COVARIANCE MATRICES; BREAST-CANCER; COEXPRESSION; EXPRESSION; EQUALITY;
D O I
10.1016/j.jmva.2015.08.019
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Motivated by differential co-expression analysis in genomics, we consider in this paper estimation and testing of high-dimensional differential correlation matrices. An adaptive thresholding procedure is introduced and theoretical guarantees are given. Minimax rate of convergence is established and the proposed estimator is shown to be adaptively rate-optimal over collections of paired correlation matrices with approximately sparse differences. Simulation results show that the procedure significantly outperforms two other natural methods that are based on separate estimation of the individual correlation matrices. The procedure is also illustrated through an analysis of a breast cancer dataset, which provides evidence at the gene co-expression level that several genes, of which a subset has been previously verified, are associated with the breast cancer. Hypothesis testing on the differential correlation matrices is also considered. A test, which is particularly well suited for testing against sparse alternatives, is introduced. In addition, other related problems, including estimation of a single sparse correlation matrix, estimation of the differential covariance matrices, and estimation of the differential cross-correlation matrices, are also discussed. Published by Elsevier Inc.
引用
收藏
页码:107 / 126
页数:20
相关论文
共 33 条
[1]   Rewiring of Genetic Networks in Response to DNA Damage [J].
Bandyopadhyay, Sourav ;
Mehta, Monika ;
Kuo, Dwight ;
Sung, Min-Kyung ;
Chuang, Ryan ;
Jaehnig, Eric J. ;
Bodenmiller, Bernd ;
Licon, Katherine ;
Copeland, Wilbert ;
Shales, Michael ;
Fiedler, Dorothea ;
Dutkowski, Janusz ;
Guenole, Aude ;
van Attikum, Haico ;
Shokat, Kevan M. ;
Kolodner, Richard D. ;
Huh, Won-Ki ;
Aebersold, Ruedi ;
Keogh, Michael-Christopher ;
Krogan, Nevan J. ;
Ideker, Trey .
SCIENCE, 2010, 330 (6009) :1385-1389
[2]   Network biology:: Understanding the cell's functional organization [J].
Barabási, AL ;
Oltvai, ZN .
NATURE REVIEWS GENETICS, 2004, 5 (02) :101-U15
[3]   Network medicine: a network-based approach to human disease [J].
Barabasi, Albert-Laszlo ;
Gulbahce, Natali ;
Loscalzo, Joseph .
NATURE REVIEWS GENETICS, 2011, 12 (01) :56-68
[4]  
Bickel P., 2008, ANN STAT, V26, P879
[5]   OPTIMAL RATES OF CONVERGENCE FOR SPARSE COVARIANCE MATRIX ESTIMATION [J].
Cai, T. Tony ;
Zhou, Harrison H. .
ANNALS OF STATISTICS, 2012, 40 (05) :2389-2420
[6]   OPTIMAL RATES OF CONVERGENCE FOR COVARIANCE MATRIX ESTIMATION [J].
Cai, T. Tony ;
Zhang, Cun-Hui ;
Zhou, Harrison H. .
ANNALS OF STATISTICS, 2010, 38 (04) :2118-2144
[7]   Two-Sample Covariance Matrix Testing and Support Recovery in High-Dimensional and Sparse Settings [J].
Cai, Tony ;
Liu, Weidong ;
Xia, Yin .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2013, 108 (501) :265-277
[8]   ADAPTIVE COVARIANCE MATRIX ESTIMATION THROUGH BLOCK THRESHOLDING [J].
Cai, Tony ;
Yuan, Ming .
ANNALS OF STATISTICS, 2012, 40 (04) :2014-2042
[9]   Adaptive Thresholding for Sparse Covariance Matrix Estimation [J].
Cai, Tony ;
Liu, Weidong .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2011, 106 (494) :672-684
[10]   Gene co-expression network topology provides a framework for molecular characterization of cellular state [J].
Carter, SL ;
Brechbühler, CM ;
Griffin, M ;
Bond, AT .
BIOINFORMATICS, 2004, 20 (14) :2242-2250