An Equivalent Measure of Partial Correlation Coefficients for High-Dimensional Gaussian Graphical Models

被引:36
作者
Liang, Faming [1 ]
Song, Qifan [2 ]
Qiu, Peihua [1 ]
机构
[1] Univ Florida, Dept Biostat, Gainesville, FL 32611 USA
[2] Purdue Univ, Dept Stat, W Lafayette, IN 47906 USA
基金
美国国家科学基金会;
关键词
Adjacency faithfulness; Gaussian graphical model; Markov property; Multiple hypothesis test; Partial correlation coefficient; INVERSE COVARIANCE ESTIMATION; BREAST-CANCER; SELECTION; DISCOVERY; EXPRESSION; INDEPENDENCE; INSIGHTS; LASSO;
D O I
10.1080/01621459.2015.1012391
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Gaussian graphical models (GGMs) are frequently used to explore networks, such as gene regulatory networks, among a set of variables. Under the classical theory of GGMs, the construction of Gaussian graphical networks amounts to finding the pairs of variables with nonzero partial correlation coefficients. However, this is infeasible for high-dimensional problems for which the number of variables is larger than the sample size. In this article, we propose a new measure of partial correlation coefficient, which is evaluated with a reduced conditional set and thus feasible for high-dimensional problems. Under the Markov property and adjacency faithfulness conditions, the new measure of Partial correlation coefficient is equivalent to the true partial correlation coefficient in construction of Gaussian graphical networks. Based on the new measure of partial correlation coefficient, we propose a multiple hypothesis test-based method for the construction of Gaussian graphical networks. Furthermore, we establish the consistency of the proposed method under mild conditions. The proposed method outperforms the existing methods, such as the PC, graphical Lasso, nodewise regression, and qp-average methods, especially for the problems for which a large number of indirect associations are present. The proposed method has a computational complexity of nearly O(p(2)), and is flexible in data integration, network comparison, and covariate adjustment.
引用
收藏
页码:1248 / 1265
页数:18
相关论文
共 56 条
[1]   Multifunctional roles of insulin-like growth factor binding protein 5 in breast cancer [J].
Akkiprik, Mustafa ;
Feng, Yumei ;
Wang, Huamin ;
Chen, Kexin ;
Hu, Limei ;
Sahin, Aysegul ;
Krishnamurthy, Savitri ;
Ozer, Ayse ;
Hao, Xishan ;
Zhang, Wei .
BREAST CANCER RESEARCH, 2008, 10 (04)
[2]  
Banerjee O, 2008, J MACH LEARN RES, V9, P485
[3]   Emergence of scaling in random networks [J].
Barabási, AL ;
Albert, R .
SCIENCE, 1999, 286 (5439) :509-512
[4]   Adaptive linear step-up procedures that control the false discovery rate [J].
Benjamini, Yoav ;
Krieger, Abba M. ;
Yekutieli, Daniel .
BIOMETRIKA, 2006, 93 (03) :491-507
[5]  
Biihlmann P.L., 2011, STAT HIGH DIMENSIONA
[6]   Variable selection in high-dimensional linear models: partially faithful distributions and the PC-simple algorithm [J].
Buehlmann, P. ;
Kalisch, M. ;
Maathuis, M. H. .
BIOMETRIKA, 2010, 97 (02) :261-278
[7]   Covariate-adjusted precision matrix estimation with an application in genetical genomics [J].
Cai, T. Tony ;
Li, Hongzhe ;
Liu, Weidong ;
Xie, Jichun .
BIOMETRIKA, 2013, 100 (01) :139-156
[8]  
Castelo R, 2006, J MACH LEARN RES, V7, P2621
[9]   Reverse Engineering Molecular Regulatory Networks from Microarray Data with qp-Graphs [J].
Castelo, Robert ;
Roverato, Alberto .
JOURNAL OF COMPUTATIONAL BIOLOGY, 2009, 16 (02) :213-227
[10]   Genetic susceptibility to breast cancer: HLA DQB*03032 and HLA DRB1*11 may represent protective alleles [J].
Chaudhuri, S ;
Cariappa, A ;
Tang, M ;
Bell, D ;
Haber, DA ;
Isselbacher, KJ ;
Finkelstein, D ;
Forcione, D ;
Pillai, S .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2000, 97 (21) :11451-11454