Statistical analysis of big data on pharmacogenomics

被引:41
作者
Fan, Jianqing [1 ]
Liu, Han [1 ]
机构
[1] Princeton Univ, Dept Operat Res & Financial Engn, Princeton, NJ 08544 USA
基金
美国国家科学基金会;
关键词
Big data; High dimensional statistics; Approximate factor model; Graphical model; Multiple testing; Variable selection; Marginal screening; Robust statistics; NONCONCAVE PENALIZED LIKELIHOOD; COVARIANCE-MATRIX ESTIMATION; FALSE DISCOVERY RATE; GENERALIZED LINEAR-MODELS; VARIABLE SELECTION; THRESHOLDING ALGORITHM; REGULARIZATION; CLASSIFICATION; REGRESSION; SHRINKAGE;
D O I
10.1016/j.addr.2013.04.008
中图分类号
R9 [药学];
学科分类号
1007 ;
摘要
This paper discusses statistical methods for estimating complex correlation structure from large pharmacogenomic datasets. We selectively review several prominent statistical methods for estimating large covariance matrix for understanding correlation structure, inverse covariance matrix for network modeling, large-scale simultaneous tests for selecting significantly differently expressed genes and proteins and generic markers for complex diseases, and high dimensional variable selection for identifying important molecules for understanding molecule mechanisms in pharmacogenomics. Their applications to gene network estimation and biomarker selection are used to illustrate the methodological power. Several new challenges of Big data analysis, including complex data distribution, missing data, measurement error, spurious correlation, endogeneity, and the need for robust statistical methods, are also discussed. (C) 2013 Elsevier B.V. All rights reserved.
引用
收藏
页码:987 / 1000
页数:14
相关论文
共 101 条
[91]   Nonparametric estimation of large covariance matrices of longitudinal data [J].
Wu, WB ;
Pourahmadi, M .
BIOMETRIKA, 2003, 90 (04) :831-844
[92]   REGULARIZED RANK-BASED ESTIMATION OF HIGH-DIMENSIONAL NONPARANORMAL GRAPHICAL MODELS [J].
Xue, Lingzhou ;
Zou, Hui .
ANNALS OF STATISTICS, 2012, 40 (05) :2541-2571
[93]   Positive-Definite l1-Penalized Estimation of Large Covariance Matrices [J].
Xue, Lingzhou ;
Ma, Shiqian ;
Zou, Hui .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2012, 107 (500) :1480-1491
[94]   Nonparametric Modeling of Longitudinal Covariance Structure in Functional Mapping of Quantitative Trait Loci [J].
Yap, John Stephen ;
Fan, Jianqing ;
Wu, Rongling .
BIOMETRICS, 2009, 65 (04) :1068-1077
[95]   ON THE LIMIT OF THE LARGEST EIGENVALUE OF THE LARGE DIMENSIONAL SAMPLE COVARIANCE-MATRIX [J].
YIN, YQ ;
BAI, ZD ;
KRISHNAIAH, PR .
PROBABILITY THEORY AND RELATED FIELDS, 1988, 78 (04) :509-521
[96]  
Yuan M, 2010, J MACH LEARN RES, V11, P2261
[97]  
Zhang C-H, 2012, ADV NEURAL INF PROCE, P809
[98]   NEARLY UNBIASED VARIABLE SELECTION UNDER MINIMAX CONCAVE PENALTY [J].
Zhang, Cun-Hui .
ANNALS OF STATISTICS, 2010, 38 (02) :894-942
[99]   Principled sure independence screening for Cox models with ultra-high-dimensional covariates [J].
Zhao, Sihai Dave ;
Li, Yi .
JOURNAL OF MULTIVARIATE ANALYSIS, 2012, 105 (01) :397-411
[100]  
Zhao T, 2012, J MACH LEARN RES, V13, P1059