Statistical predictions with glmnet

Cited by: 716
Authors
Engebretsen, Solveig [1, 2]
Bohlin, Jon [1, 3, 4]
Affiliations
[1] Norwegian Inst Publ Hlth, Dept Infect Dis Epidemiol & Modelling, Div Infect Control & Environm Hlth, Oslo, Norway
[2] Univ Oslo, Dept Biostat, Oslo Ctr Biostat & Epidemiol, Oslo, Norway
[3] Norwegian Inst Publ Hlth, Ctr Fertil & Hlth CEFH, Oslo, Norway
[4] Norwegian Univ Life Sci, Dept Prod Anim, Fac Vet Sci, As, Norway
Keywords
Elastic net; glmnet package; Statistical prediction; Ultra-high dimensional regression; Variable selection; Regression; Regularization; Lasso
DOI
10.1186/s13148-019-0730-1
Chinese Library Classification number
R73 [Oncology]
Discipline classification code
100214
Abstract
Elastic-net-type regression methods have become very popular for predicting outcomes in epigenome-wide association studies (EWAS). These methods accept biased coefficient estimates in return for lower variance, and thereby obtain improved prediction accuracy. We provide guidelines on how to obtain parsimonious models with low mean squared error and include easy-to-follow walk-through examples for each step in R.
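The trade-off described in the abstract (biased, shrunken coefficients in exchange for lower variance) can be illustrated with a minimal sketch. It is written in Python with NumPy rather than R, and it is not the glmnet implementation or the authors' walk-through. It assumes the commonly used elastic net objective (1/2n)||y - Xb||^2 + lam * (alpha * ||b||_1 + (1 - alpha)/2 * ||b||_2^2), and the names soft_threshold and elastic_net_cd, the simulated data, and the values of lam and alpha are all hypothetical choices made for illustration.

```python
import numpy as np


def soft_threshold(z, t):
    """Shrink z toward zero by t; values within [-t, t] become exactly 0."""
    return np.sign(z) * np.maximum(np.abs(z) - t, 0.0)


def elastic_net_cd(X, y, lam, alpha, n_iter=200):
    """Cyclic coordinate descent for the assumed elastic net objective.

    Illustrative only: no intercept, no convergence check, fixed sweep count.
    X and y are expected to be centered beforehand.
    """
    n, p = X.shape
    beta = np.zeros(p)
    col_sq = (X ** 2).sum(axis=0) / n      # (1/n) * x_j' x_j for each column
    resid = y - X @ beta
    for _ in range(n_iter):
        for j in range(p):
            resid += X[:, j] * beta[j]     # partial residual without feature j
            rho = X[:, j] @ resid / n      # covariance of feature j with it
            # The L1 part sets small effects to zero; the L2 part shrinks the rest.
            beta[j] = soft_threshold(rho, lam * alpha) / (col_sq[j] + lam * (1 - alpha))
            resid -= X[:, j] * beta[j]     # restore the full residual
    return beta


# Simulated "more features than samples" data (hypothetical, not EWAS data).
rng = np.random.default_rng(0)
n, p = 50, 200
X = rng.standard_normal((n, p))
X = (X - X.mean(axis=0)) / X.std(axis=0)   # standardize columns
true_beta = np.zeros(p)
true_beta[:5] = 2.0                        # only five features carry signal
y = X @ true_beta + rng.standard_normal(n)
y = y - y.mean()

beta_hat = elastic_net_cd(X, y, lam=0.5, alpha=0.5)
print("non-zero coefficients:", int(np.count_nonzero(beta_hat)), "of", p)
print("estimates for the five signal features:", np.round(beta_hat[:5], 2))
```

Under this objective, alpha = 1 corresponds to the lasso and alpha = 0 to ridge regression. In practice the penalty strength would be chosen by cross-validation rather than fixed by hand as it is here.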
Pages: 3