Reducing bias and mitigating the influence of excess of zeros in regression covariates with multi-outcome adaptive LAD-lasso

被引:1
作者
Mottonen, Jyrki [1 ]
Lahderanta, Tero [2 ]
Salonen, Janne [3 ]
Sillanpaa, Mikko J. [2 ]
机构
[1] Univ Helsinki, Dept Math & Stat, Helsinki, Finland
[2] Univ Oulu, Res Unit Math Sci, Oulu, Finland
[3] Finnish Publ Sect Pens Provider Keva, Helsinki, Finland
关键词
Multivariate analysis; p >> n regression; penalized regression; robust procedures; variable selection; zero-inflated continuous data; NONCONCAVE PENALIZED LIKELIHOOD; VARIABLE SELECTION; SHRINKAGE; MODELS;
D O I
10.1080/03610926.2023.2189059
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Zero-inflated explanatory variables, as opposed to outcome variables, are common, for example, in environmental sciences. In this article, we address the problem of having excess of zero values in some continuous explanatory variables, which are subject to multi-outcome lasso-regularized variable selection. In short, the problem results from the failure of the lasso-type of shrinkage methods to recognize any difference between zero value occurring either in the regression coefficient or in the corresponding value of the explanatory variable. This kind of confounding will obviously increase the number of false positives - all non-zero regression coefficients do not necessarily represent true outcome effects. We present here the adaptive LAD-lasso for multiple outcomes, which extends the earlier work of multi-outcome LAD-lasso with adaptive penalization. In addition to well-known property of having less biased regression coefficients, we show that the adaptivity also improves method's ability to recover from influences of excess of zero values measured in continuous covariates.
引用
收藏
页码:4730 / 4744
页数:15
相关论文
共 32 条