Exploiting Disagreement Between High-Dimensional Variable Selectors for Uncertainty Visualization

被引:2
作者
Yuen, Christine [1 ]
Fryzlewicz, Piotr [1 ]
机构
[1] London Sch Econ & Polit Sci, Dept Stat, London, England
基金
英国工程与自然科学研究理事会;
关键词
High-dimensional data; Uncertainty visualization; Variable selection; MODEL SELECTION; REGRESSION; LASSO; INFERENCE;
D O I
10.1080/10618600.2021.2000421
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We propose combined selection and uncertainty visualizer (CSUV), which visualizes selection uncertainties for covariates in high-dimensional linear regression by exploiting the (dis)agreement among different base selectors. Our proposed method highlights covariates that get selected the most frequently by the different base variable selection methods on subsampled data. The method is generic and can be used with different existing variable selection methods. We demonstrate its performance using real and simulated data.
引用
收藏
页码:351 / 359
页数:9
相关论文
共 37 条
[1]  
[Anonymous], 2014, Advances in Neural Information Processing Systems
[2]  
[Anonymous], 2006, Journal of the Royal Statistical Society, Series B, DOI DOI 10.1111/J.1467-9868.2005.00532.X
[3]  
Bach F., 2008, Proceedings of the 25th International Conference on Machine Learning, ICML'08, page, P33
[4]   RANKING-BASED VARIABLE SELECTION FOR HIGH-DIMENSIONAL DATA [J].
Baranowski, Rafal ;
Chen, Yining ;
Fryzlewicz, Piotr .
STATISTICA SINICA, 2020, 30 (03) :1485-1516
[5]  
BEALE EML, 1967, BIOMETRIKA, V54, P357
[6]  
Candes E, 2007, ANN STAT, V35, P2313, DOI 10.1214/009053606000001523
[7]   Bootstrapping Lasso Estimators [J].
Chatterjee, A. ;
Lahiri, S. N. .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2011, 106 (494) :608-625
[8]   Extended Bayesian information criteria for model selection with large model spaces [J].
Chen, Jiahua ;
Chen, Zehua .
BIOMETRIKA, 2008, 95 (03) :759-771
[9]  
Fan JQ, 2010, STAT SINICA, V20, P101
[10]   Variable selection via nonconcave penalized likelihood and its oracle properties [J].
Fan, JQ ;
Li, RZ .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2001, 96 (456) :1348-1360