Exploiting Disagreement Between High-Dimensional Variable Selectors for Uncertainty Visualization

被引:2
作者
Yuen, Christine [1 ]
Fryzlewicz, Piotr [1 ]
机构
[1] London Sch Econ & Polit Sci, Dept Stat, London, England
基金
英国工程与自然科学研究理事会;
关键词
High-dimensional data; Uncertainty visualization; Variable selection; MODEL SELECTION; REGRESSION; LASSO; INFERENCE;
D O I
10.1080/10618600.2021.2000421
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We propose combined selection and uncertainty visualizer (CSUV), which visualizes selection uncertainties for covariates in high-dimensional linear regression by exploiting the (dis)agreement among different base selectors. Our proposed method highlights covariates that get selected the most frequently by the different base variable selection methods on subsampled data. The method is generic and can be used with different existing variable selection methods. We demonstrate its performance using real and simulated data.
引用
收藏
页码:351 / 359
页数:9
相关论文
共 37 条
[21]   Variable selection with error control: another look at stability selection [J].
Shah, Rajen D. ;
Samworth, Richard J. .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2013, 75 (01) :55-80
[22]   PROSTATE SPECIFIC ANTIGEN IN THE DIAGNOSIS AND TREATMENT OF ADENOCARCINOMA OF THE PROSTATE .2. RADICAL PROSTATECTOMY TREATED PATIENTS [J].
STAMEY, TA ;
KABALIN, JN ;
MCNEAL, JE ;
JOHNSTONE, IM ;
FREIHA, F ;
REDWINE, EA ;
YANG, N .
JOURNAL OF UROLOGY, 1989, 141 (05) :1076-1083
[24]   UNIFORM ASYMPTOTIC INFERENCE AND THE BOOTSTRAP AFTER MODEL SELECTION [J].
Tibshirani, Ryan J. ;
Rinaldo, Alessandro ;
Tibshirani, Rob ;
Wasserman, Larry .
ANNALS OF STATISTICS, 2018, 46 (03) :1255-1287
[25]   Exact Post-Selection Inference for Sequential Regression Procedures [J].
Tibshirani, Ryan J. ;
Taylor, Jonathan ;
Lockhart, Richard ;
Tibshirani, Robert .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2016, 111 (514) :600-614
[26]   Combining multiple feature selection methods for stock prediction: Union, intersection, and multi-intersection approaches [J].
Tsai, Chih-Fong ;
Hsiao, Yu-Chieh .
DECISION SUPPORT SYSTEMS, 2010, 50 (01) :258-269
[27]   ON ASYMPTOTICALLY OPTIMAL CONFIDENCE REGIONS AND TESTS FOR HIGH-DIMENSIONAL MODELS [J].
Van de Geer, Sara ;
Buehlmann, Peter ;
Ritov, Ya'acov ;
Dezeure, Ruben .
ANNALS OF STATISTICS, 2014, 42 (03) :1166-1202
[28]   Toward an Objective and Reproducible Model Choice via Variable Selection Deviation [J].
Yang, Wenjing ;
Yang, Yuhong .
BIOMETRICS, 2017, 73 (01) :20-30
[29]   Adaptive regression by mixing [J].
Yang, YH .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2001, 96 (454) :574-588
[30]   Combining linear regression models: When and how? [J].
Yuan, Z ;
Yang, YH .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2005, 100 (472) :1202-1214