Robust and consistent variable selection in high-dimensional generalized linear models

被引:23
作者
Avella-Medina, Marco [1 ]
Ronchetti, Elvezio [2 ]
机构
[1] MIT, Sloan Sch Management, 30 Mem Dr, Cambridge, MA 02142 USA
[2] Univ Geneva, Res Ctr Stat, Blvd Pont Arve 40, CH-1205 Geneva, Switzerland
基金
瑞士国家科学基金会;
关键词
Contamination neighbourhood; Generalized linear model; Infinitesimal robustness; Lasso; Oracle estimator; Robust quasilikelihood; NONCONCAVE PENALIZED LIKELIHOOD; REGRESSION SHRINKAGE; CONFIDENCE-INTERVALS; ADAPTIVE LASSO; INFERENCE; ESTIMATORS; REGULARIZATION;
D O I
10.1093/biomet/asx070
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Generalized linear models are popular for modelling a large variety of data. We consider variable selection through penalized methods by focusing on resistance issues in the presence of outlying data and other deviations from assumptions. We highlight the weaknesses of widely-used penalized M-estimators, propose a robust penalized quasilikelihood estimator, and show that it enjoys oracle properties in high dimensions and is stable in a neighbourhood of the model. We illustrate its finite-sample performance on simulated and real data.
引用
收藏
页码:31 / 44
页数:14
相关论文
共 63 条
  • [1] SPARSE LEAST TRIMMED SQUARES REGRESSION FOR ANALYZING HIGH-DIMENSIONAL LARGE DATA SETS
    Alfons, Andreas
    Croux, Christophe
    Gelper, Sarah
    [J]. ANNALS OF APPLIED STATISTICS, 2013, 7 (01) : 226 - 248
  • [2] [Anonymous], 2006, Journal of the Royal Statistical Society, Series B
  • [3] Influence functions for penalized M-estimators
    Avella-Medina, Marco
    [J]. BERNOULLI, 2017, 23 (4B) : 3178 - 3196
  • [4] Uniform post-selection inference for least absolute deviation regression and other Z-estimation problems
    Belloni, A.
    Chernozhukov, V.
    Kato, K.
    [J]. BIOMETRIKA, 2015, 102 (01) : 77 - 94
  • [5] Sparse Models and Methods for Optimal Instruments With an Application to Eminent Domain
    Belloni, A.
    Chen, D.
    Chernozhukov, V.
    Hansen, C.
    [J]. ECONOMETRICA, 2012, 80 (06) : 2369 - 2429
  • [6] Bianco A.M., 1996, Lecture Notes in Statistics, P17, DOI DOI 10.1007/978-1-4612-2380-1_2
  • [7] BETTER SUBSET REGRESSION USING THE NONNEGATIVE GARROTE
    BREIMAN, L
    [J]. TECHNOMETRICS, 1995, 37 (04) : 373 - 384
  • [8] High-Dimensional Statistics with a View Toward Applications in Biology
    Buehlmann, Peter
    Kalisch, Markus
    Meier, Lukas
    [J]. ANNUAL REVIEW OF STATISTICS AND ITS APPLICATION, VOL 1, 2014, 1 : 255 - U809
  • [9] Bühlmann P, 2011, SPRINGER SER STAT, P1, DOI 10.1007/978-3-642-20192-9
  • [10] Robust inference for generalized linear models
    Cantoni, E
    Ronchetti, E
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2001, 96 (455) : 1022 - 1030