Robust sparse regression by modeling noise as a mixture of gaussians

Cited by: 4
Authors
Xu, Shuang [1]
Zhang, Chun-Xia [1]
Affiliations
[1] Xi An Jiao Tong Univ, Sch Math & Stat, Xian 710049, Shaanxi, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Robust regression; penalized regression; variable selection; mixture of Gaussians; lasso; VARIABLE SELECTION; REGULARIZATION; SHRINKAGE; ALGORITHM;
DOI
10.1080/02664763.2019.1566448
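Chinese Library Classification (CLC) number
O21 [Probability Theory and Mathematical Statistics]; C8 [Statistics];
Subject classification codes
020208; 070103; 0714;
Abstract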
Regression analysis has proven to be an effective tool in a large variety of fields. Many regression models assume that the noise follows a specific distribution. Although this greatly facilitates theoretical analysis, the model-fitting performance may be poor, since the assumed noise distribution may deviate substantially from the real noise. Meanwhile, the model is also expected to be robust, given the complexity of real-world data. Without assuming a specific form for the noise, we propose in this paper a novel sparse regression method called MoG-Lasso, which directly models the noise in linear regression models via a mixture of Gaussian distributions (MoG). The penalty is included as a part of the loss function of MoG-Lasso to enhance its ability to identify a sparse model. To estimate the parameters in MoG-Lasso, we present an efficient algorithm based on the EM (expectation maximization) and ADMM (alternating direction method of multipliers) algorithms. Experiments on simulated and real data contaminated by complex noise show that MoG-Lasso performs better than several other popular methods, including Lasso, LAD-Lasso and Huber-Lasso, in both 'p>n' and 'p<n' situations.
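The abstract's combination of EM and ADMM can be illustrated with a minimal sketch. This is not the authors' algorithm, only one plausible reading under stated assumptions: the noise is a zero-mean mixture of K Gaussians, the penalty is an L1 penalty, and each M-step for the coefficients is a weighted lasso solved by ADMM. The function names (`mog_lasso_sketch`, `weighted_lasso_admm`), the plain-lasso initialization, the choice K = 2, and the heuristic for `rho` are all hypothetical choices made for illustration.

```python
import numpy as np

def soft_threshold(v, t):
    """Elementwise soft-thresholding, the proximal operator of t * ||.||_1."""
    return np.sign(v) * np.maximum(np.abs(v) - t, 0.0)

def weighted_lasso_admm(X, y, w, lam, n_iter=200):
    """Minimize 0.5 * sum_i w_i * (y_i - x_i^T b)^2 + lam * ||b||_1 by ADMM."""
    sw = np.sqrt(w)
    A, b = X * sw[:, None], y * sw           # fold the weights into the data
    p = X.shape[1]
    AtA, Atb = A.T @ A, A.T @ b
    rho = np.trace(AtA) / p                  # assumed heuristic step parameter
    lhs = AtA + rho * np.eye(p)
    z, u = np.zeros(p), np.zeros(p)
    for _ in range(n_iter):
        beta = np.linalg.solve(lhs, Atb + rho * (z - u))   # quadratic step
        z = soft_threshold(beta + u, lam / rho)            # sparsity step
        u = u + beta - z                                   # dual update
    return z                                 # z carries the exact zeros

def mog_lasso_sketch(X, y, lam, n_components=2, n_em=30):
    """EM sketch: zero-mean Gaussian-mixture noise plus an L1 penalty (assumed)."""
    n, _ = X.shape
    # Assumed initialization: an ordinary (unweighted) lasso fit.
    beta = weighted_lasso_admm(X, y, np.ones(n), lam)
    pi = np.full(n_components, 1.0 / n_components)
    sigma2 = np.var(y - X @ beta) * np.logspace(-1, 1, n_components)
    for _ in range(n_em):
        r = y - X @ beta
        # E-step: responsibility of each noise component for each residual.
        log_d = (np.log(pi) - 0.5 * np.log(2 * np.pi * sigma2)
                 - 0.5 * r[:, None] ** 2 / sigma2)
        log_d -= log_d.max(axis=1, keepdims=True)
        gamma = np.exp(log_d)
        gamma /= gamma.sum(axis=1, keepdims=True)
        # M-step: mixing weights and variances, then the coefficients.
        nk = gamma.sum(axis=0) + 1e-12
        pi = nk / n
        sigma2 = np.maximum((gamma * r[:, None] ** 2).sum(axis=0) / nk, 1e-8)
        w = (gamma / sigma2).sum(axis=1)     # per-observation precision
        beta = weighted_lasso_admm(X, y, w, lam)
    return beta, pi, sigma2
```

In this sketch, observations that the E-step attributes to a high-variance component receive a small weight `w`, which is what makes the fit less sensitive to outliers than an ordinary lasso. The value of `lam` acts on the scale of the weighted log-likelihood, so it would need tuning for any real data set.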
Pages: 1738-1755
Page count: 18
Related papers
50 in total
  • [41] Sparse regression with exact clustering
    She, Yiyuan
    ELECTRONIC JOURNAL OF STATISTICS, 2010, 4 : 1055 - 1096
  • [42] Robust mixture regression modeling based on the normal mean-variance mixture distributions
    Naderi, Mehrdad
    Mirfarah, Elham
    Wang, Wan-Lun
    Lin, Tsung-I
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2023, 180
  • [43] Robust finite mixture regression for heterogeneous targets
    Liang, Jian
    Chen, Kun
    Lin, Ming
    Zhang, Changshui
    Wang, Fei
    DATA MINING AND KNOWLEDGE DISCOVERY, 2018, 32 (06) : 1509 - 1560
  • [44] Sparse Multivariate Gaussian Mixture Regression
    Weruaga, Luis
    Via, Javier
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2015, 26 (05) : 1098 - 1108
  • [45] Penalized MM regression estimation with L_γ penalty: a robust version of bridge regression
    Arslan, Olcay
    STATISTICS, 2016, 50 (06) : 1236 - 1260
  • [46] THE SPARSE LAPLACIAN SHRINKAGE ESTIMATOR FOR HIGH-DIMENSIONAL REGRESSION
    Huang, Jian
    Ma, Shuangge
    Li, Hongzhe
    Zhang, Cun-Hui
    ANNALS OF STATISTICS, 2011, 39 (04) : 2021 - 2046
  • [47] ADMM for High-Dimensional Sparse Penalized Quantile Regression
    Gu, Yuwen
    Fan, Jun
    Kong, Lingchen
    Ma, Shiqian
    Zou, Hui
    TECHNOMETRICS, 2018, 60 (03) : 319 - 331
  • [48] FULLY EFFICIENT ROBUST ESTIMATION, OUTLIER DETECTION AND VARIABLE SELECTION VIA PENALIZED REGRESSION
    Kong, Dehan
    Bondell, Howard D.
    Wu, Yichao
    STATISTICA SINICA, 2018, 28 (02) : 1031 - 1052
  • [49] Joint sparse principal component regression with robust property
    Qi, Kai
    Tu, Jingwen
    Yang, Hu
    EXPERT SYSTEMS WITH APPLICATIONS, 2022, 187
  • [50] Sparse relative risk regression models
    Wit, Ernst C.
    Augugliaro, Luigi
    Pazira, Hassan
    Gonzalez, Javier
    Abegaz, Fentaw
    BIOSTATISTICS, 2020, 21 (02) : E131 - E147