Regularized Estimation and Feature Selection in Mixtures of Gaussian-Gated Experts Models

被引:2
作者
Chamroukhi, Faicel [1 ]
Lecocq, Florian [2 ]
Nguyen, Hien D. [3 ]
机构
[1] Univ Queensland, Dept Math, Brisbane, Qld 4072, Australia
[2] Univ Caen, Lab Math Nicolas Oresme LMNO UMR CNRS, Unicaen Campus 2, F-14000 Caen, France
[3] La Trobe Univ, Dept Math & Stat, Melbourne, Vic 3086, Australia
来源
STATISTICS AND DATA SCIENCE, RSSDS 2019 | 2019年 / 1150卷
关键词
Mixtures-of-experts; Clustering; Feature selection; EM algorithm; Lasso; High-dimensional data; MAXIMUM-LIKELIHOOD; REGRESSION;
D O I
10.1007/978-981-15-1960-4_3
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Mixtures-of-Experts models and their maximum likelihood estimation (MLE) via the EM algorithm have been thoroughly studied in the statistics and machine learning literature. They are subject of a growing investigation in the context of modeling with high-dimensional predictors with regularized MLE. We examine MoE with Gaussian gating network, for clustering and regression, and propose an l(1)-regularized MLE to encourage sparse models and deal with the high-dimensional setting. We develop an EM-Lasso algorithm to perform parameter estimation and utilize a BIC-like criterion to select the model parameters, including the sparsity tuning hyperparameters. Experiments conducted on simulated data show the good performance of the proposed regularized MLE compared to the standard MLE with the EM algorithm.
引用
收藏
页码:42 / 56
页数:15
相关论文
共 24 条
  • [1] [Anonymous], 2015, ARXIV150606707
  • [2] Skew t mixture of experts
    Chamroukhi, F.
    [J]. NEUROCOMPUTING, 2017, 266 : 390 - 408
  • [3] Robust mixture of experts modeling using the t distribution
    Chamroukhi, F.
    [J]. NEURAL NETWORKS, 2016, 79 : 20 - 36
  • [4] Joint segmentation of multivariate time series with hidden process regression for human activity recognition
    Chamroukhi, F.
    Mohammed, S.
    Trabelsi, D.
    Oukhellou, L.
    Amirat, Y.
    [J]. NEUROCOMPUTING, 2013, 120 : 633 - 644
  • [5] Chamroukhi Faicel, 2009, Proceedings 2009 International Joint Conference on Neural Networks (IJCNN 2009 - Atlanta), P489, DOI 10.1109/IJCNN.2009.5178921
  • [6] Chamroukhi F, 2019, J SFDS, V160, P57
  • [7] Chamroukhi F, 2016, IEEE IJCNN, P3000, DOI 10.1109/IJCNN.2016.7727580
  • [8] MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM
    DEMPSTER, AP
    LAIRD, NM
    RUBIN, DB
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01): : 1 - 38
  • [9] PATHWISE COORDINATE OPTIMIZATION
    Friedman, Jerome
    Hastie, Trevor
    Hoefling, Holger
    Tibshirani, Robert
    [J]. ANNALS OF APPLIED STATISTICS, 2007, 1 (02) : 302 - 332
  • [10] Hastie T., 2019, Statistical Learning with Sparsity: The Lasso and Generalizations