Generalized Variable Selection Algorithms for Gaussian Process Models by LASSO-Like Penalty

被引:3
作者
Hu, Zhiyong [1 ]
Dey, Dipak K. [1 ,2 ]
机构
[1] Univ Connecticut, Dept Stat, Storrs, CT 06269 USA
[2] Univ Connecticut, Dept Stat, Storrs, CT 06269 USA
关键词
Automatic relevance determination; Electroencephalography data; Gaussian process; Principal component analysis; Variable selection; STOCHASTIC-PROCESS; EMULATION;
D O I
10.1080/10618600.2023.2256802
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
With the rapid development of modern technology, massive amounts of data with complex pattern are generated. Gaussian process models that can easily fit the nonlinearity in data become more and more popular nowadays. It is often the case that in some data only a few features are important or active. However, unlike classical linear models, it is challenging to identify active variables in Gaussian process models. One of the most commonly used methods for variable selection in Gaussian process models is automatic relevance determination, which is known to be open-ended. There is no rule of thumb to determine the threshold for dropping features, which makes the variable selection in Gaussian process models ambiguous. In this work, we propose two variable selection algorithms for Gaussian process models, which use the artificial nuisance columns as baseline for identifying the active features. Moreover, the proposed methods work for both regression and classification problems. The algorithms are demonstrated using comprehensive simulation experiments and an application to multi-subject electroencephalography data that studies alcoholic levels of experimental subjects. Supplementary materials for this article are available online.
引用
收藏
页码:477 / 486
页数:10
相关论文
共 25 条
  • [1] CONTROLLING THE FALSE DISCOVERY RATE VIA KNOCKOFFS
    Barber, Rina Foygel
    Candes, Emmanuel J.
    [J]. ANNALS OF STATISTICS, 2015, 43 (05) : 2055 - 2085
  • [2] Bingham E, 2019, J MACH LEARN RES, V20
  • [3] Variational Inference: A Review for Statisticians
    Blei, David M.
    Kucukelbir, Alp
    McAuliffe, Jon D.
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2017, 112 (518) : 859 - 877
  • [4] Dance H., 2022, INT C ARTIFICIAL INT, P7976
  • [5] RobustGaSP: Robust Gaussian Stochastic Process Emulation in R
    Gu, Mengyang
    Palomo, Jesus
    Berger, James O.
    [J]. R JOURNAL, 2019, 11 (01): : 112 - 136
  • [6] Jointly Robust Prior for Gaussian Stochastic Process in Emulation, Calibration and Variable Selection
    Gu, Mengyang
    [J]. BAYESIAN ANALYSIS, 2019, 14 (03): : 857 - 885
  • [7] Hensman J, 2015, JMLR WORKSH CONF PRO, V38, P351
  • [8] Hensman James, 2013, ARXIV
  • [9] Hoffman MD, 2013, J MACH LEARN RES, V14, P1303
  • [10] Local-Aggregate Modeling for Big Data via Distributed Optimization: Applications to Neuroimaging
    Hu, Yue
    Allen, Genevera I.
    [J]. BIOMETRICS, 2015, 71 (04) : 905 - 917