Feature selection with scalable variational gaussian process via sensitivity analysis based on L2 divergence

被引:2
|
作者
Jeon, Younghwan [1 ]
Hwang, Ganguk [1 ]
机构
[1] Korea Adv Inst Sci & Technol KAIST, Dept Math Sci, Daejeon 305701, South Korea
基金
新加坡国家研究基金会;
关键词
Gaussian processes; Scalable variational gaussian process; Feature selection; L2; divergence; VARIABLE SELECTION; MODELS;
D O I
10.1016/j.neucom.2022.11.013
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Feature selection is one of the most important issues in supervised learning and there are a lot of different feature selection approaches in the literature. Among them one recent approach is to use Gaussian pro-cess (GP) because it can capture well the hidden relevance between the features of the input and the out-put. However, the existing feature selection approaches with GP suffer from the scalability problem due to high computational cost of inference with GP. Moreover, they use the Kullback-Leibler (KL) divergence in the sensitivity analysis for feature selection, but we show in this paper that the KL divergence under-estimates the relevance of important features in some cases of classification. To remedy such drawbacks of the existing GP based approaches, we propose a new feature selection method with scalable variational Gaussian process (SVGP) and L2 divergence. With the help of SVGP the proposed method exploits given large data sets well for feature selection through so-called inducing points while avoiding the scalability problem. Moreover, we provide theoretical analysis to motivate the choice of L2 divergence for feature selection in both classification and regression. To validate the perfor-mance of the proposed method, we compare it with other existing methods through experiments with synthetic and real data sets. (c) 2022 Elsevier B.V. All rights reserved.
引用
收藏
页码:577 / 592
页数:16
相关论文
共 50 条
  • [41] PV Power Forecasting Using an Integrated GA-PSO-ANFIS Approach and Gaussian Process Regression Based Feature Selection Strategy
    Semero, Yordanos Kassa
    Zhang, Jianhua
    Zheng, Dehua
    CSEE JOURNAL OF POWER AND ENERGY SYSTEMS, 2018, 4 (02): : 210 - 218
  • [42] Feature Selection and Cancer Classification via Sparse Logistic Regression with the Hybrid L1/2+2 Regularization
    Huang, Hai-Hui
    Liu, Xiao-Ying
    Liang, Yong
    PLOS ONE, 2016, 11 (05):
  • [43] Feature Selection and Clustering via Robust Graph-Laplacian PCA Based on Capped L1-Norm
    Wu, Ming-Juan
    Liu, Jin-Xing
    Gao, Ying-Lian
    Kong, Xiang-Zhen
    Feng, Chun-Mei
    2017 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2017, : 1741 - 1745
  • [44] l2,1 norm regularized multi-kernel based joint nonlinear feature selection and over-sampling for imbalanced data classification
    Cao, Peng
    Liu, Xiaoli
    Zhang, Jian
    Zhao, Dazhe
    Huang, Min
    Zaiane, Osmar
    NEUROCOMPUTING, 2017, 234 : 38 - 57
  • [45] Feature Selection Method of Radar-based Road Target Recognition via Histogram Analysis and Adaptive Genetics
    Waqi R.
    Li G.
    Zhao Z.
    Ze Z.
    Journal of Radars, 2023, 12 (05) : 1014 - 1030
  • [46] Risk Factor Analysis of Bone Mineral Density Based on Feature Selection in Type 2 Diabetes
    Wang, Wei
    Jiang, Bingbing
    Ye, Shandong
    Qian, Liting
    2018 9TH IEEE INTERNATIONAL CONFERENCE ON BIG KNOWLEDGE (ICBK), 2018, : 221 - 226
  • [47] Exploring diagnosis and imaging biomarkers of Parkinson's disease via iterative canonical correlation analysis based feature selection
    Liu, Luyan
    Wang, Qian
    Adeli, Ehsan
    Zhang, Lichi
    Zhang, Han
    Shen, Dinggang
    COMPUTERIZED MEDICAL IMAGING AND GRAPHICS, 2018, 67 : 21 - 29
  • [48] Reverse designs of doubly reinforced concrete beams using Gaussian process regression models enhanced by sequence training/designing technique based on feature selection algorithms
    Hong, Won-Kee
    Pham, Tien Dat
    JOURNAL OF ASIAN ARCHITECTURE AND BUILDING ENGINEERING, 2022, 21 (06) : 2345 - 2370
  • [49] Semi-supervised multi-label feature selection via label correlation analysis with l1-norm graph embedding
    Wang, Xiao-dong
    Chen, Rung-Ching
    Hong, Chao-qun
    Zeng, Zhi-qiang
    Zhou, Zhi-li
    IMAGE AND VISION COMPUTING, 2017, 63 : 10 - 23
  • [50] AUTOMATED HYPERSPECTRAL IMAGERY ANALYSIS VIA SUPPORT VECTOR MACHINES BASED MULTI-CLASSIFIER SYSTEM WITH NON-UNIFORM RANDOM FEATURE SELECTION
    Samiappan, Sathishkumar
    Prasad, Saurabh
    Bruce, Lori M.
    2011 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2011, : 3915 - 3918