Exploiting Feature Relationships Towards Stable Feature Selection

被引:0
作者
Kamkar, Iman [1 ]
Gupta, Sunil Kumar [1 ]
Dinh Phung [1 ]
Venkatesh, Svetha [1 ]
机构
[1] Deakin Univ, Ctr Pattern Recognit & Data Analyt, Geelong, Vic 3217, Australia
来源
PROCEEDINGS OF THE 2015 IEEE INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS (IEEE DSAA 2015) | 2015年
关键词
Stability; Lasso; Correlated features; Prediction; VARIABLE SELECTION; MODEL SELECTION; REGRESSION; PREDICTOR; MORTALITY; SURVIVAL; CANCER;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Feature selection is an important step in building predictive models for most real-world problems. One of the popular methods in feature selection is Lasso. However, it shows instability in selecting features when dealing with correlated features. In this work, we propose a new method that aims to increase the stability of Lasso by encouraging similarities between features based on their relatedness, which is captured via a feature covariance matrix. Besides modeling positive feature correlations, our method can also identify negative correlations between features. We propose a convex formulation for our model along with an alternating optimization algorithm that can learn the weights of the features as well as the relationship between them. Using both synthetic and real-world data, we show that the proposed method is more stable than Lasso and many state-of-the-art shrinkage and feature selection methods. Also, its predictive performance is comparable to other methods.
引用
收藏
页码:727 / 736
页数:10
相关论文
共 35 条
  • [1] [Anonymous], 2009, P 26 ANN INT C MACH, DOI DOI 10.1145/1553374.1553431
  • [2] [Anonymous], 2006, Journal of the Royal Statistical Society, Series B
  • [3] [Anonymous], 2008, P 14 ACM SIGKDD INT
  • [4] Attribute clustering for grouping, selection, and classification of gene expression data
    Au, WH
    Chan, KCC
    Wong, AKC
    Wang, Y
    [J]. IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2005, 2 (02) : 83 - 101
  • [5] Bach F. R., 2008, P 25 INT C MACH LEAR, P31
  • [6] Simultaneous regression shrinkage, variable selection, and supervised clustering of predictors with OSCAR
    Bondell, Howard D.
    Reich, Brian J.
    [J]. BIOMETRICS, 2008, 64 (01) : 115 - 123
  • [7] Distributed optimization and statistical learning via the alternating direction method of multipliers
    Boyd S.
    Parikh N.
    Chu E.
    Peleato B.
    Eckstein J.
    [J]. Foundations and Trends in Machine Learning, 2010, 3 (01): : 1 - 122
  • [8] Boyd S., 2004, CONVEX OPTIMIZATION, VFirst, DOI DOI 10.1017/CBO9780511804441
  • [9] Correlated variables in regression: Clustering and sparse estimation
    Buehlmann, Peter
    Ruetimann, Philipp
    van de Geer, Sara
    Zhang, Cun-Hui
    [J]. JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2013, 143 (11) : 1835 - 1858
  • [10] Caro JJ, 2001, CANCER-AM CANCER SOC, V91, P2214, DOI 10.1002/1097-0142(20010615)91:12<2214::AID-CNCR1251>3.0.CO