Max-Margin feature selection

被引:5
|
作者
Prasad, Yamuna [1 ]
Khandelwal, Dinesh [1 ]
Biswas, K. K. [1 ]
机构
[1] Indian Inst Technol Delhi, Dept Comp Sci & Engn, New Delhi 110016, India
关键词
Feature selection; One class SVM; Max-Margin; INFORMATION;
D O I
10.1016/j.patrec.2017.04.011
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Many machine learning applications such as in vision, biology and social networking deal with data in high dimensions. Feature selection is typically employed to select a subset of features which improves generalization accuracy as well as reduces the computational cost of learning the model. One of the criteria used for feature selection is to jointly minimize the redundancy and maximize the relevance of the selected features. In this paper, we formulate the task of feature selection as a one class SVM problem in a space where features correspond to the data points and instances correspond to the dimensions. The goal is to look for a representative subset of the features (support vectors) which describes the boundary for the region where the set of the features (data points) exists. This leads to a joint optimization of relevance and redundancy in a principled max-margin framework. Additionally, our formulation enables us to leverage existing techniques for optimizing the SVM objective resulting in highly computationally efficient solutions for the task of feature selection. Specifically, we employ the dual coordinate descent algorithm (Hsieh et al., 2008), originally proposed for SVMs, for our formulation. We use a sparse representation to deal with data in very high dimensions. Experiments on seven publicly available benchmark datasets from a variety of domains show that our approach results in orders of magnitude faster solutions even while retaining the same level of accuracy compared to the state of the art feature selection techniques. (C) 2017 Published by Elsevier B.V.
引用
收藏
页码:51 / 57
页数:7
相关论文
共 50 条
  • [31] Spectral Regularization for Max-Margin Sequence Tagging
    Quattoni, Ariadna
    Balle, Borja
    Carreras, Xavier
    Globerson, Amir
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 32 (CYCLE 2), 2014, 32 : 1710 - 1718
  • [32] A Max-Margin Learning Algorithm with Additional Features
    Liu, Xinwang
    Yin, Jianping
    Zhu, En
    Zhan, Yubin
    Li, Miaomiao
    Zhang, Changwang
    FRONTIERS IN ALGORITHMICS, PROCEEDINGS, 2009, 5598 : 196 - +
  • [33] Primal Sparse Max-Margin Markov Networks
    Zhu, Jun
    Xing, Eric P.
    Zhang, Bo
    KDD-09: 15TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2009, : 1047 - 1055
  • [34] Scalable Inference in Max-margin Topic Models
    Zhu, Jun
    Zheng, Xun
    Zhou, Li
    Zhang, Bo
    19TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING (KDD'13), 2013, : 964 - 972
  • [35] Optimizing for Measure of Performance in Max-Margin Parsing
    Bauer, Alexander
    Nakajima, Shinichi
    Goernitz, Nico
    Mueller, Klaus-Robert
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (07) : 2680 - 2684
  • [36] Max-Margin Infinite Hidden Markov Models
    Zhang, Aonan
    Zhu, Jun
    Zhang, Bo
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 32 (CYCLE 1), 2014, 32
  • [37] Max-margin classification of data with absent features
    Chechik, Gal
    Heitz, Geremy
    Elidan, Gal
    Abbeel, Pieter
    Koller, Daphne
    Journal of Machine Learning Research, 2008, 9 : 1 - 21
  • [38] Max-Margin Boltzmann Machines for Object Segmentation
    Yang, Jimei
    Safar, Simon
    Yang, Ming-Hsuan
    2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 320 - 327
  • [39] Infinite Bayesian Max-Margin Discriminant Projection
    Wen, Wei
    Chen, Bo
    Cao, Xuefei
    Zhang, Xuefeng
    Wang, Zhengjue
    Liu, Hongwei
    IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (05) : 3936 - 3946
  • [40] Max-margin classification of data with absent features
    Chechik, Gal
    Heitz, Geremy
    Elidan, Gal
    Abbeel, Pieter
    Koller, Daphne
    JOURNAL OF MACHINE LEARNING RESEARCH, 2008, 9 : 1 - 21