Linear Cost-sensitive Max-margin Embedded Feature Selection for SVM

Cited by: 15
Authors
Aram, Khalid Y. [1 ]
Lam, Sarah S. [2 ]
Khasawneh, Mohammad T. [2 ]
Affiliations
[1] Emporia State Univ, Dept Business Adm, Emporia, KS 66801 USA
[2] SUNY Binghamton, Dept Syst Sci & Ind Engn, Binghamton, NY 13902 USA
Keywords
Classification; Cost-sensitive learning; Feature selection; Mathematical programming; Support vector machines; VECTOR; CLASSIFICATION; MACHINE; CANCER;
DOI
10.1016/j.eswa.2022.116683
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The information needed for a certain machine application can often be obtained from a subset of the available features. Strongly relevant features should be retained to achieve desirable model performance. This research focuses on selecting relevant independent features for Support Vector Machine (SVM) classifiers in a cost-sensitive manner. A review of recent literature about feature selection for SVM revealed a lack of linear programming embedded SVM feature selection models. Most reviewed models were mixed-integer linear or nonlinear. Further, the review highlighted a lack of cost-sensitive SVM feature selection models. Cost sensitivity improves the generalization of SVM feature selection models, making them applicable to various cost-of-error situations. It also helps with handling imbalanced data. This research introduces an SVM-based filter method named Knapsack Max-Margin Feature Selection (KS-MMFS), which is a proposed linearization of the quadratic Max-Margin Feature Selection (MMFS) model. MMFS provides explicit estimates of feature importance in terms of relevance and redundancy. KS-MMFS was then used to develop a linear cost-sensitive SVM embedded feature selection model. The proposed model was tested on a group of 11 benchmark datasets and compared to relevant models from the literature. The results and analysis showed that different cost sensitivity (i.e., sensitivity-specificity tradeoff) requirements influence the features selected. The analysis demonstrated the competitive performance of the proposed model compared with relevant models. The model achieved an average improvement of 31.8% in classification performance with a 22.4% average reduction in solution time. The results and analysis in this research demonstrated the competitive performance of the proposed model as an efficient cost-sensitive embedded feature selection method.
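The sensitivity-specificity tradeoff mentioned in the abstract can be illustrated with a minimal sketch. This is not the KS-MMFS model or the authors' formulation; the function names, the toy labels, and the cost values (`cost_fn`, `cost_fp`) are hypothetical and chosen only to show how unequal error costs change which classifier is preferred.

```python
def confusion_counts(y_true, y_pred):
    """Count TP, FN, TN, FP for labels in {+1, -1} (positive class = +1)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == -1)
    tn = sum(1 for t, p in pairs if t == -1 and p == -1)
    fp = sum(1 for t, p in pairs if t == -1 and p == 1)
    return tp, fn, tn, fp


def sensitivity_specificity(y_true, y_pred):
    """Sensitivity = TP / (TP + FN); specificity = TN / (TN + FP)."""
    tp, fn, tn, fp = confusion_counts(y_true, y_pred)
    return tp / (tp + fn), tn / (tn + fp)


def weighted_error_cost(y_true, y_pred, cost_fn=1.0, cost_fp=1.0):
    """Total misclassification cost with separate (assumed) costs per error type."""
    _, fn, _, fp = confusion_counts(y_true, y_pred)
    return cost_fn * fn + cost_fp * fp


# Toy data (hypothetical): 4 positives followed by 6 negatives.
y_true = [1, 1, 1, 1, -1, -1, -1, -1, -1, -1]
pred_a = [1, 1, 1, 1, 1, 1, -1, -1, -1, -1]      # favors sensitivity: 0 FN, 2 FP
pred_b = [1, 1, -1, -1, -1, -1, -1, -1, -1, -1]  # favors specificity: 2 FN, 0 FP

sens_a, spec_a = sensitivity_specificity(y_true, pred_a)
sens_b, spec_b = sensitivity_specificity(y_true, pred_b)

# When false negatives are costlier, classifier A is cheaper; the reverse favors B.
cost_a_fn_heavy = weighted_error_cost(y_true, pred_a, cost_fn=5.0, cost_fp=1.0)
cost_b_fn_heavy = weighted_error_cost(y_true, pred_b, cost_fn=5.0, cost_fp=1.0)
cost_a_fp_heavy = weighted_error_cost(y_true, pred_a, cost_fn=1.0, cost_fp=5.0)
cost_b_fp_heavy = weighted_error_cost(y_true, pred_b, cost_fn=1.0, cost_fp=5.0)
```

Under these assumed costs the preferred classifier flips with the cost ratio, which is the kind of requirement the abstract reports as influencing the features selected.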
Pages: 11
Related papers
50 in total
  • [1] Linear Cost-sensitive Max-margin Embedded Feature Selection for SVM
    Department of Business Administration, Emporia State University, Emporia, KS 66801, United States
    Affiliation not available, NY 13902, United States
    Expert Sys Appl, 2022,
  • [2] Cost-sensitive max-margin feature selection for SVM using alternated sorting method genetic algorithm
    Aram, Khalid Y.
    Lam, Sarah S.
    Khasawneh, Mohammad T.
    KNOWLEDGE-BASED SYSTEMS, 2023, 267
  • [3] Max-Margin feature selection
    Prasad, Yamuna
    Khandelwal, Dinesh
    Biswas, K. K.
    PATTERN RECOGNITION LETTERS, 2017, 95 : 51 - 57
  • [4] Cost-Sensitive Feature Selection on Heterogeneous Data
    Qian, Wenbin
    Shu, Wenhao
    Yang, Jun
    Wang, Yinglong
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PART II, 2015, 9078 : 397 - 408
  • [5] Max-Margin Token Selection in Attention Mechanism
    Tarzanagh, Davoud Ataee
    Li, Yingcong
    Zhang, Xuechen
    Oymak, Samet
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [6] Cost-Sensitive Feature Selection for Class Imbalance Problem
    Bach, Malgorzata
    Werner, Aleksandra
    INFORMATION SYSTEMS ARCHITECTURE AND TECHNOLOGY, PT I, 2018, 655 : 182 - 194
  • [7] Cost-sensitive Feature Selection for Support Vector Machines
    Benitez-Pena, S.
    Blanquero, R.
    Carrizosa, E.
    Ramirez-Cobo, P.
    COMPUTERS & OPERATIONS RESEARCH, 2019, 106 : 169 - 178
  • [8] Structural max-margin discriminant analysis for feature extraction
    Chen, Xiaobo
    Xiao, Yan
    Cai, Yinfeng
    Chen, Long
    KNOWLEDGE-BASED SYSTEMS, 2014, 70 : 154 - 166
  • [9] Nonlinear Feature Extraction with Max-Margin Data Shifting
    Wangni, Jianqiao
    Chen, Ning
    THIRTIETH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, : 2208 - 2214
  • [10] Cost-Sensitive Universum-SVM
    Dhar, Sauptik
    Cherkassky, Vladimir
    2012 11TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2012), VOL 1, 2012, : 220 - 225