A new Unsupervised Spectral Feature Selection Method for mixed data: A filter approach

Cited by: 60
Authors
Solorio-Fernandez, Saul [1]
Fco Martinez-Trinidad, Jose [1]
Ariel Carrasco-Ochoa, J. [1]
Affiliations
[1] Natl Inst Astrophys Opt & Elect, Dept Comp Sci, Luis Enrique Erro 1, Puebla 72840, Mexico
Keywords
Unsupervised feature selection; Spectral feature selection; Mixed data; Feature ranking; REDUNDANCY FEATURE-SELECTION; VARIABLE SELECTION; ALGORITHM; CLASSIFICATION; RELEVANCE;
DOI
10.1016/j.patcog.2017.07.020
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Most current unsupervised feature selection methods are designed to process only numerical datasets. Therefore, in practical problems where the objects under study are described by both numerical and non-numerical features (mixed datasets), these methods cannot be applied directly. In this work, we propose a new unsupervised filter feature selection method that can be used on datasets with both numerical and non-numerical features. The proposed method is inspired by spectral feature selection: it combines a kernel with a new spectrum-based feature evaluation measure to quantify feature relevance. Experiments on synthetic datasets show that, in 99% of the cases where the relevant features are known, our method identifies the most relevant features and ranks them at the beginning of a sorted list. Additionally, we compare our method against state-of-the-art unsupervised filter methods on real datasets, and in most cases our method significantly outperforms them. (C) 2017 Elsevier Ltd. All rights reserved.
Pages: 314-326
Number of pages: 13
Related papers
50 in total
  • [1] Filter unsupervised spectral feature selection method for mixed data based on a new feature correlation measure
    Solorio-Fernandez, Saul
    Carrasco-Ochoa, J. Ariel
    Martinez-Trinidad, Jose Fco.
    NEUROCOMPUTING, 2024, 571
  • [2] A Supervised Filter Feature Selection Method for Mixed Data Based on the Spectral Gap Score
    Solorio-Fernandez, Saul
    Fco Martinez-Trinidad, Jose
    Ariel Carrasco-Ochoa, Jesus
    PATTERN RECOGNITION, MCPR 2019, 2019, 11524 : 3 - 13
  • [3] A PARTITION-BASED FEATURE SELECTION METHOD FOR MIXED DATA: A FILTER APPROACH
    Dutt, Ashish
    Ismail, Maizatul Akmar
    MALAYSIAN JOURNAL OF COMPUTER SCIENCE, 2020, 33 (02) : 152 - 169
  • [4] A Supervised Filter Feature Selection method for mixed data based on Spectral Feature Selection and Information-theory redundancy analysis
    Solorio-Fernandez, Saul
    Fco Martinez-Trinidad, Jose
    Ariel Carrasco-Ochoa, J.
    PATTERN RECOGNITION LETTERS, 2020, 138 : 321 - 328
  • [5] Hierarchical fuzzy filter method for unsupervised feature selection
    Li, Yun
    Lu, Bao-Liang
    Wu, Zhong-Fu
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2007, 18 (02) : 157 - 169
  • [6] An unsupervised approach for feature selection in linked data
    Hoseini, Elham
    Mansoori, Eghbal G.
    2016 24TH IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE), 2016, : 1881 - 1886
  • [7] Compactness score: a fast filter method for unsupervised feature selection
    Zhu, Peican
    Hou, Xin
    Tang, Keke
    Wang, Zhen
    Nie, Feiping
    ANNALS OF OPERATIONS RESEARCH, 2023,
  • [8] Unsupervised spectral feature selection algorithms for high dimensional data
    Wang, Mingzhao
    Han, Henry
    Huang, Zhao
    Xie, Juanying
    FRONTIERS OF COMPUTER SCIENCE, 2023, 17 (05)
  • [9] Unsupervised spectral feature selection algorithms for high dimensional data
    WANG Mingzhao
    HAN Henry
    HUANG Zhao
    XIE Juanying
    Frontiers of Computer Science, 2023, 17 (05)
  • [10] RISC: A new filter approach for feature selection from proteomic data
    Vu, Trung-Nghia
    Ohn, Syng-Yup
    Kim, Chul-Woo
    MEDICAL BIOMETRICS, PROCEEDINGS, 2007, 4901 : 17 - +