A new Unsupervised Spectral Feature Selection Method for mixed data: A filter approach

Cited by: 60
Authors
Solorio-Fernandez, Saul [1]
Fco Martinez-Trinidad, Jose [1]
Ariel Carrasco-Ochoa, J. [1]
Affiliations
[1] Natl Inst Astrophys Opt & Elect, Dept Comp Sci, Luis Enrique Erro 1, Puebla 72840, Mexico
Keywords
Unsupervised feature selection; Spectral feature selection; Mixed data; Feature ranking; REDUNDANCY FEATURE-SELECTION; VARIABLE SELECTION; ALGORITHM; CLASSIFICATION; RELEVANCE
DOI
10.1016/j.patcog.2017.07.020
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Most current unsupervised feature selection methods are designed to process only numerical datasets. Therefore, they cannot be directly applied to practical problems in which the objects under study are described by both numerical and non-numerical features (mixed datasets). In this work, we propose a new unsupervised filter feature selection method that can be used on datasets with both numerical and non-numerical features. The proposed method is inspired by spectral feature selection and combines a kernel with a new spectrum-based feature evaluation measure to quantify feature relevance. Experiments on synthetic datasets show that, in 99% of the cases where the relevant features are known, our method identifies the most relevant features and ranks them at the beginning of a sorted list. Additionally, we compare our method against state-of-the-art unsupervised filter methods on real datasets; in most cases, our method significantly outperforms them. (C) 2017 Elsevier Ltd. All rights reserved.
Pages: 314 - 326
Number of pages: 13
Related papers
50 in total
  • [31] Two-Dimensional Unsupervised Feature Selection via Sparse Feature Filter
    Li, Junyu
    Chen, Jiazhou
    Qi, Fei
    Dan, Tingting
    Weng, Wanlin
    Zhang, Bin
    Yuan, Haoliang
    Cai, Hongmin
    Zhong, Cheng
    IEEE TRANSACTIONS ON CYBERNETICS, 2023, 53 (09) : 5605 - 5617
  • [32] A new hybrid feature selection approach using feature association map for supervised and unsupervised classification
    Das, Amit Kumar
    Goswami, Saptarsi
    Chakrabarti, Amlan
    Chakraborty, Basabi
    EXPERT SYSTEMS WITH APPLICATIONS, 2017, 88 : 81 - 94
  • [33] A filter feature selection for high-dimensional data
    Janane, Fatima Zahra
    Ouaderhman, Tayeb
    Chamlal, Hasna
    JOURNAL OF ALGORITHMS & COMPUTATIONAL TECHNOLOGY, 2023, 17
  • [34] A Hybrid Filter/Wrapper Approach of Feature Selection for Gene Expression Data
    Ke, Chao-Hsuan
    Yang, Cheng-Hong
    Chuang, Li-Yeh
    Yang, Cheng-San
    2008 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC), VOLS 1-6, 2008, : 2663 - +
  • [35] A new unsupervised feature selection method for text clustering based on genetic algorithms
    Shamsinejadbabki, Pirooz
    Saraee, Mohammad
    JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2012, 38 (03) : 669 - 684
  • [36] Soft Set Based Quick Reduct Approach for Unsupervised Feature Selection
    Jothi, G.
    Inbarani, Hannah H.
    2012 IEEE INTERNATIONAL CONFERENCE ON ADVANCED COMMUNICATION CONTROL AND COMPUTING TECHNOLOGIES (ICACCCT), 2012, : 277 - 281
  • [37] A new unsupervised feature selection method for text clustering based on genetic algorithms
    Pirooz Shamsinejadbabki
    Mohammad Saraee
    Journal of Intelligent Information Systems, 2012, 38 : 669 - 684
  • [38] Predictable Features Elimination: An Unsupervised Approach to Feature Selection
    Barbiero, Pietro
    Squillero, Giovanni
    Tonda, Alberto
    MACHINE LEARNING, OPTIMIZATION, AND DATA SCIENCE (LOD 2021), PT I, 2022, 13163 : 399 - 412
  • [39] An unsupervised feature selection approach for actionable warning identification
    Ge, Xiuting
    Fang, Chunrong
    Liu, Jia
    Qing, Mingshuang
    Li, Xuanye
    Zhao, Zhihong
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 227
  • [40] An Unsupervised Feature Selection Framework for Social Media Data
    Tang, Jiliang
    Liu, Huan
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2014, 26 (12) : 2914 - 2927