Information-theoretic partially labeled heterogeneous feature selection based on neighborhood rough sets

被引:14
|
作者
Zhang, Hongying [1 ]
Sun, Qianqian [1 ]
Dong, Kezhen [1 ]
机构
[1] Xi An Jiao Tong Univ, Sch Math & Stat, Xian 710049, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature selection; Monotonic entropy; Partially labeled heterogeneous data; ATTRIBUTE REDUCTION;
D O I
10.1016/j.ijar.2022.12.010
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the rapid increase of large-scale, real-world datasets, it becomes critical to address the problem of partially labeled heterogeneous feature selection (i.e., some samples, which own numerical and categorical features, have no labels). Existing solutions typically adopt linear correlations between features. In this paper, three different monotonic uncertainty measures are defined on equivalence classes and neighborhood classes to study the partially labeled heterogeneous feature selection by exploring the nonlinear correlations. First, consistent entropy and monotonic neighborhood entropy, based on classical rough set theory and neighborhood rough set theory, are proposed to construct a uniform measure for feature selection in heterogeneous datasets. Furthermore, a maximal neighborhood entropy strategy is developed by considering the inconsistency of neighborhood classes described by the features and partial labels. Finally, two feature selection algorithms are presented by three novel monotonic uncertainty measures. The comparative experiments demonstrate the effectiveness and superiority of the newly proposed feature selection measures.(c) 2022 Elsevier Inc. All rights reserved.
引用
收藏
页码:200 / 217
页数:18
相关论文
共 50 条
  • [31] Feature Selection Using Fuzzy Neighborhood Entropy-Based Uncertainty Measures for Fuzzy Neighborhood Multigranulation Rough Sets
    Sun, Lin
    Wang, Lanying
    Ding, Weiping
    Qian, Yuhua
    Xu, Jiucheng
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2021, 29 (01) : 19 - 33
  • [32] Information-Theoretic Feature Selection via Tensor Decomposition and Submodularity
    Amiridi, Magda
    Kargas, Nikos
    Sidiropoulos, Nicholas D.
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2021, 69 : 6195 - 6205
  • [33] A Rigorous Information-Theoretic Definition of Redundancy and Relevancy in Feature Selection Based on (Partial) Information Decomposition
    Wollstadt, Patricia
    Schmitt, Sebastian
    Wibral, Michael
    JOURNAL OF MACHINE LEARNING RESEARCH, 2023, 24
  • [34] Feature subset selection wrapper based on mutual information and rough sets
    Foithong, Sombut
    Pinngern, Ouen
    Attachoo, Boonwat
    EXPERT SYSTEMS WITH APPLICATIONS, 2012, 39 (01) : 574 - 584
  • [35] Information granularity-based incremental feature selection for partially labeled hybrid data
    Shu, Wenhao
    Yan, Zhenchao
    Chen, Ting
    Yu, Jianhui
    Qian, Wenbin
    INTELLIGENT DATA ANALYSIS, 2022, 26 (01) : 33 - 56
  • [36] Incremental feature selection based on fuzzy rough sets
    Ni, Peng
    Zhao, Suyun
    Wang, Xizhao
    Chen, Hong
    Li, Cuiping
    Tsang, Eric C. C.
    INFORMATION SCIENCES, 2020, 536 : 185 - 204
  • [37] Discernibility matrix-based feature selection approaches with fuzzy dominance-based neighborhood rough sets
    Chen, Jiayue
    Zhu, Ping
    FUZZY SETS AND SYSTEMS, 2025, 513
  • [38] Feature Selection Based on Weighted Fuzzy Rough Sets
    Wang, Changzhong
    Wang, Changyue
    Qian, Yuhua
    Leng, Qiangkui
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2024, 32 (07) : 4027 - 4037
  • [39] Heterogeneous Feature Selection Based on Neighborhood Combination Entropy
    Zhang, Pengfei
    Li, Tianrui
    Yuan, Zhong
    Luo, Chuan
    Liu, Keyu
    Yang, Xiaoling
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (03) : 3514 - 3527
  • [40] Online streaming feature selection based on neighborhood rough set
    Li, Shuangjie
    Zhang, Kaixiang
    Li, Yali
    Wang, Shuqin
    Zhang, Shaoqiang
    APPLIED SOFT COMPUTING, 2021, 113