Information-theoretic partially labeled heterogeneous feature selection based on neighborhood rough sets

被引:14
|
作者
Zhang, Hongying [1 ]
Sun, Qianqian [1 ]
Dong, Kezhen [1 ]
机构
[1] Xi An Jiao Tong Univ, Sch Math & Stat, Xian 710049, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature selection; Monotonic entropy; Partially labeled heterogeneous data; ATTRIBUTE REDUCTION;
D O I
10.1016/j.ijar.2022.12.010
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the rapid increase of large-scale, real-world datasets, it becomes critical to address the problem of partially labeled heterogeneous feature selection (i.e., some samples, which own numerical and categorical features, have no labels). Existing solutions typically adopt linear correlations between features. In this paper, three different monotonic uncertainty measures are defined on equivalence classes and neighborhood classes to study the partially labeled heterogeneous feature selection by exploring the nonlinear correlations. First, consistent entropy and monotonic neighborhood entropy, based on classical rough set theory and neighborhood rough set theory, are proposed to construct a uniform measure for feature selection in heterogeneous datasets. Furthermore, a maximal neighborhood entropy strategy is developed by considering the inconsistency of neighborhood classes described by the features and partial labels. Finally, two feature selection algorithms are presented by three novel monotonic uncertainty measures. The comparative experiments demonstrate the effectiveness and superiority of the newly proposed feature selection measures.(c) 2022 Elsevier Inc. All rights reserved.
引用
收藏
页码:200 / 217
页数:18
相关论文
共 50 条
  • [41] Feature selection for blind image steganalysis using neighborhood rough sets
    Chen, Yingyue
    Chen, Yumin
    Yin, Aimin
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2019, 37 (03) : 3709 - 3720
  • [42] Neighborhood multigranulation rough sets for cost-sensitive feature selection on hybrid data
    Shu, Wenhao
    Xia, Qiang
    Qian, Wenbin
    NEUROCOMPUTING, 2024, 565
  • [43] Incremental feature selection: Parallel approach with local neighborhood rough sets and composite entropy
    Xu, Weihua
    Ye, Weirui
    PATTERN RECOGNITION, 2025, 159
  • [44] A semi-parallel framework for greedy information-theoretic feature selection
    Liu, Heng
    Ditzler, Gregory
    INFORMATION SCIENCES, 2019, 492 : 13 - 28
  • [45] Feature Selection and Classification Based on Directed Fuzzy Rough Sets
    Wang, Changyue
    Wang, Changzhong
    An, Shuang
    Ding, Weiping
    Qian, Yuhua
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2025, 55 (01): : 699 - 711
  • [46] Incremental Feature Selection Using a Conditional Entropy Based on Fuzzy Dominance Neighborhood Rough Sets
    Sang, Binbin
    Chen, Hongmei
    Yang, Lei
    Li, Tianrui
    Xu, Weihua
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2022, 30 (06) : 1683 - 1697
  • [47] Feature Selection for Multi-Label Learning Based on F-Neighborhood Rough Sets
    Deng, Zhixuan
    Zheng, Zhonglong
    Deng, Dayong
    Wang, Tianxiang
    He, Yiran
    Zhang, Dawei
    IEEE ACCESS, 2020, 8 : 39678 - 39688
  • [48] An incremental approach to feature selection using the weighted dominance-based neighborhood rough sets
    Yanzhou Pan
    Weihua Xu
    Qinwen Ran
    International Journal of Machine Learning and Cybernetics, 2023, 14 : 1217 - 1233
  • [49] Rough sets-based tri-trade for partially labeled data
    Luo, Ziming
    Gao, Can
    Zhou, Jie
    APPLIED INTELLIGENCE, 2023, 53 (14) : 17708 - 17726
  • [50] Feature Selection Based on Neighborhood Self-Information
    Wang, Changzhong
    Huang, Yang
    Shao, Mingwen
    Hu, Qinghua
    Chen, Degang
    IEEE TRANSACTIONS ON CYBERNETICS, 2020, 50 (09) : 4031 - 4042