Feature selection by utilizing kernel-based fuzzy rough set and entropy-based non-dominated sorting genetic algorithm in multi-label data

被引:0
作者
Hamidzadeh, Javad [1 ]
Mehravaran, Zahra [2 ]
Harati, Ahad [2 ]
机构
[1] Sadjad Univ, Fac Comp Engn & Informat Technol, Mashhad, Iran
[2] Ferdowsi Univ Mashhad FUM, Dept Comp Engn, Mashhad, Iran
关键词
Feature selection; Kernel fuzzy rough set; Mixed data; Multi-label learning; CLASSIFICATION;
D O I
10.1007/s10115-025-02341-5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-label learning, which involves assigning multiple class labels to each instance, becomes increasingly complex when dealing with large-scale mixed datasets featuring high-dimensional feature spaces. These mixed datasets often involve a combination of numerical and categorical features, which exacerbate the challenges of multi-label learning by introducing additional layers of uncertainty and variability. Traditional classification methods, although effective in simpler scenarios, often fail to address these complexities resulting in significant errors. To overcome this, we have developed an entropy-based objective function that captures the intricate interplay between features and classes, while accounting for the inherent uncertainty of mixed data. This objective function explicitly accounts for the heterogeneous nature of mixed datasets, ensuring robust feature selection across diverse attribute types. To tackle these challenges, we propose a memetic algorithm that integrates fuzzy rough sets with enhancements from kernel fuzzy rough sets (KFRS), and the Non-dominated Sorting Genetic Algorithm II. This synergy enables the extraction of optimal feature subsets that significantly improve classification performance. By leveraging kernel-based similarity measures, KFRS refines the partitions formed by fuzzy set memberships for distinct classes, ensuring precise alignment of data samples with multiple labels, while effectively handling the complexities of mixed-data representation. A key strength of our approach lies in its ability to preserve valuable information through KFRS-driven feature selection. Empirical evaluations on three benchmark datasets highlight the effectiveness of the proposed methodology. The results validate the superiority of our feature selection strategy, grounded in kernel-modulated neighborhoods; furthermore, the implementation demonstrates a notable improvement in both solution quality and search efficiency, establishing it as a highly promising method for multi-label learning tasks.
引用
收藏
页码:3789 / 3819
页数:31
相关论文
共 66 条
  • [1] [Anonymous], 1997, Icml
  • [2] Exploiting local label correlation from sample perspective for multi-label classification via three-way decision theory
    Che, Xiaoya
    Chen, Degang
    Deng, Jiang
    Mi, Jusheng
    [J]. APPLIED SOFT COMPUTING, 2023, 149
  • [3] Margin-based discriminant embedding guided sparse matrix regression for image supervised feature selection
    Chen, Xiuhong
    Lu, Yun
    Zhang, Jun
    Zhu, Xingyu
    [J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 2021, 212
  • [4] An Evolutionary Many-Objective Optimization Algorithm Using Reference-Point-Based Nondominated Sorting Approach, Part I: Solving Problems With Box Constraints
    Deb, Kalyanmoy
    Jain, Himanshu
    [J]. IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2014, 18 (04) : 577 - 601
  • [5] A many-objective feature selection for multi-label classification
    Dong, Hongbin
    Sun, Jing
    Sun, Xiaohang
    Ding, Rui
    [J]. KNOWLEDGE-BASED SYSTEMS, 2020, 208
  • [6] ECC plus plus : An algorithm family based on ensemble of classifier chains for classifying imbalanced multi-label data
    Duan, Jicong
    Gu, Yan
    Yu, Hualong
    Yang, Xibei
    Gao, Shang
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2024, 236
  • [7] Learning correlation information for multi-label feature selection
    Fan, Yuling
    Liu, Jinghua
    Tang, Jianeng
    Liu, Peizhong
    Lin, Yaojin
    Du, Yongzhao
    [J]. PATTERN RECOGNITION, 2024, 145
  • [8] Multi-label feature selection with constraint regression and adaptive spectral graph
    Fan, Yuling
    Liu, Jinghua
    Weng, Wei
    Chen, Baihua
    Chen, Yannan
    Wu, Shunxiang
    [J]. KNOWLEDGE-BASED SYSTEMS, 2021, 212
  • [9] Neural Generalization of Multiple Kernel Learning
    Ghanizadeh, Ahmad Navid
    Ghiasi-Shirazi, Kamaledin
    Monsefi, Reza
    Qaraei, Mohammadreza
    [J]. NEURAL PROCESSING LETTERS, 2024, 56 (01)
  • [10] Distributed multi-label feature selection using individual mutual information measures
    Gonzalez-Lopez, Jorge
    Ventura, Sebastian
    Cano, Alberto
    [J]. KNOWLEDGE-BASED SYSTEMS, 2020, 188 (188)