Feature selection by utilizing kernel-based fuzzy rough set and entropy-based non-dominated sorting genetic algorithm in multi-label data

被引:0
作者
Hamidzadeh, Javad [1 ]
Mehravaran, Zahra [2 ]
Harati, Ahad [2 ]
机构
[1] Sadjad Univ, Fac Comp Engn & Informat Technol, Mashhad, Iran
[2] Ferdowsi Univ Mashhad FUM, Dept Comp Engn, Mashhad, Iran
关键词
Feature selection; Kernel fuzzy rough set; Mixed data; Multi-label learning; CLASSIFICATION;
D O I
10.1007/s10115-025-02341-5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-label learning, which involves assigning multiple class labels to each instance, becomes increasingly complex when dealing with large-scale mixed datasets featuring high-dimensional feature spaces. These mixed datasets often involve a combination of numerical and categorical features, which exacerbate the challenges of multi-label learning by introducing additional layers of uncertainty and variability. Traditional classification methods, although effective in simpler scenarios, often fail to address these complexities resulting in significant errors. To overcome this, we have developed an entropy-based objective function that captures the intricate interplay between features and classes, while accounting for the inherent uncertainty of mixed data. This objective function explicitly accounts for the heterogeneous nature of mixed datasets, ensuring robust feature selection across diverse attribute types. To tackle these challenges, we propose a memetic algorithm that integrates fuzzy rough sets with enhancements from kernel fuzzy rough sets (KFRS), and the Non-dominated Sorting Genetic Algorithm II. This synergy enables the extraction of optimal feature subsets that significantly improve classification performance. By leveraging kernel-based similarity measures, KFRS refines the partitions formed by fuzzy set memberships for distinct classes, ensuring precise alignment of data samples with multiple labels, while effectively handling the complexities of mixed-data representation. A key strength of our approach lies in its ability to preserve valuable information through KFRS-driven feature selection. Empirical evaluations on three benchmark datasets highlight the effectiveness of the proposed methodology. The results validate the superiority of our feature selection strategy, grounded in kernel-modulated neighborhoods; furthermore, the implementation demonstrates a notable improvement in both solution quality and search efficiency, establishing it as a highly promising method for multi-label learning tasks.
引用
收藏
页码:3789 / 3819
页数:31
相关论文
共 66 条
  • [11] Predicting users' preferences by Fuzzy Rough Set Quarter-Sphere Support Vector Machine
    Hamidzadeh, Javad
    Rezaeenik, Ebrahim
    Moradi, Mona
    [J]. APPLIED SOFT COMPUTING, 2021, 112
  • [12] Enhancing data analysis: uncertainty-resistance method for handling incomplete data
    Hamidzadeh, Javad
    Moradi, Mona
    [J]. APPLIED INTELLIGENCE, 2020, 50 (01) : 74 - 86
  • [13] Large-Scale Multimodality Attribute Reduction With Multi-Kernel Fuzzy Rough Sets
    Hu, Qinghua
    Zhang, Lingjun
    Zhou, Yucan
    Pedrycz, Witold
    [J]. IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2018, 26 (01) : 226 - 238
  • [14] Joint Feature Selection and Classification for Multilabel Learning
    Huang, Jun
    Li, Guorong
    Huang, Qingming
    Wu, Xindong
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2018, 48 (03) : 876 - 889
  • [15] Learning Label-Specific Features and Class-Dependent Labels for Multi-Label Classification
    Huang, Jun
    Li, Guorong
    Huang, Qingming
    Wu, Xindong
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2016, 28 (12) : 3309 - 3323
  • [16] Multi-label feature selection via manifold regularization and dependence maximization
    Huang, Rui
    Wu, Zhejun
    [J]. PATTERN RECOGNITION, 2021, 120
  • [17] Isa A, 2024, J Theor Appl Inf Technol, V102
  • [18] Naive Multi-label classification of YouTube comments using comparative opinion mining
    Khan, Asad Ullah Rafiq
    Khan, Madiha
    Khan, Mohammad Badruddin
    [J]. 4TH SYMPOSIUM ON DATA MINING APPLICATIONS (SDMA2016), 2016, 82 : 57 - 64
  • [19] Kong DG, 2012, PROC CVPR IEEE, P2352, DOI 10.1109/CVPR.2012.6247947
  • [20] Kononenko I., 1994, Machine Learning: ECML-94. European Conference on Machine Learning. Proceedings, P171