Online multi-label stream feature selection based on neighborhood rough set with missing labels

Cited by: 27
Authors
Liang, Shunpan [1]
Liu, Ze [1]
You, Dianlong [1]
Pan, Weiwei [1]
Affiliations
[1] Yanshan Univ, Sch Informat Sci & Engn, Qinhuangdao 066004, Hebei, Peoples R China
Keywords
Online feature selection; Neighborhood rough set; Missing labels; Stream feature; Multi-label; ALGORITHM;
DOI
10.1007/s10044-022-01067-2
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Multi-label feature selection has been essential in many big data applications and plays a significant role in processing high-dimensional data. However, existing online stream feature selection methods ignore the existence of missing labels. Inspired by the neighborhood rough set, which does not require prior knowledge of the feature space, we propose a novel online multi-label stream feature selection algorithm called OFS-Mean. We define a neighborhood relationship that can automatically select an appropriate number of neighbors. Without any prior space and parameters, the algorithm's performance is improved by real-time online prediction of missing labels based on the similarity between an instance and its neighbors. The proposed OFS-Mean divides the feature selection process into two stages, online feature importance evaluation and online redundancy update, to screen important features. With the support of the neighborhood rough set, OFS-Mean can adapt to various types of datasets, improving the algorithm's generalization ability. In the experiments, a similarity test is used to verify the prediction results, and a comparison with traditional semi-supervised feature selection methods, under the condition of selecting the same number of features, achieves ideal results.
Pages: 1025-1039
Number of pages: 15
Related papers
50 in total
  • [21] Multi-label feature selection based on label distribution and feature complementarity
    Qian, Wenbin
    Long, Xuandong
    Wang, Yinglong
    Xie, Yonghong
    [J]. APPLIED SOFT COMPUTING, 2020, 90
  • [22] Multi-label feature selection based on rough granular-ball and label distribution
    Qian, Wenbin
    Xu, Fankang
    Qian, Jin
    Shu, Wenhao
    Ding, Weiping
    [J]. INFORMATION SCIENCES, 2023, 650
  • [23] Online Multi-Label Streaming Feature Selection Based on Label Group Correlation and Feature Interaction
    Liu, Jinghua
    Yang, Songwei
    Zhang, Hongbo
    Sun, Zhenzhen
    Du, Jixiang
    [J]. ENTROPY, 2023, 25 (07)
  • [24] Multi-Label Attribute Reduction Based on Variable Precision Fuzzy Neighborhood Rough Set
    Chen, Panpan
    Lin, Menglei
    Liu, Jinghua
    [J]. IEEE ACCESS, 2020, 8 (08) : 133565 - 133576
  • [25] Multi-Label classification with Missing Labels by Preserving Feature-Label Space Consistency
    Zhang, Zan
    Zhang, Depeng
    Wu, Gongqing
    [J]. 2023 IEEE INTERNATIONAL CONFERENCE ON KNOWLEDGE GRAPH, ICKG, 2023 : 192 - 199
  • [26] New Online Streaming Feature Selection Based on Neighborhood Rough Set for Medical Data
    Lei, Dingfei
    Liang, Pei
    Hu, Junhua
    Yuan, Yuan
    [J]. SYMMETRY-BASEL, 2020, 12 (10) : 1 - 31
  • [27] Local rough set-based feature selection for label distribution learning with incomplete labels
    Qian, Wenbin
    Dong, Ping
    Wang, Yinglong
    Dai, Shiming
    Huang, Jintao
    [J]. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2022, 13 (08) : 2345 - 2364
  • [28] Online streaming feature selection using adapted Neighborhood Rough Set
    Zhou, Peng
    Hu, Xuegang
    Li, Peipei
    Wu, Xindong
    [J]. INFORMATION SCIENCES, 2019, 481 : 258 - 279
  • [29] Multi-objective PSO based online feature selection for multi-label classification
    Paul, Dipanjyoti
    Jain, Anushree
    Saha, Sriparna
    Mathew, Jimson
    [J]. KNOWLEDGE-BASED SYSTEMS, 2021, 222
  • [30] Multi-label feature selection based on the division of label topics
    Zhang, Ping
    Gao, Wanfu
    Hu, Juncheng
    Li, Yonghao
    [J]. INFORMATION SCIENCES, 2021, 553 : 129 - 153