ASFS: A novel streaming feature selection for multi-label data based on neighborhood rough set

被引:26
作者
Liu, Jinghua [1 ,2 ,3 ]
Lin, Yaojin [2 ,4 ]
Du, Jixiang [1 ,3 ,5 ]
Zhang, Hongbo [1 ,3 ,5 ]
Chen, Ziyi [1 ,3 ,5 ]
Zhang, Jia [6 ]
机构
[1] Huaqiao Univ, Dept Comp Sci & Technol, Xiamen 361021, Peoples R China
[2] Minnan Normal Univ, Key Lab Data Sci & Intelligence Applicat, Zhangzhou 363000, Fujian, Peoples R China
[3] Huaqiao Univ, Xiamen Key Lab Comp Vis & Pattern Recognit, Xiamen 361021, Peoples R China
[4] Minnan Normal Univ, Sch Comp Sci, Zhangzhou 363000, Peoples R China
[5] Huaqiao Univ, Fujian Key Lab Big Data Intelligence & Secur, Xiamen 361021, Peoples R China
[6] Jinan Univ, Coll Informat Sci & Technol, Guangzhou 510632, Peoples R China
基金
中国国家自然科学基金;
关键词
Multi-label feature selection; Streaming features; Neighborhood rough set; Adaptive neighborhood; ATTRIBUTE REDUCTION; CLASSIFICATION;
D O I
10.1007/s10489-022-03366-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Neighborhood rough set based online streaming feature selection methods have aroused wide concern in recent years and played a vital role in processing high-dimensional data. However, most of the existing methods are directly applied to handle single-label data, or to handle multi-label data by converting multi-label data into a combination of multiple single-label datasets, which ignores that the label set of multi-label data is an integral whole. In this paper, we propose a novel online streaming feature selection for multi-label learning via the neighborhoorough set model, in which feature significance, feature redundancy, and label space integrity are taken into account, simultaneously. To be specific, we first define a new adaptive neighborhood relation to avoid the setting of neighborhood parameter and restructure the neighborhood rough set model to be suitable for processing multi-label data directly. Based on this model, we introduce a evaluation criterion to select features that are important relative to label set and the currently selected features, and present an optimization objective function to update the selected feature subset and filter out redundant features. Comparative experiments on different types of data sets explicitly verify the advantages of the proposed method.
引用
收藏
页码:1707 / 1724
页数:18
相关论文
共 52 条
  • [1] [Anonymous], 2011, NIPS
  • [2] Kernelized fuzzy rough sets based online streaming feature selection for large-scale hierarchical classification
    Bai, Shengxing
    Lin, Yaojin
    Lv, Yan
    Chen, Jinkun
    Wang, Chenxi
    [J]. APPLIED INTELLIGENCE, 2021, 51 (03) : 1602 - 1615
  • [3] A context-aware recommendation approach based on feature selection
    Chen, Lei
    Xia, Meimei
    [J]. APPLIED INTELLIGENCE, 2021, 51 (02) : 865 - 875
  • [4] Cheng Yusheng, 2018, Journal of Computer Applications, V38, P3105, DOI 10.11772/j.issn.1001-9081.2018041275
  • [5] Multi-label feature selection with application to TCM state identification
    Dai, Liang
    Zhang, Jia
    Li, Candong
    Zhou, Changen
    Li, Shaozi
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2019, 31 (23)
  • [6] MULTIPLE COMPARISONS AMONG MEANS
    DUNN, OJ
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1961, 56 (293) : 52 - &
  • [7] Elisseeff A, 2002, ADV NEUR IN, V14, P681
  • [8] Online streaming feature selection using rough sets
    Eskandari, S.
    Javidi, M. M.
    [J]. INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2016, 69 : 35 - 57
  • [9] Multi-label feature selection with local discriminant model and label correlations
    Fan, Yuling
    Liu, Jinghua
    Weng, Wei
    Chen, Baihua
    Chen, Yannan
    Wu, Shunxiang
    [J]. NEUROCOMPUTING, 2021, 442 : 98 - 115
  • [10] Multi-label feature selection with constraint regression and adaptive spectral graph
    Fan, Yuling
    Liu, Jinghua
    Weng, Wei
    Chen, Baihua
    Chen, Yannan
    Wu, Shunxiang
    [J]. KNOWLEDGE-BASED SYSTEMS, 2021, 212