Multi-label Online Streaming Feature Selection Algorithms via Extending Alpha-Investing Strategy

被引:0
作者
Ji, Tianqi [1 ]
Guo, Xizhi [1 ]
Li, Yunqian [1 ]
Li, Dan [1 ]
Li, Jun [1 ]
Xu, Jianhua [1 ]
机构
[1] Nanjing Normal Univ, Sch Comp & Elect Informat, Sch Artificial Intelligence, Nanjing 210023, Jiangsu, Peoples R China
来源
BIG DATA ANALYTICS AND KNOWLEDGE DISCOVERY, DAWAK 2022 | 2022年 / 13428卷
关键词
Multi-label learning; Feature selection; Streaming features; Online learning; Alpha investing; CLASSIFICATION;
D O I
10.1007/978-3-031-12670-3_10
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-label learning is a special supervised pattern classification issue, in which an instance is possibly associated with multiple class labels simultaneously. As various real world applications emerge continuously in the big data field, more attention has been paid to streaming data forms recently, i.e., instance, feature and label streams. In this paper, we focus on multi-label online streaming feature selection (OSFS) problem, whose features arrive sequentially over time, and instances and labels are given, to choose an optimal subset of features dynamically. Alpha-investing method is one of the most cited embedded-type single-label OSFS techniques, which mainly involves the linear regression model as its classifier. In this paper, we generalize such a technique to build two new multi-label OSFS algorithms (simply ML-AIBR and ML-AIMOR), which are based on binary relevance (BR) decomposition way and multioutput regression (MOR), respectively. To the best of our knowledge, such two algorithms are the first proposed embedded-type OSFS technique for multi-label streaming features so far. Our extensive experiments conducted on six benchmark data sets demonstrate that our two proposed methods performs better than three existing algorithms. Specially, our ML-AIMOR could filter out more irrelevant and redundant features effectively.
引用
收藏
页码:112 / 124
页数:13
相关论文
共 29 条
  • [1] AlNuaimi N., 2022, APPL COMPUTING INFOR, V18, P113
  • [2] A survey on multi-output regression
    Borchani, Hanen
    Varando, Gherardo
    Bielza, Concha
    Larranaga, Pedro
    [J]. WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2015, 5 (05) : 216 - 233
  • [3] Freeedman D.A., 2009, STAT MODELS THEORY P
  • [4] Herrera F., 2016, Multilabel Classification:Problem Analysis, Metrics and Techniques
  • [5] A survey on online feature selection with streaming features
    Hu, Xuegang
    Zhou, Peng
    Li, Peipei
    Wang, Jing
    Wu, Xindong
    [J]. FRONTIERS OF COMPUTER SCIENCE, 2018, 12 (03) : 479 - 493
  • [6] Multilabel feature selection: A comprehensive review and guiding experiments
    Kashef, Shima
    Nezamabadi-pour, Hossein
    Nikpour, Bahareh
    [J]. WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2018, 8 (02)
  • [7] Feature selection for multi-label classification using multivariate mutual information
    Lee, Jaesung
    Kim, Dae-Won
    [J]. PATTERN RECOGNITION LETTERS, 2013, 34 (03) : 349 - 357
  • [8] Streaming Feature Selection for Multi-Label Data with Dynamic Sliding Windows and Feature Repulsion Loss
    Li, Yu
    Cheng, Yusheng
    [J]. ENTROPY, 2019, 21 (12)
  • [9] Streaming Feature Selection for Multilabel Learning Based on Fuzzy Mutual Information
    Lin, Yaojin
    Hu, Qinghua
    Liu, Jinghua
    Li, Jinjin
    Wu, Xindong
    [J]. IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2017, 25 (06) : 1491 - 1507
  • [10] Multi-label feature selection with streaming labels
    Lin, Yaojin
    Hu, Qinghua
    Zhang, Jia
    Wu, Xindong
    [J]. INFORMATION SCIENCES, 2016, 372 : 256 - 275