Fuzzy Mutual Information-Based Multilabel Feature Selection With Label Dependency and Streaming Labels

被引:48
作者
Liu, Jinghua [1 ,2 ]
Lin, Yaojin [2 ]
Ding, Weiping [3 ]
Zhang, Hongbo [4 ]
Du, Jixiang [4 ]
机构
[1] Huaqiao Univ, Sch Comp Sci & Technol, Xiamen 361021, Peoples R China
[2] Minnan Normal Univ, Key Lab Data Sci & Intelligence Applicat, Zhangzhou 363000, Peoples R China
[3] Nantong Univ, Sch Informat Sci & Technol, Nantong 226019, Peoples R China
[4] Huaqiao Univ, Xiamen Key Lab Comp Vis & Pattern Recognit, Fujian Key Lab Big Data Intelligence & Secur, Xiamen 361021, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature selection; fuzzy mutual information; label dependency; multilabel learning; streaming labels; CLASSIFICATION;
D O I
10.1109/TFUZZ.2022.3182441
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multilabel feature selection (MFS) has received widespread attention in various big data applications. However, most of the existing methods either explicitly or implicitly assume that all labels are given in advance before feature selection starts; or that all labels are independent. In fact, in many practical applications, the available labels usually arrive dynamically, and they may be interdependent with each other. Moreover, labels may be generated dynamically in a minibatch manner, which makes it more difficult to explore label dependency. In this article, we propose a novel fuzzy mutual information-based multilabel feature selection approach MSDS, which is able to solve single streaming label, minibatch streaming labels, and exploit label dependency simultaneously. In specific, we first promote fuzzy mutual information to be suitable for multilabel learning. This model can effectively consider the relationship between two labels, and has good applicability for measuring the relationship between multiple labels. Then, we analyze feature relevance and feature redundancy based on the combination of label dependency and streaming labels, which helps to facilitate the selection of high-quality feature subsets. Finally, a feature conversion is designed to fuse the representative features of new arrival streaming labels. Comprehensive experiments on twelve multilabel datasets clearly reveal the superiority of the proposed method against two streaming labels based algorithms and five state-of-the-art static label space based algorithms.
引用
收藏
页码:77 / 91
页数:15
相关论文
共 49 条
[1]  
[Anonymous], 2012, P 26 AAAI C ART INT
[2]   Learning multi-label scene classification [J].
Boutell, MR ;
Luo, JB ;
Shen, XP ;
Brown, CM .
PATTERN RECOGNITION, 2004, 37 (09) :1757-1771
[3]   MULTIPLE COMPARISONS AMONG MEANS [J].
DUNN, OJ .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1961, 56 (293) :52-&
[4]   A comparison of alternative tests of significance for the problem of m rankings [J].
Friedman, M .
ANNALS OF MATHEMATICAL STATISTICS, 1940, 11 :86-92
[5]   Multilabel classification via calibrated label ranking [J].
Fuernkranz, Johannes ;
Huellermeier, Eyke ;
Mencia, Eneldo Loza ;
Brinker, Klaus .
MACHINE LEARNING, 2008, 73 (02) :133-153
[6]   Fuzzy probabilistic approximation spaces and their information measures [J].
Hu, QH ;
Yu, DR ;
Xie, ZX ;
Liu, JF .
IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2006, 14 (02) :191-201
[7]   Large-Scale Multimodality Attribute Reduction With Multi-Kernel Fuzzy Rough Sets [J].
Hu, Qinghua ;
Zhang, Lingjun ;
Zhou, Yucan ;
Pedrycz, Witold .
IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2018, 26 (01) :226-238
[8]   Improving multi-label classification with missing labels by learning label-specific features [J].
Huang, Jun ;
Qin, Feng ;
Zheng, Xiao ;
Cheng, Zekai ;
Yuan, Zhixiang ;
Zhang, Weigang ;
Huang, Qingming .
INFORMATION SCIENCES, 2019, 492 :124-146
[9]   Learning Label-Specific Features and Class-Dependent Labels for Multi-Label Classification [J].
Huang, Jun ;
Li, Guorong ;
Huang, Qingming ;
Wu, Xindong .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2016, 28 (12) :3309-3323
[10]   Fast multi-label feature selection based on information-theoretic feature ranking [J].
Lee, Jaesung ;
Kim, Dae-Won .
PATTERN RECOGNITION, 2015, 48 (09) :2761-2771