Online Multi-label Group Feature Selection

被引:46
作者
Liu, Jinghua [1 ]
Lin, Yaojin [2 ]
Wu, Shunxiang [1 ]
Wang, Chenxi [2 ]
机构
[1] Xiamen Univ, Dept Automat, Xiamen 361000, Peoples R China
[2] Minnan Normal Univ, Sch Comp Sci, Zhangzhou 363000, Peoples R China
基金
中国国家自然科学基金;
关键词
Online feature selection; Multi-label learning; Streaming feature; Group feature selection; INCREMENTAL UPDATING APPROXIMATIONS; ROUGH SETS; INFORMATION; GRANULATION; QUALITY; MODEL;
D O I
10.1016/j.knosys.2017.12.008
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Feature selection for multi-label learning has received intensive interest in recent years. However, traditional multi-label feature selection are incapable of considering intrinsic group structures of features and handling streaming features simultaneously. To solve this problem, we develop an algorithm called Online Multi-label Group Feature Selection (OMGFS). Our proposed method consists of two-phase: online group selection and online inter-group selection. In the group selection, we design a criterion to select feature groups which is important to label set. In the inter-group selection, we consider feature interaction and feature redundancy to select an optimal feature subset. This two-phase procedure continues until there are no more features arriving. An empirical study using a series of benchmark data sets demonstrates that the proposed method outperforms other state-of-the-art multi-label feature selection methods. (C) 2017 Elsevier B.V. All rights reserved.
引用
收藏
页码:42 / 57
页数:16
相关论文
共 57 条
[1]  
[Anonymous], 2014, INT C MACH LEARN
[2]  
[Anonymous], 2006, Journal of the Royal Statistical Society, Series B
[3]  
[Anonymous], 2008, ISMIR
[4]  
Cherman E., 2011, C LAT INF, P960
[5]   MULTIPLE COMPARISONS AMONG MEANS [J].
DUNN, OJ .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1961, 56 (293) :52-&
[6]  
Elisseeff A, 2002, ADV NEUR IN, V14, P681
[7]   A comparison of alternative tests of significance for the problem of m rankings [J].
Friedman, M .
ANNALS OF MATHEMATICAL STATISTICS, 1940, 11 :86-92
[8]  
Gu Q., 2011, Proceedings of the 20th ACM International Conference on Information and Knowledge Management, P1087
[9]  
Hoi Steven CH., 2012, P 1 INT WORKSH BIG D, P93, DOI DOI 10.1145/2351316.2351329
[10]   Mixed feature selection based on granulation and approximation [J].
Hu, Qinghua ;
Liu, Jinfu ;
Yu, Daren .
KNOWLEDGE-BASED SYSTEMS, 2008, 21 (04) :294-304