Multi-label feature selection with streaming labels

被引:70
作者
Lin, Yaojin [1 ,2 ]
Hu, Qinghua [2 ]
Zhang, Jia [1 ]
Wu, Xindong [3 ]
机构
[1] Minnan Normal Univ, Sch Comp Sci, Zhangzhou 363000, Peoples R China
[2] Tianjin Univ, Sch Comp Sci & Technol, Tianjin 300072, Peoples R China
[3] Univ Vermont, Dept Comp Sci, Burlington, VT 05405 USA
基金
中国博士后科学基金; 中国国家自然科学基金;
关键词
Feature selection; Streaming labels; Multi-label learning; Supervised learning; MUTUAL INFORMATION; CLASSIFICATION;
D O I
10.1016/j.ins.2016.08.039
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we study a novel and challenging issue, multi-label feature selection with streaming labels, in which the number of labels is unknown in advance, and the size of the feature set is constant. In this problem, we assume that the labels arrive one at a time, and the learning task is to rank features iteratively when a new label arrives. Traditional multi-label feature selection methods cannot perform well in this scenario. Therefore, we present an optimization framework where the weight of each label's feature rank list and the final feature rank list are defined as two sets of unknown variables. The objective is to minimize the overall weighted deviation between the final feature rank list and each label's feature rank list. Extensive experiments on benchmark data sets demonstrate that the proposed method outperforms other multi-label feature selection methods. (C) 2016 Elsevier Inc. All rights reserved.
引用
收藏
页码:256 / 275
页数:20
相关论文
共 49 条
[1]  
[Anonymous], 1999, Athena scientific Belmont
[2]  
[Anonymous], AAAI99 WORKSH TEXT L
[3]  
Bi Wei, 2011, P 28 INT C INT C MAC, P17
[4]   Learning multi-label scene classification [J].
Boutell, MR ;
Luo, JB ;
Shen, XP ;
Brown, CM .
PATTERN RECOGNITION, 2004, 37 (09) :1757-1771
[5]  
Boyd S, 2004, CONVEX OPTIMIZATION
[6]  
Carvalho V. R., 2006, Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining, P548
[7]   MULTIPLE COMPARISONS AMONG MEANS [J].
DUNN, OJ .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1961, 56 (293) :52-&
[8]  
Elisseeff A, 2002, ADV NEUR IN, V14, P681
[9]   A comparison of alternative tests of significance for the problem of m rankings [J].
Friedman, M .
ANNALS OF MATHEMATICAL STATISTICS, 1940, 11 :86-92
[10]   Multilabel classification via calibrated label ranking [J].
Fuernkranz, Johannes ;
Huellermeier, Eyke ;
Mencia, Eneldo Loza ;
Brinker, Klaus .
MACHINE LEARNING, 2008, 73 (02) :133-153