Local and global feature selection for multilabel classification with binary relevance: An empirical comparison on flat and hierarchical problems

Cited by: 0
Authors
André Melo
Heiko Paulheim
Source
Artificial Intelligence Review | 2019, Vol. 51
Keywords
Multilabel classification; Transformation methods; Local feature selection; Global feature selection; Binary relevance;
Abstract
Multilabel classification has become increasingly important for various use cases. Among the existing multilabel classification methods, problem transformation approaches, such as Binary Relevance, Pruned Problem Transformation, and Classifier Chains, are some of the most popular, since they break a global multilabel classification problem into a set of smaller binary or multiclass classification problems. Transformation methods enable the use of two different feature selection approaches: local, where the selection is performed independently for each of the transformed problems, and global, where the selection is performed on the original dataset, meaning that all local classifiers work on the same set of features. While global methods have been widely researched, local methods have received little attention so far. In this paper, we compare those two strategies on one of the most straightforward transformation approaches, i.e., Binary Relevance. We empirically compare their performance on various flat and hierarchical multilabel datasets from different application domains. We show that local feature selection outperforms global feature selection in terms of classification accuracy, without drawbacks in runtime performance.
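The distinction between local and global selection under Binary Relevance can be illustrated with a minimal sketch. It is not the authors' implementation: the scoring function (absolute difference of class means), the aggregation used for the global variant (averaging scores over labels), and all function names and the toy data are assumptions made only for illustration.

```python
from statistics import mean

def score(feature_col, label_col):
    """Hypothetical relevance score: |mean on positives - mean on negatives|."""
    pos = [x for x, y in zip(feature_col, label_col) if y == 1]
    neg = [x for x, y in zip(feature_col, label_col) if y == 0]
    if not pos or not neg:
        return 0.0
    return abs(mean(pos) - mean(neg))

def score_matrix(X, Y):
    """scores[l][f] = relevance of feature f for the binary problem of label l."""
    feature_cols = [[row[f] for row in X] for f in range(len(X[0]))]
    label_cols = [[row[l] for row in Y] for l in range(len(Y[0]))]
    return [[score(fc, lc) for fc in feature_cols] for lc in label_cols]

def top_k(scores, k):
    """Indices of the k highest scores; ties go to the lower index."""
    order = sorted(range(len(scores)), key=lambda f: (-scores[f], f))
    return sorted(order[:k])

def local_selection(X, Y, k):
    """Local: each binary problem gets its own feature subset."""
    return [top_k(label_scores, k) for label_scores in score_matrix(X, Y)]

def global_selection(X, Y, k):
    """Global: one subset shared by all binary problems.
    Averaging per-label scores is an assumed aggregation."""
    per_label = score_matrix(X, Y)
    avg = [mean([row[f] for row in per_label]) for f in range(len(X[0]))]
    shared = top_k(avg, k)
    return [shared for _ in per_label]

# Toy data: feature 0 determines label 0, feature 1 determines label 1,
# and feature 2 is moderately related to both labels.
X = [
    [1, 1, 1.6],
    [1, 0, 0.8],
    [0, 1, 0.8],
    [0, 0, 0.0],
]
Y = [
    [1, 1],
    [1, 0],
    [0, 1],
    [0, 0],
]

local_subsets = local_selection(X, Y, k=1)
global_subsets = global_selection(X, Y, k=1)
```

On this toy data with a budget of one feature, the local variant picks the label-specific feature for each binary problem, while the global variant gives both problems the shared compromise feature. Each binary classifier would then be trained only on the columns in its subset.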
Pages: 33-60
Page count: 27