Classifier chains for multi-label classification

Cited by: 9
Authors
Jesse Read
Bernhard Pfahringer
Geoff Holmes
Eibe Frank
Affiliations
[1] The University of Waikato, Department of Computer Science
[2] Universidad Carlos III, Department of Signal Theory and Communications
Source
Machine Learning | 2011 / Vol. 85
Keywords
Multi-label classification; Problem transformation; Ensemble methods; Scalable methods
Abstract
The widely known binary relevance method for multi-label classification, which considers each label as an independent binary problem, has often been overlooked in the literature due to the perceived inadequacy of not directly modelling label correlations. Most current methods invest considerable complexity to model interdependencies between labels. This paper shows that binary relevance-based methods have much to offer, and that high predictive performance can be obtained without impeding scalability to large datasets. We exemplify this with a novel classifier chains method that can model label correlations while maintaining acceptable computational complexity. We extend this approach further in an ensemble framework. An extensive empirical evaluation covers a broad range of multi-label datasets with a variety of evaluation metrics. The results illustrate the competitiveness of the chaining method against related and state-of-the-art methods, both in terms of predictive performance and time complexity.
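The chaining idea summarised in the abstract can be illustrated with a minimal sketch. It assumes the usual formulation in which the classifier for label j is trained on the input features extended with the true values of labels 0..j-1, and at prediction time receives the earlier predictions instead. The class names `NearestNeighbour` and `ClassifierChain`, the toy data, and the choice of a 1-nearest-neighbour base learner are all hypothetical choices made for illustration; they are not taken from the paper, and the ensemble extension is not shown.

```python
from typing import List, Sequence


class NearestNeighbour:
    """Hypothetical stand-in base learner: a 1-nearest-neighbour binary classifier."""

    def fit(self, X: Sequence[Sequence[float]], y: Sequence[int]) -> "NearestNeighbour":
        self.X = [list(row) for row in X]
        self.y = list(y)
        return self

    def predict_one(self, x: Sequence[float]) -> int:
        # Squared Euclidean distance to every stored training row.
        dists = [sum((a - b) ** 2 for a, b in zip(row, x)) for row in self.X]
        return self.y[dists.index(min(dists))]


class ClassifierChain:
    """Sketch of a classifier chain: one binary model per label, linked in order."""

    def __init__(self, n_labels: int, base=NearestNeighbour):
        self.n_labels = n_labels
        self.base = base

    def fit(self, X: Sequence[Sequence[float]], Y: Sequence[Sequence[int]]) -> "ClassifierChain":
        self.models = []
        for j in range(self.n_labels):
            # Extend each feature vector with the true values of labels 0..j-1.
            X_aug = [list(x) + list(y[:j]) for x, y in zip(X, Y)]
            y_j = [y[j] for y in Y]
            self.models.append(self.base().fit(X_aug, y_j))
        return self

    def predict_one(self, x: Sequence[float]) -> List[int]:
        preds: List[int] = []
        for model in self.models:
            # Earlier predictions are passed down the chain as extra features.
            preds.append(model.predict_one(list(x) + preds))
        return preds


# Toy data (assumed): label 0 follows the first feature, label 1 the second.
X = [[0, 0], [0, 1], [1, 0], [1, 1]]
Y = [[0, 0], [0, 1], [1, 0], [1, 1]]
chain = ClassifierChain(n_labels=2).fit(X, Y)
print(chain.predict_one([0.9, 0.1]))
```

Binary relevance corresponds to the same loop without the label augmentation, so the only added cost in this sketch is the growth of each feature vector by at most one entry per preceding label.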