Active multi-label learning with optimal label subset selection

被引:4
作者
机构
[1] School of Computer Science and Technology, Soochow University, Suzhou
来源
Jiao, Yang | 1600年 / Springer Verlag卷 / 8933期
基金
中国国家自然科学基金;
关键词
Active learning; Data mining; Multi-label learning; Optimal label subset; Sampling;
D O I
10.1007/978-3-319-14717-8_41
中图分类号
学科分类号
摘要
Multi-label classification, where each instance is assigned with multiple labels, has been an attractive research topic in data mining. The annotations of multi-label instances are typically more difficult and time consuming, since they are simultaneously associated with multiple labels. Therefore, active learning, which reduces the labeling cost by actively querying the labels of the most valuable data, becomes particularly important for multi-label learning. Study reveals that methods querying instance-label pairs are more effective than those query instances, since for each sample, only some effective labels need to be annotated while others can be inferred by exploring the label correlations. However, with the high dimensionality of label space, the instance-label pair selective algorithm will be affected since the computational cost of training a multi-label model may be strongly affected by the number of labels. In this paper we propose an approach that combines instance sampling with optimal label subset selection, which can effectively improve the classification model performance and substantially reduce the annotation cost. Experimental results demonstrate the superiority of the proposed approach to state-of-the-art methods on three benchmark datasets. © Springer International Publishing Switzerland 2014.
引用
收藏
页码:523 / 534
页数:11
相关论文
共 26 条
  • [1] Settles B., Active learning literature survey, 52, pp. 55-66, (2010)
  • [2] Carneiro G., Chan A.B., Moreno P.J., Vasconcelos N., Supervised learning of semantic classes for image annotation and retrieval, IEEE Transactions on Pattern Analysis and Machine Intelligence, 29, 3, pp. 394-410, (2007)
  • [3] Qi G.J., Hua X.S., Rui Y., Tang J., Mei T., Zhang H.J., Correlative multilabel video annotation, Proceedings of the 15th International Conference on Multimedia, pp. 17-26, (2007)
  • [4] Yang Y., Wu F., Nie F., Shen H.T., Zhuang Y., Hauptmann A.G., Web and personal image annotation by mining label correlation with relaxed visual graph embedding, IEEE Transactions on Image Processing, 21, 3, pp. 1339-1351, (2012)
  • [5] Zhang M., Zhou Z., A review on multi-label learning algorithms, (2013)
  • [6] Luo T., Kramer K., Samson S., Remsen A., Goldgof D., Hall L., Hopkins T., Active learning to recognize multiple types of plankton, Proceedings of the 17th International Conference on Pattern Recognition, ICPR, 3, pp. 478-481, (2004)
  • [7] Yan R., Yang L., Hauptmann A., Automatically labeling video data using multiclass active learning, Proceedings of the Ninth IEEE International Conference on Computer Vision, pp. 516-523, (2003)
  • [8] Li X., Wang L., Sung E., Multilabel svm active learning for image classification, 2004 International Conference on Image Processing, ICIP 2004, 4, pp. 2207-2210, (2004)
  • [9] Brinker K., On active learning in multi-label classification, From Data and Information Analysis to Knowledge Engineering, pp. 206-213, (2006)
  • [10] Kruse M.S.R., Crnberger C.B.A., Gaul W., From data and information analysis to knowledge engineering