Constrained nonnegative matrix factorization-based semi-supervised multilabel learning

被引:0
作者
Dingguo Yu
Bin Fu
Guandong Xu
Aihong Qin
机构
[1] Zhejiang University of Media and Communications,School of New Media
[2] University of Technology,Advanced Analytics Institute
来源
International Journal of Machine Learning and Cybernetics | 2019年 / 10卷
关键词
Semi-supervised learning; Nonnegative matrix factorization (NMF); Multilabel learning; Weak label;
D O I
暂无
中图分类号
学科分类号
摘要
In many multilabel learning applications, instances with labels being fully provided are scarce, while partially labelled data and unlabelled data are more common due to the expensive cost of manual labelling. However, most of existing models are based on the assumption that the fully labelled training data is sufficient. To deal with the partially labelled and unlabelled data effectively, we present a novel semi-supervised multilabel learning approach based on constrained non-negative matrix factorization in this paper. This approach assumes that if two instances are highly similar in terms of their features, they would also be similar in their associated labels set. Specifically, We first define three matrices to measure the similarity of each pair of instances in two different ways. Then, the optimal assignation of labels to the unlabelled instance is determined by minimizing the differentiation between these two similarity sets via a non-negative matrix factorization process. We also present a threshold learning algorithm to determine the classification threshold for each label in our proposed approach. Extensive experiment is conducted on various datasets, and the results demonstrate that our method show significantly better performance than other state-of-the-art approaches. It is especially suitable for the situations with a smaller size of labelled training data, or subset of the training data are partially labelled.
引用
收藏
页码:1093 / 1100
页数:7
相关论文
共 61 条
[1]  
Ashfaq RAR(2017)Fuzziness based semi-supervised learning approach for intrusion detection system Inf Sci 378 484-497
[2]  
Wang XZ(2006)Manifold regularization: a geometric framework for learning from labeled and unlabeled examples J Mach Learn Res 7 2399-2434
[3]  
Huang JZ(2004)Learning multi-label scene classification Pattern Recogn 37 1757-1771
[4]  
Abbas H(2009)Combining instance-based learning and logistic regression for multilabel classification Mach Learn 76 211-225
[5]  
He YL(2016)Label consistent semi-supervised non-negative matrix factorization for maintenance activities identification Eng Appl Artif Intell 52 161-167
[6]  
Belkin M(2008)Multilabel classification via calibrated label ranking Mach Learn 73 133-153
[7]  
Niyogi P(1999)Learning the parts of objects by non-negative matrix factorization Nature 401 788-791
[8]  
Sindhwani V(2012)Constrained nonnegative matrix factorization for image representation IEEE Trans Pattern Anal Mach Intell 34 1299-1311
[9]  
Boutell MR(2010)Semi-supervised dimension reduction for multi-label classification AAAI 10 569-574
[10]  
Luo J(2011)Classifier chains for multi-label classification Mach Learn 85 333-359