Semi-supervised concept factorization for document clustering

Citations: 48
Authors
Lu, Mei [1 ,2 ]
Zhao, Xiang-Jun [2 ]
Zhang, Li [1 ]
Li, Fan-Zhang [1 ]
Affiliations
[1] Suzhou Univ, Coll Comp Sci & Technol, Suzhou 215006, Jiangsu, Peoples R China
[2] Jiangsu Normal Univ, Coll Comp Sci & Technol, Xuzhou 221116, Jiangsu, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Concept factorization; Locally consistent concept factorization; Semi-supervised document clustering; Nonnegative matrix factorization;
DOI
10.1016/j.ins.2015.10.038
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Nonnegative Matrix Factorization (NMF) and Concept Factorization (CF) are two popular methods for finding a low-rank approximation of a nonnegative matrix. Unlike NMF, CF can be applied not only to matrices containing negative values but also in kernel space. Many methods based on NMF and CF, such as Graph regularized Nonnegative Matrix Factorization (GNMF) and Locally Consistent Concept Factorization (LCCF), can significantly improve clustering performance. Unfortunately, these are unsupervised learning methods. To enhance clustering performance with supervisory information, this paper proposes Semi-Supervised Concept Factorization (SSCF), which incorporates pairwise constraints into CF as reward and penalty terms. These terms guarantee that data points belonging to the same cluster in the original space remain in the same cluster in the transformed space. Compared with state-of-the-art algorithms (KM, NMF, CF, LCCF, GNMF, PCCF), experimental results on document clustering show that the proposed algorithm performs better in terms of accuracy and mutual information. © 2015 Elsevier Inc. All rights reserved.
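For orientation, the plain (unsupervised) CF baseline named in the abstract can be sketched as follows. This is a minimal illustration, not the SSCF algorithm of the paper: it assumes a nonnegative data matrix X with one document per column, a linear kernel K = X^T X, and the standard multiplicative updates for minimizing ||X - X W V^T||_F^2 with W, V >= 0. The function name, parameters, and toy data are hypothetical, and the pairwise-constraint reward and penalty terms are deliberately left out because the abstract does not give their form.

```python
import numpy as np

def concept_factorization(X, k, n_iter=200, eps=1e-9, seed=0):
    """Plain CF sketch (assumption: X is nonnegative, shape (n_features, n_samples)).

    Approximates X by X @ W @ V.T with nonnegative W, V of shape (n_samples, k).
    Returns W, V and the squared reconstruction error before and after each update.
    """
    rng = np.random.default_rng(seed)
    n = X.shape[1]
    K = X.T @ X                      # linear kernel between documents
    W = rng.random((n, k))           # random nonnegative initialization
    V = rng.random((n, k))
    errors = [np.linalg.norm(X - X @ W @ V.T) ** 2]
    for _ in range(n_iter):
        # Multiplicative updates keep W and V nonnegative.
        W *= (K @ V) / (K @ W @ (V.T @ V) + eps)
        V *= (K @ W) / (V @ (W.T @ K @ W) + eps)
        errors.append(np.linalg.norm(X - X @ W @ V.T) ** 2)
    return W, V, errors

# Toy term-document matrix: 6 terms x 6 documents, two topical groups.
X = np.array([
    [3, 2, 4, 0, 0, 0],
    [2, 3, 1, 0, 0, 0],
    [1, 1, 2, 0, 0, 0],
    [0, 0, 0, 4, 3, 2],
    [0, 0, 0, 1, 2, 3],
    [0, 0, 0, 2, 2, 1],
], dtype=float)

W, V, errors = concept_factorization(X, k=2)
labels = V.argmax(axis=1)            # cluster of each document = largest entry in its row of V
```

A semi-supervised variant in the spirit of the abstract would add terms to the objective that reward agreement between the rows of V for must-link pairs and penalize it for cannot-link pairs; the exact terms and update rules should be taken from the paper itself.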
Pages: 86-98
Number of pages: 13
Related papers
50 in total
[21]   Hierarchical Semi-Supervised Factorization for Learning the Semantics [J].
Shen, Bin ;
Makhambetov, Olzhas .
JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS, 2014, 18 (03) :366-374
[22]   Continuous Semi-Supervised Nonnegative Matrix Factorization [J].
Lindstrom, Michael R. R. ;
Ding, Xiaofu ;
Liu, Feng ;
Somayajula, Anand ;
Needell, Deanna .
ALGORITHMS, 2023, 16 (04)
[23]   Multiple graph regularized semi-supervised nonnegative matrix factorization with adaptive weights for clustering [J].
Zhang, Kexin ;
Zhao, Xuezhuan ;
Peng, Siyuan .
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2021, 106
[24]   Semi-supervised non-negative matrix factorization with structure preserving for image clustering [J].
Jing, Wenjing ;
Lu, Linzhang ;
Ou, Weihua .
NEURAL NETWORKS, 2025, 187
[25]   Semi-supervised non-negative matrix factorization for image clustering with graph Laplacian [J].
He, Yangcheng ;
Lu, Hongtao ;
Xie, Saining .
MULTIMEDIA TOOLS AND APPLICATIONS, 2014, 72 (02) :1441-1463
[26]   Semi-supervised non-negative matrix factorization for image clustering with graph Laplacian [J].
Yangcheng He ;
Hongtao Lu ;
Saining Xie .
Multimedia Tools and Applications, 2014, 72 :1441-1463
[27]   Concept Factorization With Adaptive Neighbors for Document Clustering [J].
Pei, Xiaobing ;
Chen, Chuanbo ;
Gong, Weihua .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (02) :343-352
[28]   A novel regularized concept factorization for document clustering [J].
Yan, Wei ;
Zhang, Bob ;
Ma, Sihan ;
Yang, Zuyuan .
KNOWLEDGE-BASED SYSTEMS, 2017, 135 :147-158
[29]   Semi-Supervised Psychometric Scoring of Document Collections [J].
Suyunu, Burak ;
Ayci, Gonul ;
Ogretir, Mine ;
Cemgil, Ali Taylan ;
Uskudarli, Suzan ;
Zeytinoglu, Hamza ;
Ozel, Bulent ;
Boyaci, Arman .
2018 18TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW), 2018, :1367-1374
[30]   Semi-supervised Non-negative Local Coordinate Factorization [J].
Zhou, Cherong ;
Zhang, Xiang ;
Guan, Naiyang ;
Huang, Xuhui ;
Luo, Zhigang .
NEURAL INFORMATION PROCESSING, PT II, 2015, 9490 :106-113