Semi-supervised concept factorization for document clustering

Cited by: 48
Authors
Lu, Mei [1, 2]
Zhao, Xiang-Jun [2]
Zhang, Li [1]
Li, Fan-Zhang [1]
Affiliations
[1] Suzhou Univ, Coll Comp Sci & Technol, Suzhou 215006, Jiangsu, Peoples R China
[2] Jiangsu Normal Univ, Coll Comp Sci & Technol, Xuzhou 221116, Jiangsu, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Concept factorization; Locally consistent concept factorization; Semi-supervised document clustering; Nonnegative matrix factorization
DOI
10.1016/j.ins.2015.10.038
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Nonnegative Matrix Factorization (NMF) and Concept Factorization (CF) are two popular methods for finding a low-rank approximation of a nonnegative matrix. Unlike NMF, CF can be applied not only to matrices containing negative values but also in kernel space. Many methods built on NMF and CF, such as Graph regularized Nonnegative Matrix Factorization (GNMF) and Locally Consistent Concept Factorization (LCCF), can significantly improve clustering performance. Unfortunately, these are unsupervised learning methods. To enhance clustering performance with supervisory information, this paper proposes Semi-Supervised Concept Factorization (SSCF), which incorporates pairwise constraints into CF as reward and penalty terms. These terms guarantee that data points belonging to the same cluster in the original space remain in the same cluster in the transformed space. In comparisons with state-of-the-art algorithms (KM, NMF, CF, LCCF, GNMF, PCCF), experimental results on document clustering show that the proposed algorithm performs better in terms of accuracy and mutual information. © 2015 Elsevier Inc. All rights reserved.
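To make the factorization that the abstract builds on more concrete, here is a minimal sketch of plain, unsupervised CF in Python with NumPy. It is not the SSCF algorithm from the paper: the abstract does not give the form of the reward and penalty terms, so they are omitted. The sketch assumes the standard CF model X ≈ X W Vᵀ with nonnegative W and V and the usual multiplicative updates on the linear kernel K = Xᵀ X, and it assumes K has no negative entries (true for term-frequency style document matrices). The function name, parameters, and toy data are all hypothetical.

```python
import numpy as np

def concept_factorization(X, k, n_iter=200, eps=1e-10, seed=0):
    """Plain (unsupervised) CF sketch: X ~ X @ W @ V.T with W, V >= 0.

    X has shape (n_features, n_docs); each column is one document.
    Assumes K = X.T @ X is entrywise nonnegative.
    """
    rng = np.random.default_rng(seed)
    n_docs = X.shape[1]
    K = X.T @ X                      # linear kernel between documents
    W = rng.random((n_docs, k))      # concepts as combinations of documents
    V = rng.random((n_docs, k))      # documents as combinations of concepts
    history = []
    for _ in range(n_iter):
        # Multiplicative updates keep W and V nonnegative.
        W *= (K @ V) / (K @ W @ (V.T @ V) + eps)
        KW = K @ W
        V *= KW / (V @ (W.T @ KW) + eps)
        # Squared reconstruction error, recorded after each full sweep.
        history.append(np.linalg.norm(X - X @ W @ V.T) ** 2)
    return W, V, history

# Toy usage on random nonnegative data (illustrative only, not real documents).
rng = np.random.default_rng(42)
X = rng.random((20, 30))             # 20 features, 30 "documents"
W, V, history = concept_factorization(X, k=3)
labels = V.argmax(axis=1)            # assign each document to its strongest concept
```

Reading the cluster label off the largest entry in each row of V is one common convention. A semi-supervised variant in the spirit of the abstract would add must-link and cannot-link terms to the objective before deriving the updates.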
Pages: 86-98
Page count: 13