Distributional Clustering Using Nonnegative Matrix Factorization

被引:0
作者
Zhu, Zhenfeng [1 ]
Ye, Yangdong [1 ]
机构
[1] Zhengzhou Univ, Sch Informat Engn, Zhengzhou, Henan Province, Peoples R China
来源
PROCEEDINGS OF THE 10TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA 2012) | 2012年
关键词
Nonnegative matrix factorization; conditional distribution; clustering; fuzziness; PARTS;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we propose an iterative distributional clustering algorithm based on non-negative matrix factorization (DCMF). When factorizing a data matrix A into C x M, an objective function is defined to impose the conditional distribution constraints on the base matrix C and the coefficient matrix M. It has been observed that, in many applications, the conditional distributions of instances are often employed to normalize the data dimensions. Taking these factors into account, we simplify the existent updating rules and obtain the iterative algorithm DCMF. This algorithm satisfies the constraints described above on condition that the instance matrix is preprocessed as a conditional distribution. DCMF is simple, effective, and only needs to initialize the coefficient matrix. As a result, the base matrix can be viewed as a centroid matrix and the coefficient matrix just records the membership of fuzzy clustering. Compared with several other factorization algorithms, the experimental results on text, gene, and image data demonstrate that. DCMF achieves 8.06% clustering accuracy improvement, 35.08% computational time reduction, and 61.30% hard clustering fuzziness decrease.
引用
收藏
页码:4705 / 4711
页数:7
相关论文
共 26 条
[1]  
Albright R., 2006, ALGORITHMS INITIALIZ, P795
[2]  
[Anonymous], 2006, TECHNICAL REPORT
[3]  
[Anonymous], 2003, P 26 ANN INT ACM SIG, DOI DOI 10.1145/860435.860485
[4]  
[Anonymous], 2005, P 22 INT C MACH LEAR, DOI DOI 10.1145/1102351.1102451
[5]   Algorithms and applications for approximate nonnegative matrix factorization [J].
Berry, Michael W. ;
Browne, Murray ;
Langville, Amy N. ;
Pauca, V. Paul ;
Plemmons, Robert J. .
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2007, 52 (01) :155-173
[6]  
Bezdek J. C., 1981, Pattern recognition with fuzzy objective function algorithms
[7]  
Chu M., 2004, TECHNICAL REPORT
[8]   Non-negative matrix factorization with α-divergence [J].
Cichocki, Andrzej ;
Lee, Hyekyoung ;
Kim, Yong-Deok ;
Choi, Seungjin .
PATTERN RECOGNITION LETTERS, 2008, 29 (09) :1433-1440
[9]  
Cover T. M., 1999, Elements of information theory
[10]  
Ding C, 2005, SIAM PROC S, P606