Subtractive clustering for seeding non-negative matrix factorizations

被引:45
作者
Casalino, Gabriella [1 ]
Del Buono, Nicoletta [2 ]
Mencar, Corrado [1 ]
机构
[1] Univ Bari Aldo Moro, Dept Comp Sci, I-70125 Bari, Italy
[2] Univ Bari Aldo Moro, Dept Math, I-70125 Bari, Italy
关键词
Non-negative matrix factorization; Subtractive clustering; Initialization method; INITIALIZATION; ALGORITHMS;
D O I
10.1016/j.ins.2013.05.038
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Non-negative matrix factorization is a multivariate analysis method which is proven to be useful in many areas such as bio-informatics, molecular pattern discovery, pattern recognition, document clustering and so on. It seeks a reduced representation of a multivariate data matrix into the product of basis and encoding matrices possessing only non-negative elements, in order to learn the so called part-based representations of data. All algorithms for computing non-negative matrix factorization are iterative, therefore particular emphasis must be placed on a proper initialization of NMF because of its local convergence. The problem of selecting appropriate starting matrices becomes more complex when data possess special meaning as in document clustering. In this paper, we propose the adoption of the subtractive clustering algorithm as a scheme to generate initial matrices for non-negative matrix factorization algorithms. Comparisons with other commonly adopted initializations of non-negative matrix factorization algorithms have been performed and the proposed scheme reveals to be a good trade-off between effectiveness and speed. Moreover, the effectiveness of the proposed initialization to suggest a number of basis for NMF, when data distances are estimated, is illustrated when NMF is used for solving clustering problems where the number of groups in which the data are grouped is not known a priori. The influence of a proper rank factor on the interpretability and the effectiveness of the results are also discussed. (C) 2013 Elsevier Inc. All rights reserved.
引用
收藏
页码:369 / 387
页数:19
相关论文
共 45 条
[1]  
Anbumalar S., 2013, INT J COMPUTER APPL, V21, P1
[2]  
[Anonymous], 2006, TECHNICAL REPORT
[3]  
[Anonymous], 1979, NONNEGATIVE MATRICES
[4]  
[Anonymous], 1994, Journal of intelligent and Fuzzy systems
[5]  
[Anonymous], 2003, P 26 ANN INT ACM SIG, DOI DOI 10.1145/860435.860485
[6]  
[Anonymous], NEUR INF PROC SYST C
[7]   Email surveillance using non-negative matrix factorization [J].
Berry M.W. ;
Browne M. .
Computational & Mathematical Organization Theory, 2005, 11 (3) :249-264
[8]   Algorithms and applications for approximate nonnegative matrix factorization [J].
Berry, Michael W. ;
Browne, Murray ;
Langville, Amy N. ;
Pauca, V. Paul ;
Plemmons, Robert J. .
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2007, 52 (01) :155-173
[9]   SVD based initialization: A head start for nonnegative matrix factorization [J].
Boutsidis, C. ;
Gallopoulos, E. .
PATTERN RECOGNITION, 2008, 41 (04) :1350-1362
[10]  
Buciu I, 2006, IEEE INT SYMP CIRC S, P4671