Nonnegative Matrix Factorization (NMF) is a popular dimension reduction technique of clustering by extracting latent features from high-dimensional data and is widely used for text mining. Several optimization algorithms have been developed for NMF with different cost functions. In this paper we apply several methods of NMF that have been developed for data analysis. These methods vary in using different cost function for matrix factorization and different optimization algorithms for minimizing the cost function. Reuters Document Corpus is used for evaluating the performance of each method. The methods are compared with respect to their accuracy, entropy, purity and computational complexity and residual mean square root error. The most efficient methods in terms of each performance measure are also recognized.
机构:
Univ Mons, Fac Polytech, Dept Math & Operat Res, Rue Houdain 9, B-7000 Mons, BelgiumUniv Mons, Fac Polytech, Dept Math & Operat Res, Rue Houdain 9, B-7000 Mons, Belgium
机构:
Univ Mons, Fac Polytech, Dept Math & Operat Res, Rue Houdain 9, B-7000 Mons, BelgiumUniv Mons, Fac Polytech, Dept Math & Operat Res, Rue Houdain 9, B-7000 Mons, Belgium
Pan, Junjun
Gillis, Nicolas
论文数: 0引用数: 0
h-index: 0
机构:
Univ Mons, Fac Polytech, Dept Math & Operat Res, Rue Houdain 9, B-7000 Mons, BelgiumUniv Mons, Fac Polytech, Dept Math & Operat Res, Rue Houdain 9, B-7000 Mons, Belgium