Non-negative Matrix Factorization for Binary Data

被引:0
作者
Larsen, Jacob Sogaard [1 ]
Clemmensen, Line Katrine Harder [1 ]
机构
[1] Tech Univ Denmark, DTU Compute, DK-2800 Lyngby, Denmark
来源
2015 7TH INTERNATIONAL JOINT CONFERENCE ON KNOWLEDGE DISCOVERY, KNOWLEDGE ENGINEERING AND KNOWLEDGE MANAGEMENT (IC3K) | 2015年
关键词
Non-negative Matrix Factorization; Binary Data; Binary Matrix Factorization; Text Modelling; ALGORITHMS;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We propose the Logistic Non-negative Matrix Factorization for decomposition of binary data. Binary data are frequently generated in e.g. text analysis, sensory data, market basket data etc. A common method for analysing non-negative data is the Non-negative Matrix Factorization, though this is in theory not appropriate for binary data, and thus we propose a novel Non-negative Matrix Factorization based on the logistic link function. Furthermore we generalize the method to handle missing data. The formulation of the method is compared to a previously proposed logistic matrix factorization without non-negativity constraint on the features. We compare the performance of the Logistic Non-negative Matrix Factorization to Least Squares Non-negative Matrix Factorization and Kullback-Leibler (KL) Non-negative Matrix Factorization on sets of binary data: a synthetic dataset, a set of student comments on their professors collected in a binary term-document matrix and a sensory dataset. We find that choosing the number of components is an essential part in the modelling and interpretation, that is still unresolved.
引用
收藏
页码:555 / 563
页数:9
相关论文
共 11 条
[1]  
[Anonymous], 2009, CONVEX OPTIMIZATION
[2]  
Gillis N, 2014, ARXIV E PRINTS
[3]   Accelerated Multiplicative Updates and Hierarchical ALS Algorithms for Nonnegative Matrix Factorization [J].
Gillis, Nicolas ;
Glineur, Francois .
NEURAL COMPUTATION, 2012, 24 (04) :1085-1105
[4]  
Hastie T., 2009, ELEMENTS STAT LEARNI, DOI 10.1007/978-0-387-84858-7
[5]   Learning the parts of objects by non-negative matrix factorization [J].
Lee, DD ;
Seung, HS .
NATURE, 1999, 401 (6755) :788-791
[6]  
Lee DD, 2001, ADV NEUR IN, V13, P556
[7]  
Nielsen SFV, 2014, IEEE INT WORKS MACH
[8]  
Paukkeri M.-S., 2012, THESIS
[9]  
Randall J., 1989, BIOMETRICAL J, V7, P781
[10]   A logistic non-negative matrix factorization approach to binary data sets [J].
Tome, A. M. ;
Schachtner, R. ;
Vigneron, V. ;
Puntonet, C. G. ;
Lang, E. W. .
MULTIDIMENSIONAL SYSTEMS AND SIGNAL PROCESSING, 2015, 26 (01) :125-143