Learning the parts of objects by non-negative matrix factorization

被引:9534
作者
Lee, DD
Seung, HS
机构
[1] AT&T Bell Labs, Lucent Technol, Murray Hill, NJ 07974 USA
[2] MIT, Dept Brain & Cognit Sci, Cambridge, MA 02139 USA
关键词
D O I
10.1038/44565
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Is perception of the whole based on perception of its parts? There is psychological(1) and physiological(2,3) evidence for parts-based representations in the brain, and certain computational theories of object recognition rely on such representations(4,5). But little is known about how brains or computers might learn the parts of objects. Here we demonstrate an algorithm for non-negative matrix factorization that is able to learn parts of faces and semantic features of text. This is in contrast to other methods, such as principal components analysis and vector quantization, that learn holistic, not parts-based, representations. Non-negative matrix factorization is distinguished from the other methods by its use of non-negativity constraints. These constraints lead to a parts-based representation because they allow only additive, not subtractive, combinations. When non-negative matrix factorization is implemented as a neural network, parts-based representations emerge by virtue of two properties: the firing rates of neurons are never negative and synaptic strengths do not change sign.
引用
收藏
页码:788 / 791
页数:4
相关论文
共 22 条
[1]  
[Anonymous], 1995, The Handbook of Brain Theory and Neural Networks
[2]  
[Anonymous], 1996, HIGH LEVEL VISION OB
[3]   Independent component representations for face recognition [J].
Bartlett, MS ;
Lades, HM ;
Sejnowski, TJ .
HUMAN VISION AND ELECTRONIC IMAGING III, 1998, 3299 :528-539
[4]   AN INFORMATION MAXIMIZATION APPROACH TO BLIND SEPARATION AND BLIND DECONVOLUTION [J].
BELL, AJ ;
SEJNOWSKI, TJ .
NEURAL COMPUTATION, 1995, 7 (06) :1129-1159
[5]   RECOGNITION-BY-COMPONENTS - A THEORY OF HUMAN IMAGE UNDERSTANDING [J].
BIEDERMAN, I .
PSYCHOLOGICAL REVIEW, 1987, 94 (02) :115-147
[6]   MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM [J].
DEMPSTER, AP ;
LAIRD, NM ;
RUBIN, DB .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01) :1-38
[7]   WHAT IS THE GOAL OF SENSORY CODING [J].
FIELD, DJ .
NEURAL COMPUTATION, 1994, 6 (04) :559-601
[8]   THE WAKE-SLEEP ALGORITHM FOR UNSUPERVISED NEURAL NETWORKS [J].
HINTON, GE ;
DAYAN, P ;
FREY, BJ ;
NEAL, RM .
SCIENCE, 1995, 268 (5214) :1158-1161
[9]   BLIND SEPARATION OF SOURCES .1. AN ADAPTIVE ALGORITHM BASED ON NEUROMIMETIC ARCHITECTURE [J].
JUTTEN, C ;
HERAULT, J .
SIGNAL PROCESSING, 1991, 24 (01) :1-10
[10]   A solution to Plato's problem: The latent semantic analysis theory of acquisition, induction, and representation of knowledge [J].
Landauer, TK ;
Dumais, ST .
PSYCHOLOGICAL REVIEW, 1997, 104 (02) :211-240