Non-Negative Matrix Factorization with Auxiliary Information on Overlapping Groups

被引：14

作者：

Shiga, Motoki ^{[1
]}

Mamitsuka, Hiroshi ^{[2
]}

机构：

[1] Gifu Univ, Fac Engn, Dept Elect Elect & Comp Engn, Informat Course, Gifu 5011193, Japan

[2] Kyoto Univ, Inst Chem Res, Bioinformat Ctr, Uji, Kyoto 6110011, Japan

来源：

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING | 2015年 / 27卷 / 06期

关键词：

Non-negative matrix factorization; auxiliary information; semi-supervised learning; sparse structured norm; ALGORITHMS; CLASSIFICATION; SELECTION;

D O I：

10.1109/TKDE.2014.2373361

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Matrix factorization is useful to extract the essential low-rank structure from a given matrix and has been paid increasing attention. A typical example is non-negative matrix factorization (NMF), which is one type of unsupervised learning, having been successfully applied to a variety of data including documents, images and gene expression, where their values are usually non-negative. We propose a new model of NMF which is trained by using auxiliary information of overlapping groups. This setting is very reasonable in many applications, a typical example being gene function estimation where functional gene groups are heavily overlapped with each other. To estimate true groups from given overlapping groups efficiently, our model incorporates latent matrices with the regularization term using a mixed norm. This regularization term allows group-wise sparsity on the optimized low-rank structure. The latent matrices and other parameters are efficiently estimated by a block coordinate gradient descent method. We empirically evaluated the performance of our proposed model and algorithm from a variety of viewpoints, comparing with four methods including MMF for auxiliary graph information, by using both synthetic and real world document and gene expression data sets.

引用

页码：1615 / 1628

页数：14

共 34 条

[1]

[Anonymous], 2009, IEEE Trans. Neural Networks

[2]

[Anonymous], P AISTATS 2010

[3] Gene Ontology: tool for the unification of biology [J].

Ashburner, M ;

Ball, CA ;

Blake, JA ;

Botstein, D ;

Butler, H ;

Cherry, JM ;

Davis, AP ;

Dolinski, K ;

Dwight, SS ;

Eppig, JT ;

Harris, MA ;

Hill, DP ;

Issel-Tarver, L ;

Kasarskis, A ;

Lewis, S ;

Matese, JC ;

Richardson, JE ;

Ringwald, M ;

Rubin, GM ;

Sherlock, G .

NATURE GENETICS, 2000, 25 (01) :25-29

[4]

Basu S., 2004, P 10 ACM SIGKDD INT, P59, DOI DOI 10.1145/1014052.1014062

[5] A Fast Iterative Shrinkage-Thresholding Algorithm for Linear Inverse Problems [J].

Beck, Amir ;

Teboulle, Marc .

SIAM JOURNAL ON IMAGING SCIENCES, 2009, 2 (01) :183-202

[6]

Bengio S., 2009, Advances in Neural Information Processing Systems, V22, P82

[7] Algorithms and applications for approximate nonnegative matrix factorization [J].

Berry, Michael W. ;

Browne, Murray ;

Langville, Amy N. ;

Pauca, V. Paul ;

Plemmons, Robert J. .

COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2007, 52 (01) :155-173

[8] Metagenes and molecular pattern discovery using matrix factorization [J].

Brunet, JP ;

Tamayo, P ;

Golub, TR ;

Mesirov, JP .

PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2004, 101 (12) :4164-4169

[9] Locally Consistent Concept Factorization for Document Clustering [J].

Cai, Deng ;

He, Xiaofei ;

Han, Jiawei .

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2011, 23 (06) :902-913

[10] Graph Regularized Nonnegative Matrix Factorization for Data Representation [J].

Cai, Deng ;

He, Xiaofei ;

Han, Jiawei ;

Huang, Thomas S. .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2011, 33 (08) :1548-1560

← 1 2 3 4 →