Nonnegative Matrix Factorization Via Archetypal Analysis

Cited by: 7

Authors:

Javadi, Hamid [1]

Montanari, Andrea [2]

Affiliations:

[1] Rice Univ, Dept Elect & Comp Engn, POB 1892, Houston, TX 77005 USA

[2] Stanford Univ, Dept Elect Engn & Stat, Stanford, CA 94305 USA

Source:

JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION | 2020 / Vol. 115 / No. 530

Funding:

U.S. National Science Foundation

Keywords:

Dimensionality reduction; Matrix factorization; Separability; Algorithms

DOI:

10.1080/01621459.2019.1594832

Chinese Library Classification:

O21 [Probability theory and mathematical statistics]; C8 [Statistics]

Discipline classification codes:

020208; 070103; 0714

Abstract:

Given a collection of data points, nonnegative matrix factorization (NMF) suggests expressing them as convex combinations of a small set of "archetypes" with nonnegative entries. This decomposition is unique only if the true archetypes are nonnegative and sufficiently sparse (or the weights are sufficiently sparse), a regime that is captured by the separability condition and its generalizations. In this article, we study an approach to NMF that can be traced back to the work of Cutler and Breiman [(1994), "Archetypal Analysis," Technometrics, 36, 338-347] and does not require the data to be separable, while providing a generally unique decomposition. We optimize a trade-off between two objectives: we minimize the distance of the data points from the convex envelope of the archetypes (which can be interpreted as an empirical risk), while also minimizing the distance of the archetypes from the convex envelope of the data (which can be interpreted as a data-dependent regularization). The archetypal analysis method of Cutler and Breiman is recovered as the limiting case in which the last term is given infinite weight. We introduce a "uniqueness condition" on the data which is necessary for identifiability. We prove that, under uniqueness (plus additional regularity conditions on the geometry of the archetypes), our estimator is robust. While our approach requires solving a nonconvex optimization problem, we find that standard optimization methods succeed in finding good solutions for both real and synthetic data. Supplementary materials for this article are available online.
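The trade-off described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes squared Euclidean distance, combines the two terms as a weighted sum with a hypothetical weight `lam`, and computes the distance to a convex hull by projected gradient descent over simplex weights. The names `project_simplex`, `sq_dist_to_hull`, and `objective` are illustrative only.

```python
import numpy as np

def project_simplex(v):
    """Euclidean projection of each row of v onto the probability simplex."""
    n, k = v.shape
    u = np.sort(v, axis=1)[:, ::-1]           # rows sorted in descending order
    css = np.cumsum(u, axis=1) - 1.0
    idx = np.arange(1, k + 1)
    rho = (u - css / idx > 0).sum(axis=1)      # size of the support of each projection
    theta = css[np.arange(n), rho - 1] / rho   # per-row shift
    return np.maximum(v - theta[:, None], 0.0)

def sq_dist_to_hull(points, vertices, iters=500):
    """Squared distance from each row of `points` to the convex hull of the
    rows of `vertices`, via projected gradient descent on the convex weights."""
    k = vertices.shape[0]
    W = np.full((points.shape[0], k), 1.0 / k)   # start from uniform weights
    # Lipschitz constant of the gradient of 0.5 * ||points - W @ vertices||^2 in W
    L = np.linalg.norm(vertices @ vertices.T, 2) + 1e-12
    for _ in range(iters):
        grad = (W @ vertices - points) @ vertices.T
        W = project_simplex(W - grad / L)
    return np.sum((points - W @ vertices) ** 2, axis=1)

def objective(X, H, lam):
    """Assumed weighted-sum form of the trade-off: data-to-archetype-hull
    distance (risk) plus lam times archetype-to-data-hull distance (regularizer)."""
    risk = sq_dist_to_hull(X, H).sum()
    reg = sq_dist_to_hull(H, X).sum()
    return risk + lam * reg

# Toy data: three corners of a triangle plus one interior point.
X = np.array([[0.0, 0.0], [1.0, 0.0], [0.0, 1.0], [0.25, 0.25]])
H = np.array([[0.0, 0.0], [1.0, 0.0], [0.0, 1.0]])   # candidate archetypes
value = objective(X, H, lam=1.0)
```

In this toy case the archetypes coincide with the corners of the data, so both terms are close to zero. Letting `lam` grow penalizes archetypes that leave the convex hull of the data more and more heavily, which corresponds to the limiting case the abstract attributes to Cutler and Breiman.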


Pages: 896-907

Page count: 12
