Nonnegative Matrix Factorization Via Archetypal Analysis

被引：7

作者：

Javadi, Hamid ^{[1
]}

Montanari, Andrea ^{[2
]}

机构：

[1] Rice Univ, Dept Elect & Comp Engn, POB 1892, Houston, TX 77005 USA

[2] Stanford Univ, Dept Elect Engn & Stat, Stanford, CA 94305 USA

来源：

JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION | 2020年 / 115卷 / 530期

基金：

美国国家科学基金会;

关键词：

Dimensionality reduction; Matrix factorization; Separability; ALGORITHMS;

D O I：

10.1080/01621459.2019.1594832

中图分类号：

O21 [概率论与数理统计]; C8 [统计学];

学科分类号：

020208 ; 070103 ; 0714 ;

摘要：

Given a collection of data points, nonnegative matrix factorization (NMF) suggests expressing them as convex combinations of a small set of "archetypes" with nonnegative entries. This decomposition is unique only if the true archetypes are nonnegative and sufficiently sparse (or the weights are sufficiently sparse), a regime that is captured by the separability condition and its generalizations. In this article, we study an approach to NMF that can be traced back to the work of Cutler and Breiman [(1994), "Archetypal Analysis," Technometrics, 36, 338-347] and does not require the data to be separable, while providing a generally unique decomposition. We optimize a trade-off between two objectives: we minimize the distance of the data points from the convex envelope of the archetypes (which can be interpreted as an empirical risk), while also minimizing the distance of the archetypes from the convex envelope of the data (which can be interpreted as a data-dependent regularization). The archetypal analysis method of Cutler and Breiman is recovered as the limiting case in which the last term is given infinite weight. We introduce a "uniqueness condition" on the data which is necessary for identifiability. We prove that, under uniqueness (plus additional regularity conditions on the geometry of the archetypes), our estimator is robust. While our approach requires solving a nonconvex optimization problem, we find that standard optimization methods succeed in finding good solutions for both real and synthetic data. for this article are available online

引用

页码：896 / 907

页数：12

共 50 条

[31] A PROJECTIVE APPROACH TO NONNEGATIVE MATRIX FACTORIZATION
Groetzner, Patrick
ELECTRONIC JOURNAL OF LINEAR ALGEBRA, 2021, 37 : 583 - 597
[32] Robust Manifold Nonnegative Matrix Factorization
Huang, Jin
Nie, Feiping
Huang, Heng
Ding, Chris
ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2014, 8 (03)
[33] LIBNMF - A LIBRARY FOR NONNEGATIVE MATRIX FACTORIZATION
Janecek, Andreas
Grotthoff, Stefan Schulze
Gansterer, Wilfried N.
COMPUTING AND INFORMATICS, 2011, 30 (02) : 205 - 224
[34] Localized user-driven topic discovery via boosted ensemble of nonnegative matrix factorization
Suh, Sangho
Shin, Sungbok
Lee, Joonseok
Reddy, Chandan K.
Choo, Jaegul
KNOWLEDGE AND INFORMATION SYSTEMS, 2018, 56 (03) : 503 - 531
[35] Sparse Separable Nonnegative Matrix Factorization
Nadisic, Nicolas
Vandaele, Arnaud
Cohen, Jeremy E.
Gillis, Nicolas
MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2020, PT I, 2021, 12457 : 335 - 350
[36] A survey of deep nonnegative matrix factorization
Chen, Wen-Sheng
Zeng, Qianwen
Pan, Binbin
NEUROCOMPUTING, 2022, 491 : 305 - 320
[37] Multiview clustering via consistent and specific nonnegative matrix factorization with graph regularization
Xu, Haixia
Gong, Limin
Xuan, Haizhen
Zheng, Xusheng
Gao, Zan
Wen, Xianbing
MULTIMEDIA SYSTEMS, 2022, 28 (05) : 1559 - 1572
[38] Simultaneous Discovery of Common and Discriminative Topics via Joint Nonnegative Matrix Factorization
Kim, Hannah
Choo, Jaegul
Kim, Jingu
Reddy, Chandan K.
Park, Haesun
KDD'15: PROCEEDINGS OF THE 21ST ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2015, : 567 - 576
[39] A Provably Correct and Robust Algorithm for Convolutive Nonnegative Matrix Factorization
Degleris, Anthony
Gillis, Nicolas
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2020, 68 (2499-2512) : 2499 - 2512
[40] Distributionally Robust and Multi-Objective Nonnegative Matrix Factorization
Gillis, Nicolas
Le Thi Khanh Hien
Leplat, Valentin
Tan, Vincent Y. F.
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (08) : 4052 - 4064

← 1 2 3 4 5 →