Nonnegative Matrix Factorization Via Archetypal Analysis

被引:7
|
作者
Javadi, Hamid [1 ]
Montanari, Andrea [2 ]
机构
[1] Rice Univ, Dept Elect & Comp Engn, POB 1892, Houston, TX 77005 USA
[2] Stanford Univ, Dept Elect Engn & Stat, Stanford, CA 94305 USA
基金
美国国家科学基金会;
关键词
Dimensionality reduction; Matrix factorization; Separability; ALGORITHMS;
D O I
10.1080/01621459.2019.1594832
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Given a collection of data points, nonnegative matrix factorization (NMF) suggests expressing them as convex combinations of a small set of "archetypes" with nonnegative entries. This decomposition is unique only if the true archetypes are nonnegative and sufficiently sparse (or the weights are sufficiently sparse), a regime that is captured by the separability condition and its generalizations. In this article, we study an approach to NMF that can be traced back to the work of Cutler and Breiman [(1994), "Archetypal Analysis," Technometrics, 36, 338-347] and does not require the data to be separable, while providing a generally unique decomposition. We optimize a trade-off between two objectives: we minimize the distance of the data points from the convex envelope of the archetypes (which can be interpreted as an empirical risk), while also minimizing the distance of the archetypes from the convex envelope of the data (which can be interpreted as a data-dependent regularization). The archetypal analysis method of Cutler and Breiman is recovered as the limiting case in which the last term is given infinite weight. We introduce a "uniqueness condition" on the data which is necessary for identifiability. We prove that, under uniqueness (plus additional regularity conditions on the geometry of the archetypes), our estimator is robust. While our approach requires solving a nonconvex optimization problem, we find that standard optimization methods succeed in finding good solutions for both real and synthetic data. for this article are available online
引用
收藏
页码:896 / 907
页数:12
相关论文
共 50 条
  • [31] A PROJECTIVE APPROACH TO NONNEGATIVE MATRIX FACTORIZATION
    Groetzner, Patrick
    ELECTRONIC JOURNAL OF LINEAR ALGEBRA, 2021, 37 : 583 - 597
  • [32] Robust Manifold Nonnegative Matrix Factorization
    Huang, Jin
    Nie, Feiping
    Huang, Heng
    Ding, Chris
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2014, 8 (03)
  • [33] LIBNMF - A LIBRARY FOR NONNEGATIVE MATRIX FACTORIZATION
    Janecek, Andreas
    Grotthoff, Stefan Schulze
    Gansterer, Wilfried N.
    COMPUTING AND INFORMATICS, 2011, 30 (02) : 205 - 224
  • [34] Localized user-driven topic discovery via boosted ensemble of nonnegative matrix factorization
    Suh, Sangho
    Shin, Sungbok
    Lee, Joonseok
    Reddy, Chandan K.
    Choo, Jaegul
    KNOWLEDGE AND INFORMATION SYSTEMS, 2018, 56 (03) : 503 - 531
  • [35] Sparse Separable Nonnegative Matrix Factorization
    Nadisic, Nicolas
    Vandaele, Arnaud
    Cohen, Jeremy E.
    Gillis, Nicolas
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2020, PT I, 2021, 12457 : 335 - 350
  • [36] A survey of deep nonnegative matrix factorization
    Chen, Wen-Sheng
    Zeng, Qianwen
    Pan, Binbin
    NEUROCOMPUTING, 2022, 491 : 305 - 320
  • [37] Multiview clustering via consistent and specific nonnegative matrix factorization with graph regularization
    Xu, Haixia
    Gong, Limin
    Xuan, Haizhen
    Zheng, Xusheng
    Gao, Zan
    Wen, Xianbing
    MULTIMEDIA SYSTEMS, 2022, 28 (05) : 1559 - 1572
  • [38] Simultaneous Discovery of Common and Discriminative Topics via Joint Nonnegative Matrix Factorization
    Kim, Hannah
    Choo, Jaegul
    Kim, Jingu
    Reddy, Chandan K.
    Park, Haesun
    KDD'15: PROCEEDINGS OF THE 21ST ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2015, : 567 - 576
  • [39] A Provably Correct and Robust Algorithm for Convolutive Nonnegative Matrix Factorization
    Degleris, Anthony
    Gillis, Nicolas
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2020, 68 (2499-2512) : 2499 - 2512
  • [40] Distributionally Robust and Multi-Objective Nonnegative Matrix Factorization
    Gillis, Nicolas
    Le Thi Khanh Hien
    Leplat, Valentin
    Tan, Vincent Y. F.
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (08) : 4052 - 4064