Nonnegative Matrix Factorization Via Archetypal Analysis

被引:7
|
作者
Javadi, Hamid [1 ]
Montanari, Andrea [2 ]
机构
[1] Rice Univ, Dept Elect & Comp Engn, POB 1892, Houston, TX 77005 USA
[2] Stanford Univ, Dept Elect Engn & Stat, Stanford, CA 94305 USA
基金
美国国家科学基金会;
关键词
Dimensionality reduction; Matrix factorization; Separability; ALGORITHMS;
D O I
10.1080/01621459.2019.1594832
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Given a collection of data points, nonnegative matrix factorization (NMF) suggests expressing them as convex combinations of a small set of "archetypes" with nonnegative entries. This decomposition is unique only if the true archetypes are nonnegative and sufficiently sparse (or the weights are sufficiently sparse), a regime that is captured by the separability condition and its generalizations. In this article, we study an approach to NMF that can be traced back to the work of Cutler and Breiman [(1994), "Archetypal Analysis," Technometrics, 36, 338-347] and does not require the data to be separable, while providing a generally unique decomposition. We optimize a trade-off between two objectives: we minimize the distance of the data points from the convex envelope of the archetypes (which can be interpreted as an empirical risk), while also minimizing the distance of the archetypes from the convex envelope of the data (which can be interpreted as a data-dependent regularization). The archetypal analysis method of Cutler and Breiman is recovered as the limiting case in which the last term is given infinite weight. We introduce a "uniqueness condition" on the data which is necessary for identifiability. We prove that, under uniqueness (plus additional regularity conditions on the geometry of the archetypes), our estimator is robust. While our approach requires solving a nonconvex optimization problem, we find that standard optimization methods succeed in finding good solutions for both real and synthetic data. for this article are available online
引用
收藏
页码:896 / 907
页数:12
相关论文
共 50 条
  • [1] Community Detection via Multihop Nonnegative Matrix Factorization
    Guan, Jiewen
    Chen, Bilian
    Huang, Xin
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (07) : 10033 - 10044
  • [2] Boolean Matrix Factorization via Nonnegative Auxiliary Optimization
    Truong, Duc P.
    Skau, Erik
    Desantis, Derek
    Alexandrov, Boian
    IEEE ACCESS, 2021, 9 : 117169 - 117177
  • [3] Generalized Separable Nonnegative Matrix Factorization
    Pan, Junjun
    Gillis, Nicolas
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (05) : 1546 - 1561
  • [4] Adaptive Clustering via Symmetric Nonnegative Matrix Factorization of the Similarity Matrix
    Favati, Paola
    Lotti, Grazia
    Menchi, Ornella
    Romani, Francesco
    ALGORITHMS, 2019, 12 (10)
  • [5] Nonnegative Matrix Factorization: A Comprehensive Review
    Wang, Yu-Xiong
    Zhang, Yu-Jin
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2013, 25 (06) : 1336 - 1353
  • [6] DSANLS: Accelerating Distributed Nonnegative Matrix Factorization via Sketching
    Qian, Yuqiu
    Tan, Conghui
    Mamoulis, Nikos
    Cheung, David W.
    WSDM'18: PROCEEDINGS OF THE ELEVENTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, 2018, : 450 - 458
  • [7] Social Spammer Detection via Convex Nonnegative Matrix Factorization
    Shen, Hua
    Wang, Bangyu
    Liu, Xinyue
    Zhang, Xianchao
    IEEE ACCESS, 2022, 10 : 91192 - 91202
  • [8] Efficient Nonnegative Matrix Factorization via projected Newton method
    Gong, Pinghua
    Zhang, Changshui
    PATTERN RECOGNITION, 2012, 45 (09) : 3557 - 3565
  • [9] Quadratic nonnegative matrix factorization
    Yang, Zhirong
    Oja, Erkki
    PATTERN RECOGNITION, 2012, 45 (04) : 1500 - 1510
  • [10] Elastic Nonnegative Matrix Factorization
    Ballen, Peter
    Guha, Sudipto
    2018 18TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW), 2018, : 1271 - 1278