Biased unconstrained non-negative matrix factorization for clustering

被引:8
作者
Deng, Ping [1 ]
Zhang, Fan [1 ]
Li, Tianrui [1 ,3 ,4 ]
Wang, Hongjun [1 ]
Horng, Shi-Jinn [2 ]
机构
[1] Southwest Jiaotong Univ, Sch Comp & Artificial Intelligence, Chengdu 611756, Peoples R China
[2] Natl Taiwan Univ Sci & Technol, Dept Comp Sci & Informat Engn, Taipei 106, Taiwan
[3] Southwest Jiaotong Univ, Natl Engn Lab Integrated Transportat Big Data App, Chengdu 611756, Peoples R China
[4] Mfg Ind Chains Collaborat & Informat Support Tech, Chengdu 611756, Peoples R China
基金
美国国家科学基金会;
关键词
Non-negative matrix factorization; Unconstrained regularization; Stochastic gradient descent; Clustering; SIGMOID FUNCTION; ALGORITHMS;
D O I
10.1016/j.knosys.2021.108040
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Clustering remains a challenging research hotspot in data mining. Non-negative matrix factorization (NMF) is an effective technique for clustering, which aims to find the product of two non-negative low-dimensional matrices that approximates the original matrix. Since the matrices must satisfy the non-negative constraints, the Karush-Kuhn-Tucker conditions need to be used to obtain the update rules for the matrices, which limits the choice of update methods. Moreover, this method has no learning rate and the updating process is completely dependent on the data itself. In addition, the two low-dimensional matrices in NMF are randomly initialized, and the clustering performance of the model is reduced. To address these problems, this paper proposes a biased unconstrained non negative matrix factorization (BUNMF) model, which integrates the l2 norm and adds bias. Specifically, BUNMF uses a non-linear activation function to make elements of the matrices to remain non negative, and converts the constrained problem into an unconstrained problem. The matrices are renewed by sequentially updating the matrices' elements using stochastic gradient descent to obtain an update rule with a learning rate. Furthermore, the BUNMF model is constructed by three different activation functions and their iteration update algorithms are given through detailed reasoning. Finally, experimental results on eight public datasets show the effectiveness of the proposed model. (C) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页数:16
相关论文
共 67 条
  • [1] A Block Coordinate Descent-Based Projected Gradient Algorithm for Orthogonal Non-Negative Matrix Factorization
    Asadi, Soodabeh
    Povh, Janez
    [J]. MATHEMATICS, 2021, 9 (05) : 1 - 22
  • [2] Balakrishnama S., 1998, Linear discriminant analysis‐a brief tutorial, V18, P1
  • [3] Algorithms and applications for approximate nonnegative matrix factorization
    Berry, Michael W.
    Browne, Murray
    Langville, Amy N.
    Pauca, V. Paul
    Plemmons, Robert J.
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2007, 52 (01) : 155 - 173
  • [4] Graph Regularized Nonnegative Matrix Factorization for Data Representation
    Cai, Deng
    He, Xiaofei
    Han, Jiawei
    Huang, Thomas S.
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2011, 33 (08) : 1548 - 1560
  • [5] Non-negative Matrix Factorization on Manifold
    Cai, Deng
    He, Xiaofei
    Wu, Xiaoyun
    Han, Jiawei
    [J]. ICDM 2008: EIGHTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2008, : 63 - +
  • [6] Chen D., 2010, BIRTH NUMERICAL ANAL, P109
  • [7] Algorithms for Orthogonal Nonnegative Matrix Factorization
    Choi, Seungjin
    [J]. 2008 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-8, 2008, : 1828 - 1832
  • [8] Supporting healthcare management decisions via robust clustering of event logs
    Delias, Pavlos
    Doumpos, Michael
    Grigoroudis, Evangelos
    Manolitzas, Panagiotis
    Matsatsinis, Nikolaos
    [J]. KNOWLEDGE-BASED SYSTEMS, 2015, 84 : 203 - 213
  • [9] QUASI-NEWTON METHODS, MOTIVATION AND THEORY
    DENNIS, JE
    MORE, JJ
    [J]. SIAM REVIEW, 1977, 19 (01) : 46 - 89
  • [10] Ding C., 2006, P 12 ACM SIGKDD INT, DOI DOI 10.1145/1150402.1150420