Parallel Non-Negative Matrix Tri-Factorization for Text Data Co-Clustering

被引:14
作者
Chen, Yufu [1 ]
Lei, Zhiqi [1 ]
Rao, Yanghui [1 ]
Xie, Haoran [2 ]
Wang, Fu Lee [3 ]
Yin, Jian [4 ,5 ]
Li, Qing [6 ]
机构
[1] Sun Yat sen Univ, Sch Comp Sci & Engn, Guangzhou 510006, Peoples R China
[2] Lingnan Univ, Dept Comp & Decis Sci, Hong Kong, Peoples R China
[3] Hong Kong Metropolitan Univ, Sch Sci & Technol, Kowloon, Hong Kong, Peoples R China
[4] Sun Yat sen Univ, Sch Artificial Intelligence, Zhuhai 519082, Peoples R China
[5] Sun Yat sen Univ, Guangdong Key Lab Big Data Anal & Proc, Guangzhou 510006, Peoples R China
[6] Hong Kong Polytech Univ, Dept Comp, Kowloon, Hong Kong, Peoples R China
基金
中国国家自然科学基金;
关键词
Matrix decomposition; Computational modeling; Data models; Convergence; Optimization; Scalability; Partitioning algorithms; Non-negative matrix tri-factorization; parallel computing; message passing; Newton iteration; FRAMEWORK; MODEL; ALGORITHMS;
D O I
10.1109/TKDE.2022.3145489
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As a novel paradigm for data mining and dimensionality reduction, Non-negative Matrix Tri-Factorization (NMTF) has attracted much attention due to its notable performance and elegant mathematical derivation, and it has been applied to a plethora of real-world applications, such as text data co-clustering. However, the existing NMTF-based methods usually involve intensive matrix multiplications, which exhibits a major limitation of high computational complexity. With the explosion at both the size and the feature dimension of texts, there is a growing need to develop a parallel and scalable NMTF-based algorithm for text data co-clustering. To this end, we first show in this paper how to theoretically derive the original optimization problem of NMTF by introducing the Lagrangian multipliers. Then, we propose to solve the Lagrange dual objective function in parallel through an efficient distributed implementation. Extensive experiments on five benchmark corpora validate the effectiveness, efficiency, and scalability of our distributed parallel update algorithm for an NMTF-based text data co-clustering method.
引用
收藏
页码:5132 / 5146
页数:15
相关论文
共 50 条
  • [21] Robust capped norm dual hyper-graph regularized non-negative matrix tri-factorization
    Yu, Jiyang
    Pan, Baicheng
    Yu, Shanshan
    Leung, Man-Fai
    MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2023, 20 (07) : 12486 - 12509
  • [22] Graph Regularized Sparse Non-Negative Matrix Factorization for Clustering
    Deng, Ping
    Li, Tianrui
    Wang, Hongjun
    Wang, Dexian
    Horng, Shi-Jinn
    Liu, Rui
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2023, 10 (03) : 910 - 921
  • [23] Semi-supervised non-negative matrix tri-factorization with adaptive neighbors and block-diagonal learning
    Li, Songtao
    Li, Weigang
    Lu, Hao
    Li, Yang
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 121
  • [24] Biased unconstrained non-negative matrix factorization for clustering
    Deng, Ping
    Zhang, Fan
    Li, Tianrui
    Wang, Hongjun
    Horng, Shi-Jinn
    KNOWLEDGE-BASED SYSTEMS, 2022, 239
  • [25] Anomaly-aware symmetric non-negative matrix factorization for short text clustering
    Li, Ximing
    Guan, Yuanyuan
    Fu, Bo
    Luo, Zhongxuan
    KNOWLEDGE AND INFORMATION SYSTEMS, 2025, 67 (02) : 1481 - 1506
  • [26] Non-negative Matrix Factorization for Binary Data
    Larsen, Jacob Sogaard
    Clemmensen, Line Katrine Harder
    2015 7TH INTERNATIONAL JOINT CONFERENCE ON KNOWLEDGE DISCOVERY, KNOWLEDGE ENGINEERING AND KNOWLEDGE MANAGEMENT (IC3K), 2015, : 555 - 563
  • [27] Discriminative semi-supervised non-negative matrix factorization for data clustering
    Xing, Zhiwei
    Wen, Meng
    Peng, Jigen
    Feng, Jinqian
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2021, 103
  • [28] Semi-supervised community detection on attributed networks using non-negative matrix tri-factorization with node popularity
    Jin, Di
    He, Jing
    Chai, Bianfang
    He, Dongxiao
    FRONTIERS OF COMPUTER SCIENCE, 2021, 15 (04)
  • [29] Application of non-negative matrix factorization to LC/MS data
    Rapin, Jeremy
    Souloumiac, Antoine
    Bobin, Jerome
    Larue, Anthony
    Junot, Chistophe
    Ouethrani, Minale
    Starck, Jean-Luc
    SIGNAL PROCESSING, 2016, 123 : 75 - 83
  • [30] Kernel Joint Non-Negative Matrix Factorization for Genomic Data
    Salazar, Diego
    Rios, Juan
    Aceros, Sara
    Florez-Vargas, Oscar
    Valencia, Carlos
    IEEE ACCESS, 2021, 9 : 101863 - 101875