ORKM: Online regularized K-means clustering for online multi-view data

Cited by: 5
Authors
Guo, Guangbao [1]
Yu, Miao [1]
Qian, Guoqi [2]
Affiliations
[1] Shandong Univ Technol, Sch Math & Stat, Zibo, Peoples R China
[2] Univ Melbourne, Sch Math & Stat, Melbourne, Australia
Keywords
Projected gradient descent; Online clustering; Textual data; Matrix factorization; SELECTION; FRAMEWORK; FUSION
DOI
10.1016/j.ins.2024.121133
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Data generated from different sources are sometimes called multi-view data, or online multi-view data when a time dimension is involved in generating them. This paper addresses the clustering of online multi-view data, where overfitting and computational cost are the main challenges. We propose an Online Regularized K-Means Clustering (ORKMC) method to tackle them. Specifically, we use a matrix factorization strategy to identify the cluster indicator matrix and the cluster mean matrix for all generated data points; the strategy includes a clustering-complexity regularization term to curb possible overfitting or overclustering. To reduce computational cost, we propose an online update step in which clustering is performed only on the latest view data at each update. Through a simulation study and the analysis of two real-world data sets, we show that ORKMC outperforms widely used clustering methods in both clustering accuracy and computational efficiency. Finally, we provide an R package, ORKM, that implements ORKMC.
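The factorization view of K-means mentioned in the abstract can be illustrated with a minimal sketch. This is not the ORKMC algorithm and not the API of the ORKM package: the function name `online_ridge_kmeans`, the ridge penalty `lam` (an assumed stand-in for the paper's complexity regularizer), and the assumption that every incoming chunk shares one feature dimension are all hypothetical choices made for illustration. The sketch builds a one-hot cluster indicator matrix G for each new chunk, folds only that chunk into running sums, and re-solves for the cluster mean matrix F in closed form.

```python
import numpy as np


def online_ridge_kmeans(chunks, init_centroids, lam=1.0):
    """Cluster chunks one at a time, keeping only running sums in memory.

    Hypothetical helper: `lam` is a ridge penalty on the centroid matrix,
    an assumed stand-in for the paper's complexity regularizer.
    """
    F = np.asarray(init_centroids, dtype=float).copy()  # k x d cluster mean matrix
    k = F.shape[0]
    sums = np.zeros_like(F)   # running G^T X
    counts = np.zeros(k)      # running diagonal of G^T G
    labels_per_chunk = []
    for X in chunks:
        X = np.asarray(X, dtype=float)
        # Assignment step: each row of X goes to its nearest current centroid.
        d2 = ((X[:, None, :] - F[None, :, :]) ** 2).sum(axis=2)
        labels = d2.argmin(axis=1)
        G = np.eye(k)[labels]  # n x k one-hot cluster indicator matrix
        # Update step: fold only the newest chunk into the running statistics.
        sums += G.T @ X
        counts += G.sum(axis=0)
        # Closed-form minimiser of sum ||X - G F||^2 + lam * ||F||^2 for the
        # assignments made so far; clusters with no points keep their old row.
        seen = counts > 0
        F[seen] = sums[seen] / (counts[seen] + lam)[:, None]
        labels_per_chunk.append(labels)
    return F, labels_per_chunk


if __name__ == "__main__":
    chunk1 = [[0.0, 0.0], [0.2, 0.0], [10.0, 10.0], [10.2, 10.0]]
    chunk2 = [[0.1, 0.1], [9.9, 10.1]]
    F, labels = online_ridge_kmeans([chunk1, chunk2], [[0.0, 0.0], [10.0, 10.0]])
    print(F)
    print([l.tolist() for l in labels])
```

Because only `sums` and `counts` persist between chunks, earlier data never need to be revisited, which is the general idea behind an online update. Larger values of `lam` shrink the cluster means toward the origin.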
Pages: 25