A High-Order Clustering Algorithm Based on Dropout Deep Learning for Heterogeneous Data in Cyber-Physical-Social Systems

被引:19
作者
Bu, Fanyu [1 ]
机构
[1] Inner Mongolia Univ Finance & Econ, Dept Biomed Informat, Hohhot 010070, Peoples R China
基金
中国国家自然科学基金;
关键词
Cyber-physical-social systems; dropout deep learning model; heterogeneous data; high-order clustering; C-MEANS ALGORITHMS; BIG DATA;
D O I
10.1109/ACCESS.2017.2759509
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
An explosive growth of cyber-physical-social systems has been witnessed owing to the wide use of various mobile devices recently. A large volume of heterogeneous data has been collected from cyber-physical-social systems in the past few years. Each object in the heterogeneous dataset is typically multi-modal, posing a remarkable challenge on heterogeneous data clustering. In this paper, we propose a high-order k-means algorithm based on the dropout deep learning model for clustering heterogeneous objects in cyber-physical-social systems. We first build three dropout stacked auto-encoders, each with three hidden layers to learn the features for the different modalities of each object. Furthermore, we establish a feature tensor for each object by using the vector outer product to fuse the learned features. At last, we devise a tensor k-means algorithm to cluster the heterogeneous objects based on the tensor distance. We evaluate the proposed high-order k-means algorithm on two representative heterogeneous data sets and results imply that the proposed high-order k-means algorithm can achieve more accurate clustering results than other heterogeneous data clustering methods.
引用
收藏
页码:11687 / 11693
页数:7
相关论文
共 27 条
[1]  
[Anonymous], 2008, Advances in neural information processing systems
[2]  
[Anonymous], 2006, P 23 INT C MACHINE L, DOI DOI 10.1145/1143844.1143918
[3]  
[Anonymous], 2009, P ACM INT C IMAGE VI
[4]  
Cai X., 2011, P IEEE C COMP VIS PA, P20
[5]   Big Data Deep Learning: Challenges and Perspectives [J].
Chen, Xue-Wen ;
Lin, Xiaotong .
IEEE ACCESS, 2014, 2 :514-525
[6]   Non-Negative Matrix Factorization for Semisupervised Heterogeneous Data Coclustering [J].
Chen, Yanhua ;
Wang, Lijun ;
Dong, Ming .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2010, 22 (10) :1459-1474
[7]  
Cichocki A., 2013, P INT WORKSH SMART I
[8]  
Gao B., 2005, P 11 INT C KNOWLEDGE, P41, DOI DOI 10.1145/1081870.1081879
[9]   Fuzzy c-Means Algorithms for Very Large Data [J].
Havens, Timothy C. ;
Bezdek, James C. ;
Leckie, Christopher ;
Hall, Lawrence O. ;
Palaniswami, Marimuthu .
IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2012, 20 (06) :1130-1146
[10]   Mobile Phone Computing and the Internet of Things: A Survey [J].
Kamilaris, Andreas ;
Pitsillides, Andreas .
IEEE INTERNET OF THINGS JOURNAL, 2016, 3 (06) :885-898