Feature Extraction of High-dimensional Data Based on J-HOSVD for Cyber-Physical-Social Systems

Cited by: 3
Authors
Gao, Yuan [1]
Yang, Laurence T. [1,2]
Zhao, Yaliang [3]
Yang, Jing [1,2]
Affiliations
[1] Huazhong Univ Sci & Technol, Sch Comp Sci & Technol, Wuhan 430074, Peoples R China
[2] St Francis Xavier Univ, Dept Comp Sci, Antigonish, NS B2G 2W5, Canada
[3] Henan Univ, Sch Comp & Informat Engn, Henan Key Lab Big Data Anal & Proc, Kaifeng 475004, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Data mining; feature extraction; dimensionality reduction; joint tensor decomposition; big data; Cyber-Physical-Social systems; TENSOR; DECOMPOSITIONS; REDUCTION;
DOI
10.1145/3483448
Chinese Library Classification number
TP [Automation technology, computer technology]
Subject classification code
0812
Abstract
As Cyber-Physical-Social systems (CPSSs) become more tightly integrated, the volume of data they generate is growing explosively. Discovering useful information and knowledge in CPSS big data, and using it to support subsequent learning tasks, has become a core issue. Moreover, modern CPSS applications increasingly rely on processing and analyzing high-dimensional data, whose correlations and internal structure are becoming more complex, so traditional machine learning algorithms are often inadequate for these data. In this article, we propose two general dimension reduction and feature extraction methods for high-dimensional data based on joint tensor decomposition, namely core feature extraction and factor feature extraction. Through joint analysis, both can effectively mine the common components and hidden patterns of high-dimensional data while preserving the original data structure. We verify the effectiveness of the methods from both theoretical and practical perspectives. Furthermore, we extend the two feature extraction methods to the tensor distance scenario and show that the compressed features extracted by our models preserve the global information of the original data well. Finally, we evaluate the proposed methods on two benchmark datasets through classification tasks. The experimental results show that classifying the low-dimensional features extracted by the proposed models yields higher accuracy than directly classifying the original data, which further verifies the effectiveness and robustness of our methods.
Pages: 21